最近在做近存储计算方面的调研,主要关注RRAM在近存储计算方面的发展情况和前景,这个系列是阅读笔记,比较杂乱。系统些的小结会放到其他栏目中

Near-memory computing: Past, present, and future
Abstract

The conventional approach of moving data to the CPU for computation has become a significant performance bottleneck for emerging scale-out data-intensive applications due to their limited data reuse.

The advancement in 3D integration technologies has made the decade-old concept of coupling compute units close to the memory - called near-memory computing (NMC) - more viable.

1. Introduction

memory wall: memory technology has not been able to keep up with advancements in processor technology in terms of latency and energy consumption.

memory hierarchies

Dennard scaling ???

applocations exhibit massive data parallelism and low operational intensity with a limited locality.

NMC: aims at processing close to where the data resides.

three-dimensional stacking: true enabler of precessing close to the memory. Allows the stacking of logic and memory together using through silicon via’s (TSV) that helps in reducing memory access latency, power consumption and provides much higher bandwidth.

competing products in the 3D memory arena:

  • Micron: Hybrid Memory Cube (HMC)

  • AMD and Hynix: High Bandwidth Memory (HBM)

  • Samsumg: Wide I/O

paper structure:

  • section 2: a historical overview of near-memory computing and related work
  • section 3, 4, 5: evaluation and classification scheme for NMC at main memory
  • section 6: challenges with cache coherence, virtual memory, the lack of programming models, and data mapping schemes for NMC.
  • section 7: tools and techiniques used to perform the design space exploration for these systems.
  • section 8: their approach to near-memory computing.
  • section 9: the lessons learned and future research directions
2. Background and related work

Idea: 1960s

First appearance: 1990s, Vector IRAM (VIRAM)

the reason of resurgence:

  • techonological advancements in 3D and 2.5D stacking (blends logic and memory in the same package)
  • sidestepping the performance and energy buttlenecks
  • advent of modern data-intensive applications

manifested with names: processing-in memory(PIM), near-data processing(NDP), near-memory processing (NMP), in-storage processing(ISP)

3. Classification and evaluation

Introduces the classification and evaluation metrics that are used in Section 4 & 5.

  • Memory - The decision of using what kind of memory
  • Processing - Processing unit implemented, and the granularity of processing it
  • Tool - Tool infrastructure
  • Interoperability(互用性)
  • Application

感觉前两个是用来分类的,这些都可以用来评价。

4. Processing near main memory

使用programmable processing unit的有若干,使用fixed-function processing unit的也有若干。

5. Processing near storage class memory

分为Fixed-function unit, programmable unit和 re-configurable unit

6. Challenges of near-memory computing
  • Virtual memory support
  • Cache coherence
  • Programming Model
  • Data mapping
7. Design space exploration for NMC
8. Practicalities of near-memory computing
An 8-bit RRAM based Multiplier for Hybrid Memory Computing
0. Abstract

Hybrid-memory computing can reconcile the speed of in-memory computing and the power of near-memory. The simulation results show that the proposed RRAM based multiplier achieves a calculation speed of 1.6us and 1.32 mW from 1V power supply.

1. Introduction

In a hybrid-memory computing process, the data will be processed in memory firstly

and then in the circuit near memory.

In-Memory Computing: the next generation AI computing paradigm
IMC Based on Emerging NVM

NVM is demonstrated not only to serve as memory macro, but also in the computing operations, such as matrix-matrix, matrix-vector multiplication, convolution, etc.

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-QEsFCJHq-1625797501155)(C:/Users/Lucky-H6/AppData/Roaming/Typora/typora-user-images/image-20210705171945429.png)]

Flash Memory-Based

Inherently analog.

advantages: mature fabrication process, ready for large-volume integration.

Yet, commercial flash memories are optimized for digital storage applications and require circuit-level physical design modifications to enable high-precision analog threshold voltage tuning. Its incompatibility with state-of-the-art standard logic CMOS technology in terms of technology nodes and operation voltage levels also limits the application and the level of integration.

ReRAM-based

none-volatile memory with near-zero leakage energy and high density. Moreover, with its crossbar array structure, ReRAM can perform the matrix-vector multiplication efficiently. ReRAM can perform MAC operations efficiently in a crossbar structure and has been widely studied to represent synapses in the neural network computation.

Compare to SRAM and DRAM, ReRAM has high write energy and latency, which increases the overall power consumption and decreases the weight updating speed.

STT-MRAM-Based

Spin-Toque Transfer Magnetic Random Access Memory are promising substitutes for SRAM to reduce standby power. Among different types of NVMs, STT-MRAM is the most preferable due to its unique characteristics like high density and near-zero power leakage. Faces some distinct relability challenges involving storage failures.

PCM-Based

A Phase-Change Memory device can be programmed to a certain desired resistance value through iterative programming by applying several pulses in a closed-loop manner.

RRAM/ Near Memory Computing (NMC) Survey - Reading Notes 0705相关推荐

  1. RRAM/ Near Memory Computing (NMC) Survey - Reading Notes 0707

    Reading Notes of Resistive Random Access Memory – Day 2 Chapter 3 RRAM Characterization and Modeling ...

  2. Code Style Guidelines for Contributors Reading Notes

    Reading Notes: 1.You must handle every Exception in your code in some principled way. (if you are co ...

  3. in memory computing 存内计算是学术圈自娱自乐还是真有价值?

    Python微信订餐小程序课程视频 https://edu.csdn.net/course/detail/36074 Python实战量化交易理财系统 https://edu.csdn.net/cou ...

  4. 《深度学习之TensorFlow》reading notes(3)—— MNIST手写数字识别之二

    文章目录 模型保存 模型读取 测试模型 搭建测试模型 使用模型 模型可视化 本文是在上一篇文章 <深度学习之TensorFlow>reading notes(2)-- MNIST手写数字识 ...

  5. In Memory Computing(存内计算、存算一体、内存内计算)

    什么是In Memory Computing(存内计算.存算一体.内存内计算)? In-memory Computing 技术就是以 RAM 取代 hard disk ,将 data 与 CPU 之间 ...

  6. 论文解读:Critical Security Issues in Cloud Computing: A Survey (2018)

    参考论文: [1]X. Sun, "Critical Security Issues in Cloud Computing: A Survey," 2018 IEEE 4th In ...

  7. A Survey of Zero-Shot Learning: Settings, Methods, and Applications [reading notes]

    原文链接:https://joselynzhao.top/2019/04/15/A-Survey-of-Zero-Shot-Learning_-Settings,-Methods,-and-Appli ...

  8. Interviewing at Amazon — Leadership Principles Reading Notes

    Reference: Interviewing at Amazon - Leadership Principles 这里面有好多大义凛然的话我们可以去说.基本上句句经典. In summary, wh ...

  9. 「MICCAI 2017」Reading Notes

    Sina Weibo:东莞小锋子Sexyphone Tencent E-mail:403568338@qq.com http://blog.csdn.net/dgyuanshaofeng/articl ...

最新文章

  1. (操作系统)实验二 作业调度
  2. Linux开机启动顺序
  3. cortex-M3与ARM7的比较
  4. swift选择类或结构体
  5. GitHub上读北大:覆盖AI高数等130多门课,讲义考题答案全都有,标星已3k+
  6. python遍历目录压缩文件夹_Python实现多级目录压缩与解压文件的方法
  7. python中的位置怎么看_如何知道项目在Python有序字典中的位置
  8. File,FileInputStream,FileReader,InputStreamReader,BufferReader 的区别使用
  9. [ckeditor系列]ckeditor 自己写的一个简单的image上传js 运用iframe的ajax上传
  10. 程序员面试金典 - 面试题 16.02. 单词频率(哈希表/Trie树)
  11. (转)淘淘商城系列——发布dubbo服务
  12. 【BZOJ】【1096】【ZJOI2007】仓库建设
  13. Ecliplse安装tomcat插件
  14. 操作系统编写之引导扇区 1
  15. python 拍照搜题_大学慕课2020用Python玩转数据答案搜题公众号
  16. ARRI阿莱MXF修复方法
  17. 羲云社区团购微信小程序多门店版,首页开发
  18. GPS模块运用: GPS模块数据提取、常规参数配置(脉冲频率、输出指定命令、定位模式等)
  19. java 过滤 rtf 图片_忽略WPF RichTextBox中RTF文件中图像的定位
  20. 基于python的opencv图像处理对交通路口的红绿灯进行颜色检测,无人汽车驾驶第一步!

热门文章

  1. 二维码简介_二维码基本概念_二维码基本原理
  2. 阿里有群姑娘,是马老师的师兄,还是逍遥子的学长……
  3. 根据ICCID反查手机号码的五种终极方法
  4. uno牌的玩法图解_UNO优诺纸牌游戏玩法详解 这些经验不可多得
  5. Python AI:如何构建神经网络并进行预测
  6. 网址导航类的网站为什么会没落
  7. 万能注册机下载|万能注册机中文版2016
  8. 【2036】改革春风吹满地
  9. 汉字标点符号unicode
  10. 记录一次hadoop的空间清理