Collaborative Research: FuSe: R3AP: Retunable, Reconfigurable, Racetrack-Memory Acceleration Platform
合作研究:FuSe:R3AP:可重调、可重新配置、赛道内存加速平台
基本信息
- 批准号:2328972
- 负责人:
- 金额:$ 59.36万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2024
- 资助国家:美国
- 起止时间:2024-01-01 至 2026-12-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
In traditional Von Neumann computing systems, a significant bottleneck arises because the data transfer speed to and from the computing units has considerably fallen behind capacity, processing speed, and efficiency. To mitigate this bottleneck by bridging the gap between storage and computation, many innovative storage technologies have been introduced, along with near- and in-memory processing solutions designed for both emerging and traditional memory systems. Nonetheless, a considerable challenge remains: the prototyping and characterization of actual fabricated systems, especially those encompassing both mature technologies and cutting-edge technologies. To overcome this challenge, this project develops a cutting-edge Retunable and Reconfigurable Acceleration Platform (R3AP) based on emerging racetrack memory, leveraging a device-architecture-application co-design approach. The standout features of R3AP include its ability to function as a reconfigurable logic, a processing-in-memory (PIM) accelerator, and a high-density memory storage. It is retunable, meaning it can operate with bit-wise, integer, and floating-point precision, and can simulate analog-like storage and processing. R3AP effectively mitigates data movement inefficiencies while offering domain-specific acceleration and adaptability. With its dense, reliable, energy-efficient, and ultra-low latency computational capability, R3AP has the potential to revolutionize the storage and processing capabilities of future computing systems, such as those in Internet of Things (IoT) and Cyber-Physical Systems (CPS). It can also be applied to high-performance and cloud computing systems. The project's findings are shared through publications, workshops, design contests, tutorials, industrial courses, and technology transfer activities. Educational resources and outreach activity plans are made available on the project website, and software artifacts are released on GitHub.To realize R3AP, the project comprises a series of interrelated research tasks spanning multiple system layers. At the device level, the project integrates the voltage-controlled skyrmion motion mechanism with the industrial-grade 8-inch wafer magnetic tunneling junction stack and demonstrates a fully functional Skyrmion racetrack memory (SRTM), including the formation, shifting, and detection of the skyrmion stream. Additionally, it evaluates the performance of SRTM, focusing on aspects such as write-error-rate, shift-error-rate, read-error-rate, operation speed, and energy consumption. It also addresses and mitigates non-idealities, such as the pinning effect, and goes on to develop and demonstrate CMOS-integrated SRTM. On the architecture and circuit layers, the project involves the creation of a mutable lookup table, compute, and memory unit. This unit performs like multi-context Field-Programmable Gate Array (FPGA) logic, parallel PIM logic, massively parallel accumulators, and analog-like storage and compute structures, leveraging the unique properties of SRTM. This layer ensures high-speed memory access from a hierarchy consisting of banks, subarrays, tiles, etc., and further adds links via configurable switch boxes and a mesh-based network-on-chip to enable data movement operations for PIM that would otherwise be challenging. At the application layer, the project develops novel modeling, analysis, design space exploration, and runtime adjustment techniques to exploit the high degree of reconfigurability provided by R3AP. The goal is to adapt future IoT and CPS applications to changing environments and requirements, optimize resource usage, withstand external disturbances, and enhance overall system performance, resilience, and sustainability. Across all these layers, the project develops a scalable computer-aided design (CAD) flow. This involves a multi-level intermediate representation-based compilation flow, which can compile high-level description languages such as PyTorch and C/C++ into binaries for the R3AP device. This flow uses a multi-level hierarchy including front-end, middle-end, and back-end compilation of the designs, and abstracts various optimization and management problems to a suitable level for efficient resolution.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在传统的冯·诺依曼计算系统中,由于往返于计算单元的数据传输速度已经大大落后于容量、处理速度和效率,因此出现了显著的瓶颈。为了通过弥合存储和计算之间的差距来缓解这一瓶颈,人们引入了许多创新的存储技术,以及为新兴和传统存储系统设计的近内存和内存中处理解决方案。沿着。尽管如此,仍然存在相当大的挑战:实际制造系统的原型设计和表征,特别是那些包含成熟技术和尖端技术的系统。为了克服这一挑战,该项目基于新兴的赛道存储器开发了一个先进的可重新调整和可重新配置的加速平台(R3 AP),利用设备-架构-应用协同设计方法。R3 AP的突出功能包括其作为可重新配置逻辑、内存处理(PIM)加速器和高密度内存存储的能力。它是可重调的,这意味着它可以按位、整数和浮点精度操作,并且可以模拟类似于模拟的存储和处理。R3 AP有效地降低了数据移动效率,同时提供特定于域的加速和适应性。凭借其密集、可靠、节能和超低延迟的计算能力,R3 AP有可能彻底改变未来计算系统的存储和处理能力,例如物联网(IoT)和网络物理系统(CPS)。它也可以应用于高性能和云计算系统。通过出版物、讲习班、设计竞赛、教程、工业课程和技术转让活动分享该项目的成果。教育资源和推广活动计划在项目网站上提供,软件产品在GitHub上发布。为了实现R3 AP,该项目包括一系列跨越多个系统层的相互关联的研究任务。在器件层面,该项目将电压控制的Skyrmion运动机制与工业级8英寸晶圆磁性隧道结堆栈集成在一起,并展示了一个功能齐全的Skyrmion赛道存储器(SRTM),包括Skyrmion流的形成、移动和检测。此外,它评估SRTM的性能,重点是写错误率,移位错误率,读错误率,操作速度和能耗等方面。它还解决和减轻非理想性,如钉扎效应,并继续开发和演示CMOS集成SRTM。在架构和电路层,该项目涉及到创建一个可变的查找表,计算和内存单元。该单元的性能类似于多上下文现场可编程门阵列(FPGA)逻辑、并行PIM逻辑、大规模并行FPGA以及类似模拟的存储和计算结构,充分利用了SRTM的独特属性。这一层确保了从由存储体、子阵列、瓦片等组成的层次结构进行高速存储器访问,并且进一步经由可配置的开关盒和基于网格的片上网络添加链路,以实现PIM的数据移动操作,否则这将是具有挑战性的。在应用层,该项目开发了新的建模、分析、设计空间探索和运行时调整技术,以利用R3 AP提供的高度可重构性。其目标是使未来的物联网和CPS应用适应不断变化的环境和要求,优化资源使用,抵御外部干扰,并增强整体系统性能,弹性和可持续性。在所有这些层面上,该项目开发了一个可扩展的计算机辅助设计(CAD)流程。这涉及到一个基于多级中间表示的编译流程,可以将PyTorch和C/C++等高级描述语言编译成R3 AP设备的二进制文件。该流程采用多层次结构,包括前端、中间端和后端的设计编译,并将各种优化和管理问题抽象到适当的层次,以有效解决问题。该奖项反映了NSF的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(3)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Challenges and Opportunities to Enable Large-Scale Computing via Heterogeneous Chiplets
- DOI:10.1109/asp-dac58780.2024.10473961
- 发表时间:2023-11
- 期刊:
- 影响因子:0
- 作者:Zhuoping Yang;Shixin Ji;Xingzhen Chen;Jinming Zhuang;Weifeng Zhang;Dharmesh Jani;Peipei Zhou
- 通讯作者:Zhuoping Yang;Shixin Ji;Xingzhen Chen;Jinming Zhuang;Weifeng Zhang;Dharmesh Jani;Peipei Zhou
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration
- DOI:10.1145/3626202.3637569
- 发表时间:2024-01
- 期刊:
- 影响因子:0
- 作者:Jinming Zhuang;Zhuoping Yang;Shixin Ji;Heng Huang;Alex K. Jones;Jingtong Hu;Yiyu Shi;Peipei Zhou
- 通讯作者:Jinming Zhuang;Zhuoping Yang;Shixin Ji;Heng Huang;Alex K. Jones;Jingtong Hu;Yiyu Shi;Peipei Zhou
AIM: Accelerating Arbitrary-Precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP
目的:在异构可重构计算平台 Versal ACAP 上加速任意精度整数乘法
- DOI:10.1109/iccad57390.2023.10323754
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Yang, Zhuoping;Zhuang, Jinming;Yin, Jiaqi;Yu, Cunxi;Jones, Alex K.;Zhou, Peipei
- 通讯作者:Zhou, Peipei
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Jingtong Hu其他文献
FlexLevel NAND Flash Storage System Design to Reduce LDPC Latency
FlexLevel NAND 闪存存储系统设计可减少 LDPC 延迟
- DOI:
10.1109/tcad.2016.2619480 - 发表时间:
2017-07 - 期刊:
- 影响因子:2.9
- 作者:
Jie Guo;Wujie Wen;Jingtong Hu;王党辉;Hai Lu;Yiran Chen - 通讯作者:
Yiran Chen
Stack-Size Sensitive On-Chip Memory Backup for Self-Powered Nonvolatile Processors
适用于自供电非易失性处理器的堆栈大小敏感片上内存备份
- DOI:
10.1109/tcad.2017.2666606 - 发表时间:
2017-02 - 期刊:
- 影响因子:2.9
- 作者:
Mengying Zhao;Chenchen Fu;Zewei Li;Qing'an Li;Mimi Xie;Yongpan Liu;Jingtong Hu;Zhiping Jia;Chun Jason Xue - 通讯作者:
Chun Jason Xue
Development of A Real-time POCUS Image Quality Assessment and Acquisition Guidance System
实时 POCUS 图像质量评估和采集引导系统的开发
- DOI:
10.48550/arxiv.2212.08624 - 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Zhenge Jia;Yiyu Shi;Jingtong Hu;Lei Yang;B. Nti - 通讯作者:
B. Nti
Algorithm-hardware Co-design of Attention Mechanism on FPGA Devices
FPGA器件上注意力机制的算法-硬件协同设计
- DOI:
10.1145/3477002 - 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
Xinyi Zhang;Yawen Wu;Peipei Zhou;Xulong Tang;Jingtong Hu - 通讯作者:
Jingtong Hu
Learning to Learn Personalized Neural Network for Ventricular Arrhythmias Detection on Intracardiac EGMs
学习学习用于心内 EGM 室性心律失常检测的个性化神经网络
- DOI:
10.24963/ijcai.2021/359 - 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
Zhenge Jia;Zhepeng Wang;Feng Hong;Lichuan Ping;Yiyu Shi;Jingtong Hu - 通讯作者:
Jingtong Hu
Jingtong Hu的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Jingtong Hu', 18)}}的其他基金
Collaborative Research: DESC: Type I: FLEX: Building Future-proof Learning-Enabled Cyber-Physical Systems with Cross-Layer Extensible and Adaptive Design
合作研究:DESC:类型 I:FLEX:通过跨层可扩展和自适应设计构建面向未来的、支持学习的网络物理系统
- 批准号:
2324937 - 财政年份:2024
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Towards Unsupervised Learning on Resource Constrained Edge Devices with Novel Statistical Contrastive Learning Scheme
合作研究:CNS 核心:小型:利用新颖的统计对比学习方案在资源受限的边缘设备上实现无监督学习
- 批准号:
2122320 - 财政年份:2021
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core:Small:IMPERIAL: In-Memory Processing Enhanced Racetrack Inspired by Accessing Laterally
协作研究:CNS Core:Small:IMPERIAL:受横向访问启发的内存处理增强赛道
- 批准号:
2133267 - 财政年份:2021
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
Collaborative Research:CNS Core: Small: Intermittent and Incremental Inference with Statistical Neural Network for Energy-Harvesting Powered Devices
合作研究:CNS 核心:小型:利用统计神经网络对能量收集供电设备进行间歇和增量推理
- 批准号:
2007274 - 财政年份:2020
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
RAPID:Collaborative:Independent Component Analysis Inspired Statistical Neural Networks for 3D CT Scan Based Edge Screening of COVID-19
RAPID:协作:独立成分分析启发的统计神经网络,用于基于 3D CT 扫描的 COVID-19 边缘筛查
- 批准号:
2027546 - 财政年份:2020
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
IRES Track I: International Research Experience for Students on Non-Volatile Processor Based Self-Powered Embedded Systems
IRES Track I:基于非易失性处理器的自供电嵌入式系统学生的国际研究经验
- 批准号:
1827009 - 财政年份:2018
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
SHF: Small: Collaborative Research: Multi-level Non-volatile FPGA Synthesis to Empower Efficient Self-adaptive System Implementations
SHF:小型:协作研究:多级非易失性 FPGA 综合,实现高效自适应系统
- 批准号:
1820537 - 财政年份:2017
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
CRII: CSR: Enabling Efficient Non-Volatile Processors on Energy Harvesting Powered Embedded Systems
CRII:CSR:在能量收集供电的嵌入式系统上启用高效的非易失性处理器
- 批准号:
1830891 - 财政年份:2017
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
SHF: Small: Collaborative Research: Multi-level Non-volatile FPGA Synthesis to Empower Efficient Self-adaptive System Implementations
SHF:小型:协作研究:多级非易失性 FPGA 综合,实现高效自适应系统
- 批准号:
1527506 - 财政年份:2015
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
CRII: CSR: Enabling Efficient Non-Volatile Processors on Energy Harvesting Powered Embedded Systems
CRII:CSR:在能量收集供电的嵌入式系统上启用高效的非易失性处理器
- 批准号:
1464429 - 财政年份:2015
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant
相似国自然基金
复杂电子产品超精密加工及检测关键技术研究与应用
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
基于合成生物学的动物底盘品种优化及中试应用研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
运用组学整合技术探索萆薢分清散联合化疗治疗晚期胰腺癌的临床研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
九里香等提取物多靶向制剂抗肺癌的作用及机制研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
升血小板方治疗原发免疫性血小板减少症的临床研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
八髎穴微波热疗在女性膀胱过度活动症治疗中的价值研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
基于 miR-455-5p 介导的氧化应激机制探讨糖尿病视网膜病变中医分型治疗的临床研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
基于 UPLC-Q-TOF-MS/MS 分析的 异功散活性成分评价及提取工艺研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
无创电针对于痉挛型双瘫脑 瘫患儿的有效性与安全性研究:一项随机 单盲前瞻性队列研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
弹压式手法与体外冲击波治疗肱骨外上髁炎的对比研究
- 批准号:
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
相似海外基金
Collaborative Research: FuSe: R3AP: Retunable, Reconfigurable, Racetrack-Memory Acceleration Platform
合作研究:FuSe:R3AP:可重调、可重新配置、赛道内存加速平台
- 批准号:
2328975 - 财政年份:2024
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: R3AP: Retunable, Reconfigurable, Racetrack-Memory Acceleration Platform
合作研究:FuSe:R3AP:可重调、可重新配置、赛道内存加速平台
- 批准号:
2328973 - 财政年份:2024
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: R3AP: Retunable, Reconfigurable, Racetrack-Memory Acceleration Platform
合作研究:FuSe:R3AP:可重调、可重新配置、赛道内存加速平台
- 批准号:
2328974 - 财政年份:2024
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: Indium selenides based back end of line neuromorphic accelerators
合作研究:FuSe:基于硒化铟的后端神经形态加速器
- 批准号:
2328741 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: Interconnects with Co-Designed Materials, Topology, and Wire Architecture
合作研究:FuSe:与共同设计的材料、拓扑和线路架构互连
- 批准号:
2328906 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: Interconnects with Co-Designed Materials, Topology, and Wire Architecture
合作研究:FuSe:与共同设计的材料、拓扑和线路架构互连
- 批准号:
2328908 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: Collaborative Optically Disaggregated Arrays of Extreme-MIMO Radio Units (CODAeMIMO)
合作研究:FuSe:Extreme-MIMO 无线电单元的协作光学分解阵列 (CODAeMIMO)
- 批准号:
2328947 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
FuSe/Collaborative Research: Heterogeneous Integration in Power Electronics for High-Performance Computing (HIPE-HPC)
FuSe/合作研究:用于高性能计算的电力电子异构集成 (HIPE-HPC)
- 批准号:
2329063 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: High-throughput Discovery of Phase Change Materials for Co-designed Electronic and Optical Computational Devices (PHACEO)
合作研究:FuSe:用于共同设计的电子和光学计算设备的相变材料的高通量发现(PHACEO)
- 批准号:
2329087 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Continuing Grant
Collaborative Research: FuSe: Monolithic 3D Integration (M3D) of 2D Materials-Based CFET Logic Elements towards Advanced Microelectronics
合作研究:FuSe:面向先进微电子学的基于 2D 材料的 CFET 逻辑元件的单片 3D 集成 (M3D)
- 批准号:
2329189 - 财政年份:2023
- 资助金额:
$ 59.36万 - 项目类别:
Standard Grant