2106.09144
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator
16 June 2021
Geng Yuan
Payman Behnam
Zhengang Li
Ali Shafiee
Sheng Lin
Xiaolong Ma
Hang Liu
Xuehai Qian
M. N. Bojnordi
Yanzhi Wang
Caiwen Ding
Papers citing
"FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator"
12 / 12 papers shown
RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
Jun Liu
Zhenglun Kong
Peiyan Dong
Changdi Yang
Xuan Shen
...
Wei Niu
Wenbin Zhang
Xue Lin
Dong Huang
Yanzhi Wang
ALM
36
2
0
08 Jan 2025
CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
Songyun Qu
Shixin Zhao
Bing Li
Yintao He
Xuyi Cai
Lei Zhang
Ying Wang
16
4
0
23 Jan 2024
Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM
Bingbing Li
Geng Yuan
Zigeng Wang
Shaoyi Huang
Hongwu Peng
Payman Behnam
Wujie Wen
Hang Liu
Caiwen Ding
14
5
0
22 Jan 2024
RACE-IT: A Reconfigurable Analog CAM-Crossbar Engine for In-Memory Transformer Acceleration
Lei Zhao
Luca Buonanno
Ron M. Roth
Sergey Serebryakov
Archit Gajjar
John Moon
Jim Ignowski
Giacomo Pedretti
23
3
0
29 Nov 2023
Subgraph Stationary Hardware-Software Inference Co-Design
Payman Behnam
Jianming Tong
Alind Khare
Yang Chen
Yue Pan
Pranav Gadikar
A. Bambhaniya
T. Krishna
Alexey Tumanov
17
3
0
21 Jun 2023
Sparse Attention Acceleration with Synergistic In-Memory Pruning and On-Chip Recomputation
Amir Yazdanbakhsh
Ashkan Moradifirouzabadi
Zheng Li
Mingu Kang
19
31
0
01 Sep 2022
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
14
54
0
30 Aug 2022
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Geraldo F. Oliveira
Amirali Boroumand
Saugata Ghose
Juan Gómez Luna
O. Mutlu
26
7
0
29 May 2022
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems
Christina Giannoula
Ivan Fernandez
Juan Gómez Luna
N. Koziris
G. Goumas
O. Mutlu
MoE
16
26
0
13 Jan 2022
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration
Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
...
Sijia Liu
Bin Ren
Xue Lin
Xulong Tang
Yanzhi Wang
20
10
0
22 Nov 2021
Resistive Neural Hardware Accelerators
Kamilya Smagulova
M. Fouda
Fadi J. Kurdahi
K. Salama
A. Eltawil
23
11
0
08 Sep 2021
On the Accuracy of Analog Neural Network Inference Accelerators
T. Xiao
Ben Feinberg
C. Bennett
V. Prabhakar
Prashant Saxena
V. Agrawal
S. Agarwal
M. Marinella
20
32
0
03 Sep 2021