Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.03986
Cited By
SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers
8 April 2023
Alberto Marchisio
David Durà
Maurizio Capra
Maurizio Martina
Guido Masera
Muhammad Shafique
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers"
9 / 9 papers shown
Title
Shrinking the Giant : Quasi-Weightless Transformers for Low Energy Inference
Shashank Nag
Alan T. L. Bacellar
Zachary Susskind
Anshul Jha
Logan Liberty
...
Krishnan Kailas
P. Lima
Neeraja J. Yadwadkar
F. M. G. França
L. John
33
0
0
04 Nov 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
31
1
0
09 May 2024
TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems
Eugenio Ressa
Alberto Marchisio
Maurizio Martina
Guido Masera
Muhammad Shafique
32
0
0
15 Feb 2024
Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks
Zihang Song
Prabodh Katti
Osvaldo Simeone
Bipin Rajendran
16
2
0
14 Feb 2024
BETA: Binarized Energy-Efficient Transformer Accelerator at the Edge
Yuhao Ji
Chao Fang
Zhongfeng Wang
30
3
0
22 Jan 2024
Auxiliary Features-Guided Super Resolution for Monte Carlo Rendering
Qiqi Hou
Feng Liu
SupR
13
4
0
20 Oct 2023
I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
Zhikai Li
Qingyi Gu
MQ
46
94
0
04 Jul 2022
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
86
340
0
05 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,943
0
20 Apr 2018
1