Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.00526
Cited By
v1
v2 (latest)
A Compression-Compilation Framework for On-mobile Real-time BERT Applications
30 May 2021
Wei Niu
Zhenglun Kong
Geng Yuan
Weiwen Jiang
Jiexiong Guan
Caiwen Ding
Pu Zhao
Sijia Liu
Bin Ren
Yanzhi Wang
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Compression-Compilation Framework for On-mobile Real-time BERT Applications"
2 / 2 papers shown
Title
Quantized Transformer Language Model Implementations on Edge Devices
Mohammad Wali Ur Rahman
Murad Mehrab Abrar
Hunter Gibbons Copening
Salim Hariri
Sicong Shao
Pratik Satam
Soheil Salehi
MQ
68
11
0
06 Oct 2023
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
Zhenglun Kong
Zhaoyang Han
Xiaolong Ma
Xin Meng
Mengshu Sun
...
Geng Yuan
Bin Ren
Minghai Qin
Hao Tang
Yanzhi Wang
ViT
93
154
0
27 Dec 2021
1