Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.11474
Cited By
RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Design Automation Conference (DAC), 2020
19 February 2020
Zhaoyang Han
Siyue Wang
Wei Niu
Chengming Zhang
Sheng Lin
Hao Sun
Yifan Gong
Bin Ren
Xinyu Lin
Yanzhi Wang
Dingwen Tao
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition"
23 / 23 papers shown
Uncovering Critical Features for Deepfake Detection through the Lottery Ticket Hypothesis
Lisan Al Amin
Md. Ismail Hossain
Thanh Thi Nguyen
Tasnim Jahan
M. Islam
Faisal Quader
225
2
0
21 Jul 2025
Robust Group Anomaly Detection for Quasi-Periodic Network Time Series
IEEE Transactions on Network Science and Engineering (IEEE T-NSE), 2022
Kai Yang
Shaoyu Dou
Pan Luo
Xin Wang
H. Vincent Poor
AI4TS
197
3
0
20 Jun 2025
Pursing the Sparse Limitation of Spiking Deep Learning Structures
Hao-Ran Cheng
Jiahang Cao
Erjia Xiao
Mengshu Sun
Le Yang
Jize Zhang
Xue Lin
B. Kailkhura
Kaidi Xu
Renjing Xu
218
1
0
18 Nov 2023
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
Interspeech (Interspeech), 2023
Yifan Peng
Yui Sudo
Muhammad Shakeel
Shinji Watanabe
304
59
0
28 May 2023
I3D: Transformer architectures with input-dependent dynamic depth for speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yifan Peng
Jaesong Lee
Shinji Watanabe
342
44
0
14 Mar 2023
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yifan Peng
Kwangyoun Kim
Felix Wu
Prashant Sridhar
Shinji Watanabe
230
56
0
27 Feb 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Applied Informatics (AI), 2023
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
392
24
0
16 Feb 2023
HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jinqi Xiao
Chengming Zhang
Yu Gong
Miao Yin
Yang Sui
Lizhi Xiang
Dingwen Tao
Bo Yuan
342
35
0
20 Jan 2023
All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management
Yifan Gong
Zheng Zhan
Pu Zhao
Yushu Wu
Chaoan Wu
Caiwen Ding
Weiwen Jiang
Minghai Qin
Yanzhi Wang
216
9
0
09 Dec 2022
TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition
ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2022
Lizhi Xiang
Miao Yin
Chengming Zhang
Aravind Sukumaran-Rajam
P. Sadayappan
Bo Yuan
Dingwen Tao
3DV
253
10
0
07 Nov 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training
Neural Information Processing Systems (NeurIPS), 2022
Geng Yuan
Yanyu Li
Sheng Li
Zhenglun Kong
Sergey Tulyakov
Xulong Tang
Yanzhi Wang
Jian Ren
330
23
0
22 Sep 2022
SparCL: Sparse Continual Learning on the Edge
Neural Information Processing Systems (NeurIPS), 2022
Zifeng Wang
Zheng Zhan
Yifan Gong
Geng Yuan
Wei Niu
T. Jian
Bin Ren
Stratis Ioannidis
Yanzhi Wang
Jennifer Dy
CLL
365
85
0
20 Sep 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution
European Conference on Computer Vision (ECCV), 2022
Yushu Wu
Yifan Gong
Pu Zhao
Yanyu Li
Zheng Zhan
Wei Niu
Hao Tang
Minghai Qin
Bin Ren
Yanzhi Wang
SupR
MQ
294
36
0
25 Jul 2022
Quantum Neural Network Compression
Zhirui Hu
Zhaoyang Han
Zhepeng Wang
Youzuo Lin
Yanzhi Wang
Weiwen Jiang
GNN
415
40
0
04 Jul 2022
CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework
Xiaofeng Li
Bin Ren
Xipeng Shen
Yanzhi Wang
GNN
166
0
0
21 Jun 2022
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration
Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
...
Sijia Liu
Bin Ren
Xue Lin
Xulong Tang
Yanzhi Wang
198
13
0
22 Nov 2021
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge
Geng Yuan
Xiaolong Ma
Wei Niu
Zhengang Li
Zhenglun Kong
...
Minghai Qin
Bin Ren
Yanzhi Wang
Sijia Liu
Xue Lin
465
116
0
26 Oct 2021
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion
ACM Transactions on Architecture and Code Optimization (TACO) (TACO), 2020
Wei Niu
Jiexiong Guan
Yanzhi Wang
G. Agrawal
Bin Ren
AI4CE
339
213
0
30 Aug 2021
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Wei Niu
Zhengang
Xiaolong Ma
Zhaoyang Han
Gang Zhou
Xuehai Qian
Xue Lin
Yanzhi Wang
Bin Ren
194
23
0
25 Aug 2021
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Zheng Zhan
Yifan Gong
Pu Zhao
Geng Yuan
Wei Niu
...
Malith Jayaweera
David Kaeli
Bin Ren
Xue Lin
Yanzhi Wang
SupR
294
60
0
18 Aug 2021
Achieving Real-Time Object Detection on MobileDevices with Neural Pruning Search
Pu Zhao
Wei Niu
Geng Yuan
Yuxuan Cai
Bin Ren
Yanzhi Wang
Xue Lin
3DPC
157
2
0
28 Jun 2021
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Computer Vision and Pattern Recognition (CVPR), 2020
Zhengang Li
Geng Yuan
Wei Niu
Pu Zhao
Yanyu Li
...
Sijia Liu
Kaiyuan Yang
Bin Ren
Yanzhi Wang
Xue Lin
MQ
433
32
0
01 Dec 2020
ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning
Chengming Zhang
Geng Yuan
Wei Niu
Jiannan Tian
Sian Jin
...
Zhe Jiang
Yanzhi Wang
Bin Ren
Shuaiwen Leon Song
Dingwen Tao
3DV
487
2
0
20 Nov 2020
1
Page 1 of 1