RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition

Design Automation Conference (DAC), 2020

19 February 2020

Dingwen Tao

Papers citing "RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition"

23 / 23 papers shown

Uncovering Critical Features for Deepfake Detection through the Lottery Ticket Hypothesis

225

21 Jul 2025

Robust Group Anomaly Detection for Quasi-Periodic Network Time Series

IEEE Transactions on Network Science and Engineering (IEEE T-NSE), 2022

197

20 Jun 2025

Pursing the Sparse Limitation of Spiking Deep Learning Structures

Jiahang Cao

Renjing Xu

218

18 Nov 2023

DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models

Interspeech (Interspeech), 2023

304

28 May 2023

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yifan Peng

Jaesong Lee

Shinji Watanabe

342

14 Mar 2023

Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Kwangyoun Kim

230

27 Feb 2023

A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques

Applied Informatics (AI), 2023

Wenbin Li

Hakim Hacid

Ebtesam Almazrouei

Merouane Debbah

392

16 Feb 2023

HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks

AAAI Conference on Artificial Intelligence (AAAI), 2023

Dingwen Tao

342

20 Jan 2023

All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management

Caiwen Ding

216

09 Dec 2022

TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition

ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2022

Lizhi Xiang

Miao Yin

Chengming Zhang

Aravind Sukumaran-Rajam

P. Sadayappan

Bo Yuan

Dingwen Tao

3DV

253

07 Nov 2022

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training

Neural Information Processing Systems (NeurIPS), 2022

330

22 Sep 2022

SparCL: Sparse Continual Learning on the Edge

Neural Information Processing Systems (NeurIPS), 2022

Jennifer Dy

365

20 Sep 2022

Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution

European Conference on Computer Vision (ECCV), 2022

294

25 Jul 2022

Quantum Neural Network Compression

415

04 Jul 2022

CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework

166

21 Jun 2022

Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration

...

198

22 Nov 2021

MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge

...

465

116

26 Oct 2021

DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion

ACM Transactions on Architecture and Code Optimization (TACO), 2020

339

213

30 Aug 2021

GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

194

25 Aug 2021

Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search

...

294

18 Aug 2021

Achieving Real-Time Object Detection on Mobile Devices with Neural Pruning Search

157

28 Jun 2021

NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration

Computer Vision and Pattern Recognition (CVPR), 2020

...

433

01 Dec 2020

ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning

...

Dingwen Tao

487

20 Nov 2020