v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,631 papers shown

Revisiting Structured DropoutAsian Conference on Machine Learning (ACML), 2022

182

05 Oct 2022

Streaming Video Analytics On The Edge With Asynchronous Cloud Support

Venkat N. Padmanabhan

193

04 Oct 2022

spred: Solving

L_1

Penalty with SGDInternational Conference on Machine Learning (ICML), 2022

Liu Ziyin

Zihao Wang

539

03 Oct 2022

Limitations of neural network training due to numerical instability of backpropagationAdvances in Computational Mathematics (ACM), 2022

Clemens Karner

V. Kazeev

P. Petersen

259

03 Oct 2022

EAPruning: Evolutionary Pruning for Vision Transformers and CNNsBritish Machine Vision Conference (BMVC), 2022

132

01 Oct 2022

Diving into Unified Data-Model Sparsity for Class-Imbalanced Graph Representation Learning

Chao Huang

139

01 Oct 2022

Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing

Guang Li

Ren Togo

Takahiro Ogawa

Miki Haseyama

233

29 Sep 2022

$Physics-aware Differentiable Discrete Codesign for Diffractive Optical Neural Networks$

Physics-aware Differentiable Discrete Codesign for Diffractive Optical Neural Networks

197

28 Sep 2022

Adaptive Sparse ViT: Towards Learnable Adaptive Token Pruning by Fully Exploiting Self-AttentionInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

205

28 Sep 2022

Sauron U-Net: Simple automated redundancy elimination in medical image segmentation via filter pruningNeurocomputing (Neurocomputing), 2022

208

27 Sep 2022

Neural Network Panning: Screening the Optimal Sparse Network Before TrainingAsian Conference on Computer Vision (ACCV), 2022

123

27 Sep 2022

Outlier Suppression: Pushing the Limit of Low-bit Transformer Language ModelsNeural Information Processing Systems (NeurIPS), 2022

Shanghang Zhang

Xianglong Liu

378

194

27 Sep 2022

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse TrainingNeural Information Processing Systems (NeurIPS), 2022

290

22 Sep 2022

Deep Learning on Home Drone: Searching for the Optimal ArchitectureIEEE International Conference on Robotics and Automation (ICRA), 2022

Daniela Rus

125

21 Sep 2022

State-driven Implicit Modeling for Sparsity and Robustness in Neural Networks

207

19 Sep 2022

Tree-based Text-Vision BERT for Video Search in Baidu Video Advertising

136

19 Sep 2022

Enabling Conversational Interaction with Mobile UI using Large Language ModelsInternational Conference on Human Factors in Computing Systems (CHI), 2022

Bryan Wang

Gang Li

Yang Li

402

174

18 Sep 2022

Improving the Performance of DNN-based Software Services using Automated Layer Caching

129

18 Sep 2022

Pruning Neural Networks via Coresets and Convex Geometry: Towards No AssumptionsNeural Information Processing Systems (NeurIPS), 2022

187

18 Sep 2022

Learning to Weight Samples for Dynamic Early-exiting NetworksEuropean Conference on Computer Vision (ECCV), 2022

Gao Huang

269

17 Sep 2022

PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimationEuropean Conference on Computer Vision (ECCV), 2022

Haoyu Ma

Zhe Wang

Yifei Chen

180

16 Sep 2022

Self-Attentive Pooling for Efficient Deep LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

259

16 Sep 2022

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

322

15 Sep 2022

MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems

149

15 Sep 2022

Neural Networks Reduction via LumpingInternational Conference of the Italian Association for Artificial Intelligence (AIxIA), 2022

226

15 Sep 2022

Efficient Quantized Sparse Matrix Operations on Tensor CoresInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2022

Shigang Li

Kazuki Osawa

Torsten Hoefler

415

14 Sep 2022

Federated Pruning: Improving Neural Network Efficiency with Federated LearningInterspeech (Interspeech), 2022

Ding Zhao

146

14 Sep 2022

Sparsity-guided Network Design for Frame Interpolation

Tianyi Chen

232

09 Sep 2022

ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and InferenceIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2022

249

09 Sep 2022

Seeking Interpretability and Explainability in Binary Activated Neural Networks

Benjamin Leblanc

Pascal Germain

FAtt

449

07 Sep 2022

Improving the Cross-Lingual Generalisation in Visual Question AnsweringAAAI Conference on Artificial Intelligence (AAAI), 2022

Farhad Nooralahzadeh

Rico Sennrich

245

07 Sep 2022

Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU

...

Zicheng Zhang

152

06 Sep 2022

Low-Power Hardware-Based Deep-Learning Diagnostics Support Case StudyBiomedical Circuits and Systems Conference (BioCAS), 2018

Khushal Sethi

V. Parmar

Manan Suri

03 Sep 2022

Incremental Online Learning Algorithms Comparison for Gesture and Visual Smart SensorsIEEE International Joint Conference on Neural Network (IJCNN), 2022

Alessandro Avi

Andrea Albanese

Davide Brunelli

235

01 Sep 2022

On Quantizing Implicit Neural RepresentationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

309

01 Sep 2022

ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network QuantizationMicro (MICRO), 2022

Cong Guo

Chen Zhang

Jingwen Leng

Zihan Liu

Fan Yang

Yun-Bo Liu

Minyi Guo

Yuhao Zhu

179

30 Aug 2022

Symmetric Pruning in Quantum Neural NetworksInternational Conference on Learning Representations (ICLR), 2022

261

30 Aug 2022

A Deep Neural Networks ensemble workflow from hyperparameter search to inference leveraging GPU clustersInternational Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), 2022

216

30 Aug 2022

Reducing Computational Complexity of Neural Networks in Optical Channel Equalization: From Concepts to ImplementationJournal of Lightwave Technology (JLT), 2022

Jaroslaw E. Prilepsky

288

26 Aug 2022

Complexity-Driven CNN Compression for Resource-constrained Edge AIIEEE Transactions on Artificial Intelligence (IEEE TAI), 2022

Muhammad Zawish

Steven Davy

L. Abraham

210

26 Aug 2022

Anytime-Lidar: Deadline-aware 3D Object DetectionIEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), 2022

Ahmet Soyyigit

Shuochao Yao

H. Yun

3DPC

125

25 Aug 2022