
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,628 papers shown

Title
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers. International Conference on Artificial Intelligence Circuits and Systems (ICAICS), 2023 Julian Moosmann Marco Giordano Christian Vogt Michele Magno MQ ObjD 205 30 0 22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity Yannan Nellie Wu Po-An Tsai Saurav Muralidharan A. Parashar Vivienne Sze J. Emer 165 41 0 22 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. IEEE International Conference on Multimedia and Expo (ICME), 2023 Yijia Zhang Lingran Zhao Shijie Cao Wenqiang Wang Ting Cao Fan Yang Mao Yang Shanghang Zhang Ningyi Xu MQ 139 24 0 21 May 2023
Self-Distillation with Meta Learning for Knowledge Graph Completion. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yunshui Li Junhao Liu Chengming Li Min Yang 184 8 0 20 May 2023
Efficient Prompting via Dynamic In-Context Learning Wangchunshu Zhou Yuchen Eleanor Jiang Robert Bamler Mrinmaya Sachan 157 25 0 18 May 2023
PDP: Parameter-free Differentiable Pruning is All You Need. Neural Information Processing Systems (NeurIPS), 2023 Minsik Cho Saurabh N. Adya Devang Naik VLM 187 15 0 18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention. International Journal of Computer Vision (IJCV), 2023 Guangxuan Xiao Tianwei Yin William T. Freeman F. Durand Song Han VGen DiffM 295 334 0 17 May 2023
Analyzing Compression Techniques for Computer Vision Maniratnam Mandal Imran Khan 154 1 0 14 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks. International Conference on Machine Learning (ICML), 2023 Guihong Li Kartikeya Bhardwaj Yuedong Yang R. Marculescu AAML 279 0 0 13 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data Zhao Song Mingquan Ye 199 4 0 13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition. Spoken Language Technology Workshop (SLT), 2023 Suhaila M. Shakiah Rupak Vignesh Swaminathan Hieu Duy Nguyen Raviteja Chinta Tariq Afzal Nathan Susanj Athanasios Mouchtaris Grant P. Strimel Ariya Rastrow 133 1 0 12 May 2023
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems. Neural Networks (Neural Netw.), 2023 Yeshwanth Venkatesha Youngeun Kim Hyoungseob Park Priyadarshini Panda FedML 114 6 0 11 May 2023
Post-training Model Quantization Using GANs for Synthetic Data Generation Athanasios Masouris Mansi Sharma Adrian Boguszewski Alexander Kozlov Zhuo Wu Raymond Lo MQ 146 0 0 10 May 2023
VEDLIoT -- Next generation accelerated AIoT systems and applications. ACM International Conference on Computing Frontiers (CF), 2023 Kevin Mika R. Griessl N. Kucza F. Porrmann M. Kaiser ... Mario Porrmann Hans-Martin Heyn E. Knauss Yufei Mao Franz Meierhofer 114 6 0 09 May 2023
DietCNN: Multiplication-free Inference for Quantized CNNs. IEEE International Joint Conference on Neural Network (IJCNN), 2023 Swarnava Dey P. Dasgupta P. Chakrabarti MQ 237 1 0 09 May 2023
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance Lingjiao Chen Matei A. Zaharia James Zou LLMAG 346 378 0 09 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation J. Heo S. Azizi A. Fayyazi Massoud Pedram 205 1 0 08 May 2023
Compressing audio CNNs with graph centrality based filter pruning. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023 James A. King Ashutosh Kumar Singh Mark D. Plumbley GNN 122 2 0 05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning. International Symposium on High-Performance Computer Architecture (HPCA), 2023 Sai Qian Zhang Thierry Tambe Nestor Cuevas Gu-Yeon Wei David Brooks 206 9 0 04 May 2023
Input Layer Binarization with Bit-Plane Encoding. International Conference on Artificial Neural Networks (ICANN), 2023 Lorenzo Vorabbi Davide Maltoni Stefano Santi MQ 162 8 0 04 May 2023
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions. CSIAM Transactions on Applied Mathematics (TCAM), 2023 Lin Chen Shitong Wu Wen-Long Ye Huihui Wu Wen-Ying Zhang Hao Wu Bo Bai 59 9 0 04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning. Conference on Machine Learning and Systems (MLSys), 2023 Hongyi Wang Saurabh Agarwal Pongsakorn U-chupala Yoshiki Tanaka Eric P. Xing Dimitris Papailiopoulos OffRL 270 26 0 04 May 2023
Dynamic Sparse Training with Structured Sparsity. International Conference on Learning Representations (ICLR), 2023 Mike Lasby A. Golubeva Utku Evci Mihai Nica Yani Andrew Ioannou 566 33 0 03 May 2023
A Digital Twin Empowered Lightweight Model Sharing Scheme for Multi-Robot Systems. IEEE Internet of Things Journal (IEEE IoT J.), 2023 Kai Xiong Zhihong Wang S. Leng Jianhua He 109 14 0 03 May 2023
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge Platforms Ziyang Zhang Huan Li Yang Zhao Changyao Lin Jie Liu 145 5 0 01 May 2023
CORSD: Class-Oriented Relational Self Distillation. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Muzhou Yu S. Tan Kailu Wu Runpei Dong Linfeng Zhang Kaisheng Ma 102 1 0 28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models D. Honegger Konstantin Schurholt Damian Borth 236 5 0 26 Apr 2023
Optimizing Deep Learning Models For Raspberry Pi Sa Ameen Kangaranmulle Siriwardana Theodoros Theodoridis VLM 90 11 0 25 Apr 2023
Multiplierless In-filter Computing for tinyML Platforms. International Conference on VLSI Design (VLSID), 2023 Abhishek Ramdas Nair P. Nath S. Chakrabartty Chetan Singh Thakur 95 1 0 24 Apr 2023
The Case for Hierarchical Deep Learning Inference at the Network Edge Ghina Al-Atat Andrea Fresa Adarsh Prasad Behera Vishnu Narayanan Moothedath James Gross J. Champati 149 12 0 23 Apr 2023
Deep Convolutional Tables: Deep Learning without Convolutions. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 S. Dekel Y. Keller Aharon Bar-Hillel 3DV 250 0 0 23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model. International Conference on Quantum Computing and Engineering (QCE), 2023 Zhepeng Wang Jinyang Li Zhirui Hu Blake Gage Elizabeth Iwasawa Weiwen Jiang 261 16 0 23 Apr 2023
Identifying Appropriate Intellectual Property Protection Mechanisms for Machine Learning Models: A Systematization of Watermarking, Fingerprinting, Model Access, and Attacks. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 Isabell Lederer Rudolf Mayer Andreas Rauber 228 29 0 22 Apr 2023
Securing Neural Networks with Knapsack Optimization Yakir Gorski Amir Jevnisek S. Avidan AAML 104 1 0 20 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review. IEEE Transactions on Intelligent Vehicles (TIV), 2023 Shanliang Yao Runwei Guan Xiaoyu Huang Zhuoxiao Li Xiangyu Sha ... Eng Gee Lim H. Seo Ka Lok Man Xiaohui Zhu Yutao Yue 244 171 0 20 Apr 2023
Knowledge Distillation Under Ideal Joint Classifier Assumption. Neural Networks (Neural Netw.), 2023 Huayu Li Xiwen Chen G. Ditzler Janet Roveda Ao Li 140 2 0 19 Apr 2023
Adaptive Scheduling for Edge-Assisted DNN Serving. IEEE International Conference on Mobile Adhoc and Sensor Systems (MASS), 2023 Jian He Chen-Shun Yang Zhaoyuan He Ghufran Baig L. Qiu 104 1 0 19 Apr 2023
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing. Expert systems with applications (ESWA), 2023 An-dong Li Milan Markovic P. Edwards Georgios Leontidis FedML 131 24 0 19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption. IEEE International Symposium on On-Line Testing and Robust System Design (IOLTS), 2023 Wouter Legiest Jan-Pieter D'Anvers Furkan Turan Michiel Van Beirendonck Ingrid Verbauwhede MQ 148 6 0 19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Xiuying Wei Yunchen Zhang Yuhang Li Xiangguo Zhang Yazhe Niu Jian Ren Zhengang Li MQ 203 57 0 18 Apr 2023
Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks. IEEE Access (IEEE Access), 2023 Chenqiu Zhao Guanfang Dong Shupei Zhang Zijie Tan Anup Basu 311 4 0 17 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans Alexander Warnecke Julian Speith Janka Möller Konrad Rieck C. Paar AAML 478 3 0 17 Apr 2023
SalientGrads: Sparse Models for Communication Efficient and Data Aware Distributed Federated Training Riyasat Ohib Bishal Thapaliya Pratyush Gaggenapalli Qingbin Liu Vince D. Calhoun Sergey Plis FedML 132 2 0 15 Apr 2023
Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model. IEEE International Joint Conference on Neural Network (IJCNN), 2023 Dingcheng Yang Wenjian Yu Zihao Xiao Jiaqi Luo AAML DiffM 161 6 0 14 Apr 2023
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving Services. IEEE Communications Surveys and Tutorials (COMST), 2023 Dewant Katare Diego Perino J. Nurmi M. Warnier Marijn Janssen Aaron Yi Ding 262 61 0 13 Apr 2023
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression. International Journal of Computer Vision (IJCV), 2023 Ziwei Wang Jiwen Lu Han Xiao Shengyu Liu Jie Zhou OffRL 150 1 0 13 Apr 2023
Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 Z. Su Jiehua Zhang Tianpeng Liu Zhen Liu Shuanghui Zhang M. Pietikäinen Tianpeng Liu 154 5 0 13 Apr 2023
EcoFed: Efficient Communication for DNN Partitioning-based Federated Learning. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2023 Di Wu R. Ullah Philip Rodgers Peter Kilpatrick I. Spence Blesson Varghese FedML 255 8 0 11 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging Jose Javier Gonzalez Ortiz John Guttag Adrian Dalca 210 0 0 11 Apr 2023
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration. IEEE Journal of Solid-State Circuits (JSSC), 2023 I. Miro-Panadès Benoît Tain J. Christmann David Coriat R. Lemaire ... Jean-Marc Philippe Y. Thonnart A. Valentian Frédéric Heitzmann F. Clermidy 83 19 0 11 Apr 2023