
AMC: AutoML for Model Compression and Acceleration on Mobile Devices

10 February 2018

Zhijian Liu

Song Han

Papers citing "AMC: AutoML for Model Compression and Acceleration on Mobile Devices"

50 / 632 papers shown

Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation

234

28 Nov 2025

Decomposed Trust: Privacy, Adversarial Robustness, Ethics, and Fairness in Low-Rank LLMs

Daniel Agyei Asante

Md Mokarram Chowdhury

Yang Li

173

27 Nov 2025

FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning

470

24 Nov 2025

Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI

...

157

22 Nov 2025

Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models

251

19 Nov 2025

CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems

250

09 Nov 2025

Which Heads Matter for Reasoning? RL-Guided KV Cache Compression

216

09 Oct 2025

Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning

166

04 Sep 2025

One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression

173

19 Aug 2025

Tricks and Plug-ins for Gradient Boosting in Image Classification

276

30 Jul 2025

Meta Pruning via Graph Metanetworks: A Universal Meta Learning Framework for Network Pruning

416

24 May 2025

Automatic Complementary Separation Pruning Toward Lightweight CNNs

David Levin

Gonen Singer

266

19 May 2025

One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Rongguang Ye

Ming Tang

368

18 May 2025

Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments

541

13 May 2025

Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques

Annual International Computer Software and Applications Conference (COMPSAC), 2025

Sanjay Surendranath Girija

470

05 May 2025

A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition

International Conference on Intelligent Computing (ICIC), 2025

363

05 May 2025

BackSlash: Rate Constrained Optimized Training of Large Language Models

Jun Wu

Jiangtao Wen

Yuxing Han

540

23 Apr 2025

CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices

International Conference on Intelligent Computing (ICIC), 2025

274

14 Apr 2025

Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training

375

13 Apr 2025

Kernel-Level Energy-Efficient Neural Architecture Search for Tabular Dataset

Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2025

Hoang-Loc La

Phuong Hoai Ha

353

11 Apr 2025

Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression

Ivan Ilin

Peter Richtárik

199

06 Apr 2025

Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs

432

24 Mar 2025

EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models

Computer Vision and Pattern Recognition (CVPR), 2025

328

19 Mar 2025

Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity

238

14 Mar 2025

Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression

Computer Vision and Pattern Recognition (CVPR), 2025

391

23 Feb 2025

Advancing Weight and Channel Sparsification with Enhanced Saliency

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

273

05 Feb 2025

Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay

Asilomar Conference on Signals, Systems and Computers (ACSSC), 2024

Reemt Hinrichs

Jörn Ostermann

374

04 Feb 2025

B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning

Nikolaos Kaparinos

Vasileios Mezaris

CVBM

488

28 Jan 2025

Hardware-Aware DNN Compression for Homogeneous Edge Devices

341

25 Jan 2025

Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

324

19 Jan 2025

AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning

345

24 Dec 2024

Holistic Adversarially Robust Pruning

International Conference on Learning Representations (ICLR), 2024

Qi Zhao

Christian Wressnegger

332

19 Dec 2024

Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization

Abdesselam Ferdi

340

25 Nov 2024

Electrostatic Force Regularization for Neural Structured Pruning

383

17 Nov 2024

EvoPress: Accurate Dynamic Model Compression via Evolutionary Search

567

18 Oct 2024

DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

Neural Information Processing Systems (NeurIPS), 2024

324

15 Oct 2024

ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model

Guang Liu

295

06 Oct 2024

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

European Conference on Computer Vision (ECCV), 2024

Zhe Lin

Heng Huang

DiffM

380

23 Sep 2024

An Efficient Privacy-aware Split Learning Framework for Satellite Communications

IEEE Journal on Selected Areas in Communications (JSAC), 2024

285

13 Sep 2024

HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning

568

11 Sep 2024

Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy

International Symposium on Mixed and Augmented Reality (ISMAR), 2024

341

06 Sep 2024

PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator

Neural Networks (NN), 2024

Shiguang Wang

Tao Xie

Haijun Liu

Xingcheng Zhang

Jian Cheng

267

29 Aug 2024

An Effective Information Theoretic Framework for Channel Pruning

Yihao Chen

Zefang Wang

350

14 Aug 2024

PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy

Rachmad Vidya Wicaksana Putra

Muhammad Abdullah Hanif

Mohamed Bennai

223

05 Aug 2024

Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices

Hayun Lee

Dongkun Shin

265

29 Jul 2024

Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application

Chuanpeng Yang

Wang Lu

Yao Zhu

Yidong Wang

Yiqiang Chen

311

101

02 Jul 2024

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

Enshu Liu

Huazhong Yang

Yu Wang

MoE

288

01 Jul 2024

LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging

International Conference on Machine Learning (ICML), 2024

401

18 Jun 2024

Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models

Heng Huang

518

17 Jun 2024

Pick-or-Mix: Dynamic Channel Sampling for ConvNets

308

16 Jun 2024