Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.03494
Cited By
v1
v2
v3
v4 (latest)
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
10 February 2018
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AMC: AutoML for Model Compression and Acceleration on Mobile Devices"
50 / 632 papers shown
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation
Bernhard Klein
Falk Selker
Hendrik Borras
Sophie Steger
Franz Pernkopf
Holger Fröning
UQCV
BDL
234
0
0
28 Nov 2025
Decomposed Trust: Privacy, Adversarial Robustness, Ethics, and Fairness in Low-Rank LLMs
Daniel Agyei Asante
Md Mokarram Chowdhury
Yang Li
173
0
0
27 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
470
0
0
24 Nov 2025
Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI
A. Gomaa
Annette Schwarz
Ludwig Singer
Arnd Dörfler
M. May
...
Andrea Wittig
R. Fietkau
Christoph Bert
Stefanie Corradini
F. Putz
157
0
0
22 Nov 2025
Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models
Haidong Kang
Lihong Lin
Enneng Yang
Hongning Dai
Hao Wang
LRM
251
0
0
19 Nov 2025
CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems
M. H. Uddin
Sai Krishna Ghanta
Liam Seymour
S. Baidya
250
0
0
09 Nov 2025
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Wenjie Du
Li Jiang
Keda Tao
Xue Liu
Huan Wang
LRM
216
5
0
09 Oct 2025
Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
Dieter Balemans
Thomas Huybrechts
Jan Steckel
Siegfried Mercelis
166
0
0
04 Sep 2025
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
Mikołaj Janusz
Tomasz Wojnar
Yawei Li
Luca Benini
Kamil Adamczewski
VLM
173
3
0
19 Aug 2025
Tricks and Plug-ins for Gradient Boosting in Image Classification
Biyi Fang
J. Utke
Truong Vo
Diego Klabjan
276
0
0
30 Jul 2025
Meta Pruning via Graph Metanetworks : A Universal Meta Learning Framework for Network Pruning
Yewei Liu
Xiyuan Wang
Muhan Zhang
DD
GNN
416
0
0
24 May 2025
Automatic Complementary Separation Pruning Toward Lightweight CNNs
David Levin
Gonen Singer
266
0
0
19 May 2025
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Rongguang Ye
Ming Tang
368
2
0
18 May 2025
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
Mamba
541
7
0
13 May 2025
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Annual International Computer Software and Applications Conference (COMPSAC), 2025
Sanjay Surendranath Girija
Shashank Kapoor
Lakshit Arora
Dipen Pradhan
Aman Raj
Ankit Shetgaonkar
470
11
0
05 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
International Conference on Intelligent Computing (ICIC), 2025
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
363
1
0
05 May 2025
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
540
1
0
23 Apr 2025
CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices
International Conference on Intelligent Computing (ICIC), 2025
Jingxuan Zhou
Weidong Bao
Ji Wang
Zhengyi Zhong
274
0
0
14 Apr 2025
Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training
Yi Hu
Jinhang Zuo
Eddie Zhang
Bob Iannucci
Carlee Joe-Wong
375
2
0
13 Apr 2025
Kernel-Level Energy-Efficient Neural Architecture Search for Tabular Dataset
Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2025
Hoang-Loc La
Phuong Hoai Ha
353
2
0
11 Apr 2025
Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression
Ivan Ilin
Peter Richtárik
199
5
0
06 Apr 2025
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs
Chang Gao
Kang Zhao
Runqi Wang
Jianfei Chen
Liping Jing
432
1
0
24 Mar 2025
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Yinan Liang
Xiping Hu
Xiuwei Xu
Jie Zhou
Jiwen Lu
VLM
LRM
328
10
0
19 Mar 2025
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity
Chi Xu
Gefei Zhang
Yantong Zhu
Luca Benini
Guosheng Hu
Yawei Li
Zhihong Zhang
238
1
0
14 Mar 2025
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression
Computer Vision and Pattern Recognition (CVPR), 2025
Xiaoyi Qu
David Aponte
Colby R. Banbury
Daniel P. Robinson
Tianyu Ding
K. Koishida
Ilya Zharkov
Tianyi Chen
MQ
391
12
0
23 Feb 2025
Advancing Weight and Channel Sparsification with Enhanced Saliency
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Xinglong Sun
Maying Shen
Hongxu Yin
Lei Mao
Pavlo Molchanov
Jose M. Alvarez
273
1
0
05 Feb 2025
Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay
Asilomar Conference on Signals, Systems and Computers (ACSSC), 2024
Reemt Hinrichs
Jörn Ostermann
374
0
0
04 Feb 2025
B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning
Nikolaos Kaparinos
Vasileios Mezaris
CVBM
488
2
0
28 Jan 2025
Hardware-Aware DNN Compression for Homogeneous Edge Devices
Kunlong Zhang
Guiying Li
Ning Lu
Peng Yang
Shengcai Liu
341
2
0
25 Jan 2025
Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Giulia Fracastoro
Sophie M. Fosson
Andrea Migliorati
G. Calafiore
324
3
0
19 Jan 2025
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Lixian Jing
Haobing Liu
Junyu Dong
Yanwei Yu
3DPC
AI4CE
345
2
0
24 Dec 2024
Holistic Adversarially Robust Pruning
International Conference on Learning Representations (ICLR), 2024
Qi Zhao
Christian Wressnegger
332
13
0
19 Dec 2024
Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization
Abdesselam Ferdi
340
2
0
25 Nov 2024
Electrostatic Force Regularization for Neural Structured Pruning
Abdesselam Ferdi
A. Taleb-Ahmed
A. Nakib
Youcef Ferdi
383
1
0
17 Nov 2024
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling
Denis Kuznedelev
Eldar Kurtic
Dan Alistarh
MQ
567
5
0
18 Oct 2024
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Shangqian Gao
Chi-Heng Lin
Ting Hua
Tang Zheng
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
324
26
0
15 Oct 2024
ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model
Shuhao Gu
Mengdi Zhao
Bowen Zhang
Liangdong Wang
Jijie Li
Guang Liu
295
0
0
06 Oct 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
European Conference on Computer Vision (ECCV), 2024
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
380
13
0
23 Sep 2024
An Efficient Privacy-aware Split Learning Framework for Satellite Communications
IEEE Journal on Selected Areas in Communications (JSAC), 2024
Jianfei Sun
Cong Wu
Shahid Mumtaz
Junyi Tao
Mingsheng Cao
Mei Wang
Valerio Frascolla
285
23
0
13 Sep 2024
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
Tianyi Chen
Xiaoyi Qu
David Aponte
Colby R. Banbury
Jongwoo Ko
Tianyu Ding
Yong Ma
Vladimir Lyapunov
Ilya Zharkov
Luming Liang
568
3
0
11 Sep 2024
Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy
International Symposium on Mixed and Augmented Reality (ISMAR), 2024
Boyuan Tian
Yihan Pang
Muhammad Huzaifa
Shenlong Wang
Sarita Adve
341
3
0
06 Sep 2024
PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets Estimator
Neural Networks (NN), 2024
Shiguang Wang
Tao Xie
Haijun Liu
Xingcheng Zhang
Jian Cheng
267
4
0
29 Aug 2024
An Effective Information Theoretic Framework for Channel Pruning
Yihao Chen
Zefang Wang
350
12
0
14 Aug 2024
PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
223
0
0
05 Aug 2024
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
265
0
0
29 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
311
101
0
02 Jul 2024
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
Enshu Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Matthew B. Blaschko
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MoE
288
34
0
01 Jul 2024
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
International Conference on Machine Learning (ICML), 2024
Jinuk Kim
Marwa El Halabi
Mingi Ji
Hyun Oh Song
401
5
0
18 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
518
11
0
17 Jun 2024
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
Ashish Kumar
Daneul Kim
Jaesik Park
Laxmidhar Behera
308
4
0
16 Jun 2024
1
2
3
4
...
11
12
13
Next
Page 1 of 13
Page
of 13
Go