ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.03494
  4. Cited By
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
v1v2v3v4 (latest)

AMC: AutoML for Model Compression and Acceleration on Mobile Devices

10 February 2018
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
ArXiv (abs)PDFHTML

Papers citing "AMC: AutoML for Model Compression and Acceleration on Mobile Devices"

50 / 632 papers shown
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation
Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation
Bernhard Klein
Falk Selker
Hendrik Borras
Sophie Steger
Franz Pernkopf
Holger Fröning
UQCVBDL
234
0
0
28 Nov 2025
Decomposed Trust: Privacy, Adversarial Robustness, Ethics, and Fairness in Low-Rank LLMs
Decomposed Trust: Privacy, Adversarial Robustness, Ethics, and Fairness in Low-Rank LLMs
Daniel Agyei Asante
Md Mokarram Chowdhury
Yang Li
173
0
0
27 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
470
0
0
24 Nov 2025
Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI
Large-Scale Pre-training Enables Multimodal AI Differentiation of Radiation Necrosis from Brain Metastasis Progression on Routine MRI
A. Gomaa
Annette Schwarz
Ludwig Singer
Arnd Dörfler
M. May
...
Andrea Wittig
R. Fietkau
Christoph Bert
Stefanie Corradini
F. Putz
157
0
0
22 Nov 2025
Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models
Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models
Haidong Kang
Lihong Lin
Enneng Yang
Hongning Dai
Hao Wang
LRM
251
0
0
19 Nov 2025
CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems
CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems
M. H. Uddin
Sai Krishna Ghanta
Liam Seymour
S. Baidya
250
0
0
09 Nov 2025
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
Wenjie Du
Li Jiang
Keda Tao
Xue Liu
Huan Wang
LRM
216
5
0
09 Oct 2025
Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
Dieter Balemans
Thomas Huybrechts
Jan Steckel
Siegfried Mercelis
166
0
0
04 Sep 2025
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
Mikołaj Janusz
Tomasz Wojnar
Yawei Li
Luca Benini
Kamil Adamczewski
VLM
173
3
0
19 Aug 2025
Tricks and Plug-ins for Gradient Boosting in Image Classification
Tricks and Plug-ins for Gradient Boosting in Image Classification
Biyi Fang
J. Utke
Truong Vo
Diego Klabjan
276
0
0
30 Jul 2025
Meta Pruning via Graph Metanetworks : A Universal Meta Learning Framework for Network Pruning
Meta Pruning via Graph Metanetworks : A Universal Meta Learning Framework for Network Pruning
Yewei Liu
Xiyuan Wang
Muhan Zhang
DDGNN
416
0
0
24 May 2025
Automatic Complementary Separation Pruning Toward Lightweight CNNs
Automatic Complementary Separation Pruning Toward Lightweight CNNs
David Levin
Gonen Singer
266
0
0
19 May 2025
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
One-for-All Pruning: A Universal Model for Customized Compression of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Rongguang Ye
Ming Tang
368
2
0
18 May 2025
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
Mamba
541
7
0
13 May 2025
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression TechniquesAnnual International Computer Software and Applications Conference (COMPSAC), 2025
Sanjay Surendranath Girija
Shashank Kapoor
Lakshit Arora
Dipen Pradhan
Aman Raj
Ankit Shetgaonkar
470
11
0
05 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
A Wireless Collaborated Inference Acceleration Framework for Plant Disease RecognitionInternational Conference on Intelligent Computing (ICIC), 2025
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
363
1
0
05 May 2025
BackSlash: Rate Constrained Optimized Training of Large Language Models
BackSlash: Rate Constrained Optimized Training of Large Language Models
Jun Wu
Jiangtao Wen
Yuxing Han
540
1
0
23 Apr 2025
CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices
CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge DevicesInternational Conference on Intelligent Computing (ICIC), 2025
Jingxuan Zhou
Weidong Bao
Ji Wang
Zhengyi Zhong
274
0
0
14 Apr 2025
Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training
Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training
Yi Hu
Jinhang Zuo
Eddie Zhang
Bob Iannucci
Carlee Joe-Wong
375
2
0
13 Apr 2025
Kernel-Level Energy-Efficient Neural Architecture Search for Tabular Dataset
Kernel-Level Energy-Efficient Neural Architecture Search for Tabular DatasetAsian Conference on Intelligent Information and Database Systems (ACIIDS), 2025
Hoang-Loc La
Phuong Hoai Ha
353
2
0
11 Apr 2025
Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression
Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression
Ivan Ilin
Peter Richtárik
199
5
0
06 Apr 2025
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs
Chang Gao
Kang Zhao
Runqi Wang
Jianfei Chen
Liping Jing
432
1
0
24 Mar 2025
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Yinan Liang
Xiping Hu
Xiuwei Xu
Jie Zhou
Jiwen Lu
VLMLRM
328
10
0
19 Mar 2025
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity
Chi Xu
Gefei Zhang
Yantong Zhu
Luca Benini
Guosheng Hu
Yawei Li
Zhihong Zhang
238
1
0
14 Mar 2025
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression
Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and CompressionComputer Vision and Pattern Recognition (CVPR), 2025
Xiaoyi Qu
David Aponte
Colby R. Banbury
Daniel P. Robinson
Tianyu Ding
K. Koishida
Ilya Zharkov
Tianyi Chen
MQ
391
12
0
23 Feb 2025
Advancing Weight and Channel Sparsification with Enhanced Saliency
Advancing Weight and Channel Sparsification with Enhanced SaliencyIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Xinglong Sun
Maying Shen
Hongxu Yin
Lei Mao
Pavlo Molchanov
Jose M. Alvarez
273
1
0
05 Feb 2025
Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay
Pruning-aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero DelayAsilomar Conference on Signals, Systems and Computers (ACSSC), 2024
Reemt Hinrichs
Jörn Ostermann
374
0
0
04 Feb 2025
B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning
B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning
Nikolaos Kaparinos
Vasileios Mezaris
CVBM
488
2
0
28 Jan 2025
Hardware-Aware DNN Compression for Homogeneous Edge Devices
Hardware-Aware DNN Compression for Homogeneous Edge Devices
Kunlong Zhang
Guiying Li
Ning Lu
Peng Yang
Shengcai Liu
341
2
0
25 Jan 2025
Playing the Lottery With Concave Regularizers for Sparse Trainable Neural Networks
Playing the Lottery With Concave Regularizers for Sparse Trainable Neural NetworksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Giulia Fracastoro
Sophie M. Fosson
Andrea Migliorati
G. Calafiore
324
3
0
19 Jan 2025
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Lixian Jing
Haobing Liu
Junyu Dong
Yanwei Yu
3DPCAI4CE
345
2
0
24 Dec 2024
Holistic Adversarially Robust Pruning
Holistic Adversarially Robust PruningInternational Conference on Learning Representations (ICLR), 2024
Qi Zhao
Christian Wressnegger
332
13
0
19 Dec 2024
Deep Convolutional Neural Networks Structured Pruning via Gravity
  Regularization
Deep Convolutional Neural Networks Structured Pruning via Gravity Regularization
Abdesselam Ferdi
340
2
0
25 Nov 2024
Electrostatic Force Regularization for Neural Structured Pruning
Abdesselam Ferdi
A. Taleb-Ahmed
A. Nakib
Youcef Ferdi
383
1
0
17 Nov 2024
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling
Denis Kuznedelev
Eldar Kurtic
Dan Alistarh
MQ
567
5
0
18 Oct 2024
DISP-LLM: Dimension-Independent Structural Pruning for Large Language
  Models
DISP-LLM: Dimension-Independent Structural Pruning for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024
Shangqian Gao
Chi-Heng Lin
Ting Hua
Tang Zheng
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
324
26
0
15 Oct 2024
ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large
  Language Model
ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model
Shuhao Gu
Mengdi Zhao
Bowen Zhang
Liangdong Wang
Jijie Li
Guang Liu
295
0
0
06 Oct 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and
  Sub-Network Selection
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network SelectionEuropean Conference on Computer Vision (ECCV), 2024
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
380
13
0
23 Sep 2024
An Efficient Privacy-aware Split Learning Framework for Satellite
  Communications
An Efficient Privacy-aware Split Learning Framework for Satellite CommunicationsIEEE Journal on Selected Areas in Communications (JSAC), 2024
Jianfei Sun
Cong Wu
Shahid Mumtaz
Junyi Tao
Mingsheng Cao
Mei Wang
Valerio Frascolla
285
23
0
13 Sep 2024
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
Tianyi Chen
Xiaoyi Qu
David Aponte
Colby R. Banbury
Jongwoo Ko
Tianyu Ding
Yong Ma
Vladimir Lyapunov
Ilya Zharkov
Luming Liang
568
3
0
11 Sep 2024
Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency,
  and Accuracy
Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and AccuracyInternational Symposium on Mixed and Augmented Reality (ISMAR), 2024
Boyuan Tian
Yihan Pang
Muhammad Huzaifa
Shenlong Wang
Sarita Adve
341
3
0
06 Sep 2024
PSE-Net: Channel Pruning for Convolutional Neural Networks with
  Parallel-subnets Estimator
PSE-Net: Channel Pruning for Convolutional Neural Networks with Parallel-subnets EstimatorNeural Networks (NN), 2024
Shiguang Wang
Tao Xie
Haijun Liu
Xingcheng Zhang
Jian Cheng
267
4
0
29 Aug 2024
An Effective Information Theoretic Framework for Channel Pruning
An Effective Information Theoretic Framework for Channel Pruning
Yihao Chen
Zefang Wang
350
12
0
14 Aug 2024
PENDRAM: Enabling High-Performance and Energy-Efficient Processing of
  Deep Neural Networks through a Generalized DRAM Data Mapping Policy
PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
223
0
0
05 Aug 2024
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile
  Devices
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
265
0
0
29 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods,
  Evaluation, and Application
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALMKELM
311
101
0
02 Jul 2024
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models:
  Enhancing Performance and Reducing Inference Costs
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
Enshu Liu
Junyi Zhu
Zinan Lin
Xuefei Ning
Matthew B. Blaschko
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MoE
288
34
0
01 Jul 2024
LayerMerge: Neural Network Depth Compression through Layer Pruning and
  Merging
LayerMerge: Neural Network Depth Compression through Layer Pruning and MergingInternational Conference on Machine Learning (ICML), 2024
Jinuk Kim
Marwa El Halabi
Mingi Ji
Hyun Oh Song
401
5
0
18 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffMVLM
518
11
0
17 Jun 2024
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
Ashish Kumar
Daneul Kim
Jaesik Park
Laxmidhar Behera
308
4
0
16 Jun 2024
1234...111213
Next
Page 1 of 13
Pageof 13