ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.09282
  4. Cited By
A Survey of Model Compression and Acceleration for Deep Neural Networks

A Survey of Model Compression and Acceleration for Deep Neural Networks

23 October 2017
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
ArXivPDFHTML

Papers citing "A Survey of Model Compression and Acceleration for Deep Neural Networks"

50 / 111 papers shown
Title
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
48
3
0
04 Apr 2025
Data Generation for Hardware-Friendly Post-Training Quantization
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
130
0
0
29 Oct 2024
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
Rickard Brüel-Gabrielsson
Jiacheng Zhu
Onkar Bhardwaj
Leshem Choshen
Kristjan Greenewald
Mikhail Yurochkin
Justin Solomon
38
5
0
17 Jun 2024
Tiny Models are the Computational Saver for Large Models
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
29
2
0
26 Mar 2024
Choosing Wisely and Learning Deeply: Selective Cross-Modality
  Distillation via CLIP for Domain Generalization
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Jixuan Leng
Yijiang Li
Haohan Wang
VLM
29
0
0
26 Nov 2023
Federated learning compression designed for lightweight communications
Federated learning compression designed for lightweight communications
Lucas Grativol Ribeiro
Mathieu Léonardon
Guillaume Muller
Virginie Fresse
Matthieu Arzel
FedML
25
3
0
23 Oct 2023
Language Modeling Is Compression
Language Modeling Is Compression
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
...
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
AI4CE
30
129
0
19 Sep 2023
Training Acceleration of Low-Rank Decomposed Networks using Sequential
  Freezing and Rank Quantization
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
Habib Hajimolahoseini
Walid Ahmed
Yang Liu
OffRL
MQ
19
6
0
07 Sep 2023
Neural Networks at a Fraction with Pruned Quaternions
Neural Networks at a Fraction with Pruned Quaternions
Sahel Mohammad Iqbal
Subhankar Mishra
22
4
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture:
  Basics, Opportunities, and Challenges
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
25
3
0
13 Aug 2023
Accelerating Distributed ML Training via Selective Synchronization
Accelerating Distributed ML Training via Selective Synchronization
S. Tyagi
Martin Swany
FedML
24
3
0
16 Jul 2023
Proximity to Losslessly Compressible Parameters
Proximity to Losslessly Compressible Parameters
Matthew Farrugia-Roberts
30
0
0
05 Jun 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in
  Autonomous Driving: A Comprehensive Review
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
31
91
0
20 Apr 2023
Domain Adaptation for Inertial Measurement Unit-based Human Activity
  Recognition: A Survey
Domain Adaptation for Inertial Measurement Unit-based Human Activity Recognition: A Survey
Avijoy Chakma
A. Faridee
Indrajeet Ghosh
Nirmalya Roy
16
4
0
07 Apr 2023
Learning to Zoom and Unzoom
Learning to Zoom and Unzoom
Chittesh Thavamani
Mengtian Li
Francesco Ferroni
Deva Ramanan
17
8
0
27 Mar 2023
HEAR4Health: A blueprint for making computer audition a staple of modern
  healthcare
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
...
Shahin Amiriparian
K. D. Bartl-Pokorny
A. Batliner
Florian B. Pokorny
Björn W. Schuller
39
7
0
25 Jan 2023
FSCNN: A Fast Sparse Convolution Neural Network Inference System
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
16
3
0
17 Dec 2022
Unbiased Knowledge Distillation for Recommendation
Unbiased Knowledge Distillation for Recommendation
Gang Chen
Jiawei Chen
Fuli Feng
Sheng Zhou
Xiangnan He
19
27
0
27 Nov 2022
Efficient Incremental Text-to-Speech on GPUs
Efficient Incremental Text-to-Speech on GPUs
Muyang Du
Chuan Liu
Jiaxing Qi
Junjie Lai
11
1
0
25 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain
  Generalization
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL
MLT
AI4CE
19
51
0
24 Nov 2022
R-WhONet: Recalibrated Wheel Odometry Neural Network for Vehicular
  Positioning using Transfer Learning
R-WhONet: Recalibrated Wheel Odometry Neural Network for Vehicular Positioning using Transfer Learning
Uche Onyekpe
Alicja Szkolnik
Vasile Palade
S. Kanarachos
M. Fitzpatrick
30
1
0
13 Sep 2022
Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing
  Methods
Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing Methods
Ricards Marcinkevics
Ece Ozkan
Julia E. Vogt
14
18
0
26 Jul 2022
FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute
  Classification
FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification
Xiao-Ze Lin
Seungbae Kim
Jungseock Joo
CVBM
29
38
0
22 Jul 2022
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing
  Resources
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Ji Liu
Daxiang Dong
Xi Wang
An Qin
Xingjian Li
P. Valduriez
Dejing Dou
Dianhai Yu
18
6
0
14 Jul 2022
CEG4N: Counter-Example Guided Neural Network Quantization Refinement
CEG4N: Counter-Example Guided Neural Network Quantization Refinement
J. Matos
I. Bessa
Edoardo Manino
Xidan Song
Lucas C. Cordeiro
MQ
40
2
0
09 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context
  Augmented Dialogue System: A Review
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
27
2
0
02 Jul 2022
Knowledge Distillation for Oriented Object Detection on Aerial Images
Knowledge Distillation for Oriented Object Detection on Aerial Images
Yicheng Xiao
Junpeng Zhang
ObjD
19
0
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and
  Federated Image Classification
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard E. Turner
3DH
FedML
38
27
0
17 Jun 2022
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
24
2
0
14 Jun 2022
Blueprint Separable Residual Network for Efficient Image
  Super-Resolution
Blueprint Separable Residual Network for Efficient Image Super-Resolution
Zheyu Li
Yingqi Liu
Xiangyu Chen
Haoming Cai
Jinjin Gu
Yu Qiao
Chao Dong
27
131
0
12 May 2022
Serving and Optimizing Machine Learning Workflows on Heterogeneous
  Infrastructures
Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Yongji Wu
Matthew Lentz
Danyang Zhuo
Yao Lu
21
22
0
10 May 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
11
3
0
30 Mar 2022
On Neural Network Equivalence Checking using SMT Solvers
On Neural Network Equivalence Checking using SMT Solvers
Charis Eleftheriadis
Nikolaos Kekatos
Panagiotis Katsaros
S. Tripakis
AAML
19
12
0
22 Mar 2022
Infrastructure-free, Deep Learned Urban Noise Monitoring at $\sim$100mW
Infrastructure-free, Deep Learned Urban Noise Monitoring at ∼\sim∼100mW
Jihoon Yun
Sangeeta Srivastava
Dhrubojyoti Roy
Nathan Stohs
C. Mydlarz
Mahiny A. Salman
Bea Steers
J. P. Bello
Anish Arora
17
5
0
11 Mar 2022
Update Compression for Deep Neural Networks on the Edge
Update Compression for Deep Neural Networks on the Edge
Bo Chen
A. Bakhshi
Gustavo E. A. P. A. Batista
Brian Ng
Tat-Jun Chin
22
17
0
09 Mar 2022
Online Learning for Orchestration of Inference in Multi-User
  End-Edge-Cloud Networks
Online Learning for Orchestration of Inference in Multi-User End-Edge-Cloud Networks
Sina Shahhosseini
Dongjoo Seo
A. Kanduri
Tianyi Hu
Sung-Soo Lim
Bryan Donyanavard
Amir M.Rahmani
N. Dutt
22
17
0
21 Feb 2022
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective
Xin Liu
Mingyu Yan
Lei Deng
Guoqi Li
Xiaochun Ye
Dongrui Fan
Shirui Pan
Yuan Xie
GNN
8
41
0
10 Feb 2022
Comparative assessment of federated and centralized machine learning
Comparative assessment of federated and centralized machine learning
Ibrahim Abdul Majeed
Sagar Kaushik
Aniruddha Bardhan
Venkata Siva Kumar Tadi
Hwang-Ki Min
K. Kumaraguru
Rajasekhara Reddy Duvvuru Muni
FedML
12
6
0
03 Feb 2022
Explaining Cognitive Computing Through the Information Systems Lens
Explaining Cognitive Computing Through the Information Systems Lens
S. Elnagar
Manoj A. Thomas
11
2
0
16 Jan 2022
The Effect of Model Compression on Fairness in Facial Expression
  Recognition
The Effect of Model Compression on Fairness in Facial Expression Recognition
Samuil Stoychev
Hatice Gunes
CVBM
17
19
0
05 Jan 2022
On the Use of External Data for Spoken Named Entity Recognition
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
32
16
0
14 Dec 2021
Attention-Based Model and Deep Reinforcement Learning for Distribution
  of Event Processing Tasks
Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks
A. Mazayev
F. Al-Tam
N. Correia
21
5
0
07 Dec 2021
Low-rank Tensor Decomposition for Compression of Convolutional Neural
  Networks Using Funnel Regularization
Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
Bo-Shiuan Chu
Che-Rung Lee
15
11
0
07 Dec 2021
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded
  Systems
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
Priyank Kalgaonkar
M. El-Sharkawy
3DH
17
5
0
01 Dec 2021
Nonlinear Tensor Ring Network
Nonlinear Tensor Ring Network
Xiao Peng Li
Qi Liu
Hayden Kwok-Hay So
14
0
0
12 Nov 2021
Gabor filter incorporated CNN for compression
Gabor filter incorporated CNN for compression
Akihiro Imamura
N. Arizumi
CVBM
20
2
0
29 Oct 2021
Model based Multi-agent Reinforcement Learning with Tensor
  Decompositions
Model based Multi-agent Reinforcement Learning with Tensor Decompositions
Pascal R. van der Vaart
Anuj Mahajan
Shimon Whiteson
AI4CE
16
8
0
27 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained
  Optimization
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
31
61
0
13 Oct 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient
  Inference
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
119
106
0
24 Sep 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
123
Next