Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.09282
Cited By
A Survey of Model Compression and Acceleration for Deep Neural Networks
23 October 2017
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey of Model Compression and Acceleration for Deep Neural Networks"
50 / 111 papers shown
Title
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
48
3
0
04 Apr 2025
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
130
0
0
29 Oct 2024
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
Rickard Brüel-Gabrielsson
Jiacheng Zhu
Onkar Bhardwaj
Leshem Choshen
Kristjan Greenewald
Mikhail Yurochkin
Justin Solomon
38
5
0
17 Jun 2024
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
29
2
0
26 Mar 2024
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Jixuan Leng
Yijiang Li
Haohan Wang
VLM
29
0
0
26 Nov 2023
Federated learning compression designed for lightweight communications
Lucas Grativol Ribeiro
Mathieu Léonardon
Guillaume Muller
Virginie Fresse
Matthieu Arzel
FedML
25
3
0
23 Oct 2023
Language Modeling Is Compression
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
...
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
AI4CE
30
129
0
19 Sep 2023
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
Habib Hajimolahoseini
Walid Ahmed
Yang Liu
OffRL
MQ
19
6
0
07 Sep 2023
Neural Networks at a Fraction with Pruned Quaternions
Sahel Mohammad Iqbal
Subhankar Mishra
22
4
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
25
3
0
13 Aug 2023
Accelerating Distributed ML Training via Selective Synchronization
S. Tyagi
Martin Swany
FedML
24
3
0
16 Jul 2023
Proximity to Losslessly Compressible Parameters
Matthew Farrugia-Roberts
30
0
0
05 Jun 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
31
91
0
20 Apr 2023
Domain Adaptation for Inertial Measurement Unit-based Human Activity Recognition: A Survey
Avijoy Chakma
A. Faridee
Indrajeet Ghosh
Nirmalya Roy
16
4
0
07 Apr 2023
Learning to Zoom and Unzoom
Chittesh Thavamani
Mengtian Li
Francesco Ferroni
Deva Ramanan
17
8
0
27 Mar 2023
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
...
Shahin Amiriparian
K. D. Bartl-Pokorny
A. Batliner
Florian B. Pokorny
Björn W. Schuller
39
7
0
25 Jan 2023
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
16
3
0
17 Dec 2022
Unbiased Knowledge Distillation for Recommendation
Gang Chen
Jiawei Chen
Fuli Feng
Sheng Zhou
Xiangnan He
19
27
0
27 Nov 2022
Efficient Incremental Text-to-Speech on GPUs
Muyang Du
Chuan Liu
Jiaxing Qi
Junjie Lai
11
1
0
25 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL
MLT
AI4CE
19
51
0
24 Nov 2022
R-WhONet: Recalibrated Wheel Odometry Neural Network for Vehicular Positioning using Transfer Learning
Uche Onyekpe
Alicja Szkolnik
Vasile Palade
S. Kanarachos
M. Fitzpatrick
30
1
0
13 Sep 2022
Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing Methods
Ricards Marcinkevics
Ece Ozkan
Julia E. Vogt
14
18
0
26 Jul 2022
FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification
Xiao-Ze Lin
Seungbae Kim
Jungseock Joo
CVBM
29
38
0
22 Jul 2022
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Ji Liu
Daxiang Dong
Xi Wang
An Qin
Xingjian Li
P. Valduriez
Dejing Dou
Dianhai Yu
18
6
0
14 Jul 2022
CEG4N: Counter-Example Guided Neural Network Quantization Refinement
J. Matos
I. Bessa
Edoardo Manino
Xidan Song
Lucas C. Cordeiro
MQ
40
2
0
09 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
27
2
0
02 Jul 2022
Knowledge Distillation for Oriented Object Detection on Aerial Images
Yicheng Xiao
Junpeng Zhang
ObjD
19
0
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard E. Turner
3DH
FedML
38
27
0
17 Jun 2022
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
24
2
0
14 Jun 2022
Blueprint Separable Residual Network for Efficient Image Super-Resolution
Zheyu Li
Yingqi Liu
Xiangyu Chen
Haoming Cai
Jinjin Gu
Yu Qiao
Chao Dong
27
131
0
12 May 2022
Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Yongji Wu
Matthew Lentz
Danyang Zhuo
Yao Lu
21
22
0
10 May 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
11
3
0
30 Mar 2022
On Neural Network Equivalence Checking using SMT Solvers
Charis Eleftheriadis
Nikolaos Kekatos
Panagiotis Katsaros
S. Tripakis
AAML
19
12
0
22 Mar 2022
Infrastructure-free, Deep Learned Urban Noise Monitoring at
∼
\sim
∼
100mW
Jihoon Yun
Sangeeta Srivastava
Dhrubojyoti Roy
Nathan Stohs
C. Mydlarz
Mahiny A. Salman
Bea Steers
J. P. Bello
Anish Arora
17
5
0
11 Mar 2022
Update Compression for Deep Neural Networks on the Edge
Bo Chen
A. Bakhshi
Gustavo E. A. P. A. Batista
Brian Ng
Tat-Jun Chin
22
17
0
09 Mar 2022
Online Learning for Orchestration of Inference in Multi-User End-Edge-Cloud Networks
Sina Shahhosseini
Dongjoo Seo
A. Kanduri
Tianyi Hu
Sung-Soo Lim
Bryan Donyanavard
Amir M.Rahmani
N. Dutt
22
17
0
21 Feb 2022
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective
Xin Liu
Mingyu Yan
Lei Deng
Guoqi Li
Xiaochun Ye
Dongrui Fan
Shirui Pan
Yuan Xie
GNN
8
41
0
10 Feb 2022
Comparative assessment of federated and centralized machine learning
Ibrahim Abdul Majeed
Sagar Kaushik
Aniruddha Bardhan
Venkata Siva Kumar Tadi
Hwang-Ki Min
K. Kumaraguru
Rajasekhara Reddy Duvvuru Muni
FedML
12
6
0
03 Feb 2022
Explaining Cognitive Computing Through the Information Systems Lens
S. Elnagar
Manoj A. Thomas
11
2
0
16 Jan 2022
The Effect of Model Compression on Fairness in Facial Expression Recognition
Samuil Stoychev
Hatice Gunes
CVBM
17
19
0
05 Jan 2022
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
32
16
0
14 Dec 2021
Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks
A. Mazayev
F. Al-Tam
N. Correia
21
5
0
07 Dec 2021
Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
Bo-Shiuan Chu
Che-Rung Lee
15
11
0
07 Dec 2021
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
Priyank Kalgaonkar
M. El-Sharkawy
3DH
17
5
0
01 Dec 2021
Nonlinear Tensor Ring Network
Xiao Peng Li
Qi Liu
Hayden Kwok-Hay So
14
0
0
12 Nov 2021
Gabor filter incorporated CNN for compression
Akihiro Imamura
N. Arizumi
CVBM
20
2
0
29 Oct 2021
Model based Multi-agent Reinforcement Learning with Tensor Decompositions
Pascal R. van der Vaart
Anuj Mahajan
Shimon Whiteson
AI4CE
16
8
0
27 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
31
61
0
13 Oct 2021
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
Sneha Kudugunta
Yanping Huang
Ankur Bapna
M. Krikun
Dmitry Lepikhin
Minh-Thang Luong
Orhan Firat
MoE
119
106
0
24 Sep 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
1
2
3
Next