ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.04493
  4. Cited By
Dynamic Network Surgery for Efficient DNNs
v1v2 (latest)

Dynamic Network Surgery for Efficient DNNs

Neural Information Processing Systems (NeurIPS), 2016
16 August 2016
Yiwen Guo
Anbang Yao
Yurong Chen
ArXiv (abs)PDFHTMLGithub (186★)

Papers citing "Dynamic Network Surgery for Efficient DNNs"

50 / 476 papers shown
Efficient Neural Networks with Discrete Cosine Transform Activations
Efficient Neural Networks with Discrete Cosine Transform Activations
Marc Martinez-Gost
Sara Pepe
Ana I. Pérez-Neira
M. Lagunas
99
0
0
05 Nov 2025
Fast and accurate neural reflectance transformation imaging through knowledge distillation
Fast and accurate neural reflectance transformation imaging through knowledge distillation
Tinsae G. Dulecha
Leonardo Righetto
Ruggero Pintus
Enrico Gobbetti
Andrea Giachetti
3DH
297
2
0
28 Oct 2025
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
Shiva Sreeram
Alaa Maalouf
Pratyusha Sharma
Daniela Rus
151
0
0
23 Oct 2025
Convergence, design and training of continuous-time dropout as a random batch method
Convergence, design and training of continuous-time dropout as a random batch method
Antonio Álvarez-López
Martín Hernández
133
0
0
15 Oct 2025
SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions
SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions
Ziyi Wang
Nan Jiang
Guang Lin
Qifan Song
MQ
254
0
0
10 Oct 2025
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
Weiyu Huang
Yuezhou Hu
Jun Zhu
Jianfei Chen
CLL
142
0
0
30 Sep 2025
DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models
DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models
Komal Kumar
Rao Muhammad Anwer
Fahad Shahbaz Khan
Salman Khan
Ivan Laptev
Hisham Cholakkal
139
0
0
26 Sep 2025
Efficient Reinforcement Learning by Reducing Forgetting with Elephant Activation Functions
Efficient Reinforcement Learning by Reducing Forgetting with Elephant Activation Functions
Qingfeng Lan
Gautham Vasan
A. R. Mahmood
CLL
166
0
0
23 Sep 2025
LinDeps: A Fine-tuning Free Post-Pruning Method to Remove Layer-Wise Linear Dependencies with Guaranteed Performance Preservation
LinDeps: A Fine-tuning Free Post-Pruning Method to Remove Layer-Wise Linear Dependencies with Guaranteed Performance Preservation
Maxim Henry
Adrien Deliège
A. Cioppa
Marc Van Droogenbroeck
VLM
194
0
0
29 Jul 2025
Knowledge Grafting: A Mechanism for Optimizing AI Model Deployment in Resource-Constrained Environments
Knowledge Grafting: A Mechanism for Optimizing AI Model Deployment in Resource-Constrained Environments
Osama Almurshed
Ashish Kaushal
Asmail Muftah
Nitin Auluck
Omer Rana
262
1
0
25 Jul 2025
Search-Optimized Quantization in Biomedical Ontology Alignment
Search-Optimized Quantization in Biomedical Ontology AlignmentFrontiers in Artificial Intelligence (Front. Artif. Intell.), 2025
Oussama Bouaggad
Natalia Grabar
MQ
227
0
0
18 Jul 2025
A geometric framework for momentum-based optimizers for low-rank training
A geometric framework for momentum-based optimizers for low-rank training
Steffen Schotthöfer
Timon Klein
J. Kusch
AI4CE
269
2
0
20 Jun 2025
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
Samin Yeasar Arnob
Scott Fujimoto
Doina Precup
OffRL
286
0
0
20 Jun 2025
Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum
Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum
Caleb Zheng
Eli Shlizerman
215
1
0
09 Jun 2025
Frugal Machine Learning for Energy-efficient, and Resource-aware Artificial Intelligence
Frugal Machine Learning for Energy-efficient, and Resource-aware Artificial Intelligence
John Violos
Konstantina-Christina Diamanti
I. Kompatsiaris
Symeon Papadopoulos
261
2
0
02 Jun 2025
Global Minimizers of $\ell^p$-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Global Minimizers of ℓp\ell^pℓp-Regularized Objectives Yield the Sparsest ReLU Neural Networks
Julia B. Nakhleh
Robert D. Nowak
351
0
0
27 May 2025
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
Shiwei Ding
Lan Zhang
Zhenlin Wang
Giuseppe Ateniese
Xiaoyong Yuan
259
0
0
16 Apr 2025
Generative Artificial Intelligence for Internet of Things Computing: A Systematic Survey
Generative Artificial Intelligence for Internet of Things Computing: A Systematic Survey
Fabrizio Mangione
Claudio Savaglio
Giancarlo Fortino
330
7
0
10 Apr 2025
Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks
Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks
Yangqi Feng
S. J. Lin
Baoyuan Gao
Xian Wei
AAML
368
2
0
26 Mar 2025
Optimal Brain Apoptosis
Optimal Brain ApoptosisInternational Conference on Learning Representations (ICLR), 2025
Mingyuan Sun
Zheng Fang
Jiaxu Wang
Junjie Jiang
Delei Kong
Chenming Hu
Yuetong Fang
Zhanchen Zhu
AAML
487
3
0
25 Feb 2025
HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs
HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs
Mehdi Makni
Kayhan Behdin
Zheng Xu
Natalia Ponomareva
Rahul Mazumder
186
1
0
02 Feb 2025
Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning
Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning
Laura Puccioni
Alireza Farshin
Mariano Scazzariello
Changjie Wang
Marco Chiesa
Dejan Kostic
258
1
0
10 Jan 2025
Pruning-based Data Selection and Network Fusion for Efficient Deep Learning
Pruning-based Data Selection and Network Fusion for Efficient Deep Learning
Humaira Kousar
Hasnain Irshad Bhatti
Jaekyun Moon
428
1
0
03 Jan 2025
On the Compression of Language Models for Code: An Empirical Study on
  CodeBERT
On the Compression of Language Models for Code: An Empirical Study on CodeBERTIEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2024
Giordano dÁloisio
Luca Traini
Federica Sarro
A. Marco
284
8
0
18 Dec 2024
MOFHEI: Model Optimizing Framework for Fast and Efficient
  Homomorphically Encrypted Neural Network Inference
MOFHEI: Model Optimizing Framework for Fast and Efficient Homomorphically Encrypted Neural Network InferenceInternational Conference on Trust, Privacy and Security in Intelligent Systems and Applications (ICPSISA), 2024
Parsa Ghazvinian
Robert Podschwadt
Prajwal Panzade
Mohammad H. Rafiei
Daniel Takabi
293
1
0
10 Dec 2024
Efficient Model Compression for Bayesian Neural Networks
Efficient Model Compression for Bayesian Neural Networks
Diptarka Saha
Zihe Liu
Feng Liang
BDL
278
0
0
01 Nov 2024
Neuralink: Fast LLM Inference on Smartphones with Neuron Co-Activation Linking
Neuralink: Fast LLM Inference on Smartphones with Neuron Co-Activation Linking
Tuowei Wang
Ruwen Fan
Minxing Huang
Zixu Hao
Kun Li
Ting Cao
Youyou Lu
Yaoxue Zhang
Ju Ren
421
2
0
25 Oct 2024
GeoLoRA: Geometric integration for parameter efficient fine-tuning
GeoLoRA: Geometric integration for parameter efficient fine-tuningInternational Conference on Learning Representations (ICLR), 2024
Steffen Schotthöfer
Emanuele Zangrando
Gianluca Ceruti
Francesco Tudisco
J. Kusch
AI4CE
285
7
0
24 Oct 2024
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models
Yuqi Li
Yao Lu
Zhihong Zhu
Chuanguang Yang
Yihao Chen
Xin Yin
Yihao Chen
Jianping Gou
Yingli Tian
T. Huang
255
37
0
14 Oct 2024
CNN Mixture-of-Depths
CNN Mixture-of-DepthsAsian Conference on Computer Vision (ACCV), 2024
Rinor Cakaj
Jens Mehnert
Bin Yang
306
2
0
25 Sep 2024
Learning effective pruning at initialization from iterative pruning
Learning effective pruning at initialization from iterative pruning
Shengkai Liu
Yaofeng Cheng
Fusheng Zha
Wei Guo
Lining Sun
Zhenshan Bing
Chenguang Yang
371
2
0
27 Aug 2024
An Effective Information Theoretic Framework for Channel Pruning
An Effective Information Theoretic Framework for Channel Pruning
Yihao Chen
Zefang Wang
347
12
0
14 Aug 2024
LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid
LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid
Tianyi Zhang
Anshumali Shrivastava
MQ
386
0
0
14 Jul 2024
MagMax: Leveraging Model Merging for Seamless Continual Learning
MagMax: Leveraging Model Merging for Seamless Continual Learning
Daniel Marczak
Bartłomiej Twardowski
Tomasz Trzciñski
Sebastian Cygert
MoMeCLL
250
55
0
08 Jul 2024
Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm
  and Compiler Co-Design
Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design
Gen Li
Zhihao Shu
Jie Ji
Minghai Qin
Fatemeh Afghah
Wei Niu
Xiaolong Ma
SupR
395
1
0
03 Jul 2024
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Xiang Meng
Kayhan Behdin
Haoyue Wang
Rahul Mazumder
333
14
0
12 Jun 2024
Sparse Bayesian Networks: Efficient Uncertainty Quantification in
  Medical Image Analysis
Sparse Bayesian Networks: Efficient Uncertainty Quantification in Medical Image Analysis
Zeinab Abboud
Herve Lombaert
Samuel Kadoury
UQCV
278
6
0
11 Jun 2024
Reinforced Compressive Neural Architecture Search for Versatile
  Adversarial Robustness
Reinforced Compressive Neural Architecture Search for Versatile Adversarial Robustness
Dingrong Wang
Hitesh Sapkota
Zhiqiang Tao
Qi Yu
AAML
281
3
0
10 Jun 2024
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank ReductionInternational Conference on Learning Representations (ICLR), 2024
Shangyu Chen
Zizheng Pan
Jianfei Cai
Dinh Q. Phung
258
6
0
09 Jun 2024
Quantifying Task Priority for Multi-Task Optimization
Quantifying Task Priority for Multi-Task Optimization
Wooseong Jeong
Kuk-Jin Yoon
339
13
0
05 Jun 2024
Diverse Subset Selection via Norm-Based Sampling and Orthogonality
Diverse Subset Selection via Norm-Based Sampling and Orthogonality
Noga Bar
Raja Giryes
CVBM
421
1
0
03 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
451
22
0
31 May 2024
Data-independent Module-aware Pruning for Hierarchical Vision
  Transformers
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
319
11
0
21 Apr 2024
Lightweight Deep Learning for Resource-Constrained Environments: A
  Survey
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
428
206
0
08 Apr 2024
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
Dhananjay Saikumar
Blesson Varghese
273
4
0
01 Apr 2024
Separate, Dynamic and Differentiable (SMART) Pruner for Block/Output
  Channel Pruning on Computer Vision Tasks
Separate, Dynamic and Differentiable (SMART) Pruner for Block/Output Channel Pruning on Computer Vision Tasks
Guanhua Ding
Zexi Ye
Zhen Zhong
Gang Li
David Shao
231
1
0
29 Mar 2024
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV
  Caching
ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching
Youpeng Zhao
Di Wu
Jun Wang
302
58
0
26 Mar 2024
OSSCAR: One-Shot Structured Pruning in Vision and Language Models with
  Combinatorial Optimization
OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization
Xiang Meng
Shibal Ibrahim
Kayhan Behdin
Hussein Hazimeh
Natalia Ponomareva
Rahul Mazumder
VLM
442
15
0
02 Mar 2024
MGE: A Training-Free and Efficient Model Generation and Enhancement
  Scheme
MGE: A Training-Free and Efficient Model Generation and Enhancement Scheme
Xuan Wang
Zeshan Pang
Yuliang Lu
Xuehu Yan
162
0
0
27 Feb 2024
NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning
NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning
Dhananjay Saikumar
Blesson Varghese
277
2
0
21 Feb 2024
1234...8910
Next
Page 1 of 10