
Once-for-All: Train One Network and Specialize it for Efficient Deployment

International Conference on Learning Representations (ICLR), 2020

26 August 2019

Chuang Gan

Song Han


Papers citing "Once-for-All: Train One Network and Specialize it for Efficient Deployment"

50 / 762 papers shown

Hardware-Algorithm Co-Optimization of Early-Exit Neural Networks for Multi-Core Edge Accelerators

182

04 Dec 2025

Network of Theseus (like the ship)

150

03 Dec 2025

hls4ml: A Flexible, Open-Source Platform for Deep Learning Acceleration on Reconfigurable Hardware

...

150

01 Dec 2025

AutoTailor: Automatic and Efficient Adaptive Model Deployment for Diverse Edge Devices

111

27 Nov 2025

CrypTorch: PyTorch-based Auto-tuning Compiler for Machine Learning with Multi-party Computation

Jinyu Liu

Gang Tan

Kiwan Maeng

130

24 Nov 2025

AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens

Purvish Jajal

Nick Eliopoulos

Benjamin Shiue-Hal Chou

George K. Thiruvathukal

Yung-Hsiang Lu

James C. Davis

173

22 Nov 2025

Stratified Knowledge-Density Super-Network for Scalable Vision Transformers

155

12 Nov 2025

CompressNAS : A Fast and Efficient Technique for Model Compression using Decomposition

Sudhakar Sah

Nikhil Chabbra

Matthieu Durnerin

143

12 Nov 2025

Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff

Junchen Liu

Yi Sheng

109

11 Nov 2025

Slimmable NAM: Neural Amp Models with adjustable runtime computational cost

Steven Atkinson

08 Nov 2025

Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification

Mikhael Djajapermana

Moritz Reiber

Daniel Mueller-Gritschneder

Ulf Schlichtmann

ViT

140

04 Nov 2025

From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models

172

20 Oct 2025

Elastic ViTs from Pretrained Models without Retraining

189

20 Oct 2025

Spiking Neural Network Architecture Search: A Survey

Kama Svoboda

Tosiron Adegbija

228

16 Oct 2025

Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis

137

14 Oct 2025

Optimally Deep Networks - Adapting Model Depth to Datasets for Superior Efficiency

Shaharyar Ahmed Khan Tareen

Filza Khan Tareen

AI4CE

338

12 Oct 2025

Slim Scheduler: A Runtime-Aware RL and Scheduler System for Efficient CNN Inference

Ian Harshbarger

Calvin Chidambaram

149

10 Oct 2025

PlatformX: An End-to-End Transferable Platform for Energy-Efficient Neural Architecture Search

180

10 Oct 2025

Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation

185

08 Oct 2025

LLM-NAS: LLM-driven Hardware-Aware Neural Architecture Search

Hengyi Zhu

Grace Li Zhang

Shaoyi Huang

447

01 Oct 2025

CIMNAS: A Joint Framework for Compute-In-Memory-Aware Neural Architecture Search

143

30 Sep 2025

Regression Language Models for Code

Mohamed S. Abdelfattah

250

30 Sep 2025

CoLLM-NAS: Collaborative Large Language Models for Efficient Knowledge-Guided Neural Architecture Search

Zhe Li

Zhiwei Lin

Yongtao Wang

243

30 Sep 2025

RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

162

25 Sep 2025

Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models

Paulius Rauba

M. Schaar

199

22 Sep 2025

SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training

Shaharyar Ahmed Khan Tareen

148

20 Sep 2025

RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation

Davide Ettori

Nastaran Darabi

Sureshkumar Senthilkumar

A. R. Trivedi

292

19 Sep 2025

SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search

Xinyi Yu

Zhiwei Lin

Yongtao Wang

132

01 Sep 2025

CoFormer: Collaborating with Heterogeneous Edge Devices for Scalable Transformer Inference

IEEE Transactions on Computers (IEEE Trans. Comput.), 2025

200

28 Aug 2025

Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks

Muhammad Ahmed Mohsin

Junaid Ahmad

Muhammad Hamza Nawaz

Muhammad Ali Jamshed

208

27 Aug 2025

Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search

314

21 Aug 2025

Formal Algorithms for Model Efficiency

183

19 Aug 2025

SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML

247

18 Aug 2025

Dextr: Zero-Shot Neural Architecture Search with Singular Value Decomposition and Extrinsic Curvature

Rohan Asthana

Joschua Conrad

M. Ortmanns

Vasileios Belagiannis

175

18 Aug 2025

Designing Object Detection Models for TinyML: Foundations, Comparative Analysis, Challenges, and Emerging Solutions

ACM Computing Surveys (ACM Comput. Surv.), 2025

Christophe El Zeinaty

200

11 Aug 2025

Slice or the Whole Pie? Utility Control for AI Models

Ye Tao

AAML

134

06 Aug 2025

ESM: A Framework for Building Effective Surrogate Models for Hardware-Aware Neural Architecture Search

Design Automation Conference (DAC), 2025

Azaz-Ur-Rehman Nasir

Samroz Ahmad Shoaib

Muhammad Abdullah Hanif

Muhammad Shafique

203

02 Aug 2025

Coflex: Enhancing HW-NAS with Sparse Gaussian Processes for Efficient and Scalable DNN Accelerator Design

307

31 Jul 2025

Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures

International Symposium on Service Oriented Software Engineering (ISSOSE), 2025

Yashasvi Makin

Rahul Maliakkal

180

28 Jul 2025

EA-ViT: Efficient Adaptation for Elastic Vision Transformer

...

229

25 Jul 2025

ACME: Adaptive Customization of Large Models via Distributed Systems

IEEE International Conference on Distributed Computing Systems (ICDCS), 2025

339

20 Jul 2025

ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference

267

14 Jul 2025

Zero-Shot Neural Architecture Search with Weighted Response Correlation

235

08 Jul 2025

DANCE: Resource-Efficient Neural Architecture Search with Data-Aware and Continuous Adaptation

335

07 Jul 2025

XTransfer: Modality-Agnostic Few-Shot Model Transfer for Human Sensing at the Edge

237

28 Jun 2025

ProARD: progressive adversarial robustness distillation: provide wide range of robust students

Seyedhamidreza Mousavi

Seyedali Mousavi

Masoud Daneshtalab

AAML

315

09 Jun 2025

Loss Functions for Predictor-based Neural Architecture Search

207

06 Jun 2025

EfficientQuant: An Efficient Post-Training Quantization for CNN-Transformer Hybrid Models on Edge Devices

Shaibal Saha

Lanyu Xu

268

05 Jun 2025

FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting

Computer Vision and Pattern Recognition (CVPR), 2025

335

04 Jun 2025

CARL: Causality-guided Architecture Representation Learning for an Interpretable Performance Predictor

263

04 Jun 2025