Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1908.09791
Cited By
v1
v2
v3
v4
v5 (latest)
Once-for-All: Train One Network and Specialize it for Efficient Deployment
International Conference on Learning Representations (ICLR), 2019
26 August 2019
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1916★)
Papers citing
"Once-for-All: Train One Network and Specialize it for Efficient Deployment"
50 / 762 papers shown
Hardware-Algorithm Co-Optimization of Early-Exit Neural Networks for Multi-Core Edge Accelerators
Alaa Zniber
Arne Symons
Ouassim Karrakchou
Marian Verhelst
Mounir Ghogho
182
0
0
04 Dec 2025
Network of Theseus (like the ship)
Vighnesh Subramaniam
C. Conwell
Boris Katz
Andrei Barbu
Brian Cheung
150
0
0
03 Dec 2025
hls4ml: A Flexible, Open-Source Platform for Deep Learning Acceleration on Reconfigurable Hardware
Jan-Frederik Schulte
Benjamin Ramhorst
Chang Sun
Jovan Mitrevski
Nicolò Ghielmetti
...
C. Herwig
Ho Fung Tsoi
D. Rankin
Shih-Chieh Hsu
Scott Hauck
VLM
150
2
0
01 Dec 2025
AutoTailor: Automatic and Efficient Adaptive Model Deployment for Diverse Edge Devices
M. Liu
Chenyu Lu
H. Tian
Fang Dong
Ruiting Zhou
Wei Wang
Dian Shen
Guangtong Li
Ye Wan
Li Li
111
0
0
27 Nov 2025
CrypTorch: PyTorch-based Auto-tuning Compiler for Machine Learning with Multi-party Computation
Jinyu Liu
Gang Tan
Kiwan Maeng
130
0
0
24 Nov 2025
AdaPerceiver: Transformers with Adaptive Width, Depth, and Tokens
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiruvathukal
Yung-Hsiang Lu
James C. Davis
173
0
0
22 Nov 2025
Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
Longhua Li
Lei Qi
Xin Geng
ViT
155
1
0
12 Nov 2025
CompressNAS : A Fast and Efficient Technique for Model Compression using Decomposition
Sudhakar Sah
Nikhil Chabbra
Matthieu Durnerin
143
0
0
12 Nov 2025
Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff
Junchen Liu
Yi Sheng
109
0
0
11 Nov 2025
Slimmable NAM: Neural Amp Models with adjustable runtime computational cost
Steven Atkinson
84
0
0
08 Nov 2025
Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification
Mikhael Djajapermana
Moritz Reiber
Daniel Mueller-Gritschneder
Ulf Schlichtmann
ViT
140
0
0
04 Nov 2025
From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models
Ziyan Wang
Enmao Diao
Qi Le
Pu Wang
Minwoo Lee
Shu-ping Yeh
Evgeny Stupachenko
Hao Feng
Li Yang
172
2
0
20 Oct 2025
Elastic ViTs from Pretrained Models without Retraining
Walter Simoncini
Michael Dorkenwald
Tijmen Blankevoort
Cees G. M. Snoek
Yuki Markus Asano
VLM
189
0
0
20 Oct 2025
Spiking Neural Network Architecture Search: A Survey
Kama Svoboda
Tosiron Adegbija
228
0
0
16 Oct 2025
Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis
Meihui Zhang
Liming Wang
C. Zhang
Zhaojing Luo
137
1
0
14 Oct 2025
Optimally Deep Networks - Adapting Model Depth to Datasets for Superior Efficiency
Shaharyar Ahmed Khan Tareen
Filza Khan Tareen
AI4CE
338
0
0
12 Oct 2025
Slim Scheduler: A Runtime-Aware RL and Scheduler System for Efficient CNN Inference
Ian Harshbarger
Calvin Chidambaram
149
0
0
10 Oct 2025
PlatformX: An End-to-End Transferable Platform for Energy-Efficient Neural Architecture Search
Xiaolong Tu
Dawei Chen
Kyungtae Han
Onur Altintas
Haoxin Wang
180
0
0
10 Oct 2025
Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation
Arjun Krishnakumar
R. Sukthanker
Hannan Javed Mahadik
Gabriela Kadlecová
Vladyslav Moroshan
Timur Carstensen
Frank Hutter
Aaron Klein
185
0
0
08 Oct 2025
LLM-NAS: LLM-driven Hardware-Aware Neural Architecture Search
Hengyi Zhu
Grace Li Zhang
Shaoyi Huang
447
0
0
01 Oct 2025
CIMNAS: A Joint Framework for Compute-In-Memory-Aware Neural Architecture Search
Olga Krestinskaya
M. Fouda
Ahmed M. Eltawil
K. Salama
143
1
0
30 Sep 2025
Regression Language Models for Code
Yash Akhauri
Xingyou Song
Arissa Wongpanich
Bryan Lewandowski
Mohamed S. Abdelfattah
250
5
0
30 Sep 2025
CoLLM-NAS: Collaborative Large Language Models for Efficient Knowledge-Guided Neural Architecture Search
Zhe Li
Zhiwei Lin
Yongtao Wang
243
1
0
30 Sep 2025
RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Shouren Mao
Minghao Qin
W. Dong
Huajian Liu
Yongzhuo Gao
162
1
0
25 Sep 2025
Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models
Paulius Rauba
M. Schaar
199
2
0
22 Sep 2025
SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training
Shaharyar Ahmed Khan Tareen
Lei Fan
Xiaojing Yuan
Qin Lin
Bin Hu
148
0
0
20 Sep 2025
RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation
Davide Ettori
Nastaran Darabi
Sureshkumar Senthilkumar
A. R. Trivedi
292
2
0
19 Sep 2025
SAR-NAS: Lightweight SAR Object Detection with Neural Architecture Search
Xinyi Yu
Zhiwei Lin
Yongtao Wang
132
0
0
01 Sep 2025
CoFormer: Collaborating with Heterogeneous Edge Devices for Scalable Transformer Inference
IEEE transactions on computers (IEEE Trans. Comput.), 2025
Guanyu Xu
Zhiwei Hao
Li Shen
Yong Luo
Fuhui Sun
Xiaoyan Wang
Han Hu
Yonggang Wen
200
2
0
28 Aug 2025
Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks
Muhammad Ahmed Mohsin
Junaid Ahmad
Muhammad Hamza Nawaz
Muhammad Ali Jamshed
208
0
0
27 Aug 2025
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
Yuxian Gu
Qinghao Hu
Shang Yang
Haocheng Xi
Junyu Chen
Song Han
Han Cai
314
19
0
21 Aug 2025
Formal Algorithms for Model Efficiency
Naman Tyagi
Srishti Das
Kunal
Vatsal Gupta
183
0
0
19 Aug 2025
SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML
Ismail Lamaakal
Chaymae Yahyati
Khalid El Makkaoui
Ibrahim Ouahbi
Yassine Maleh
UQCV
MQ
247
2
0
18 Aug 2025
Dextr: Zero-Shot Neural Architecture Search with Singular Value Decomposition and Extrinsic Curvature
Rohan Asthana
Joschua Conrad
M. Ortmanns
Vasileios Belagiannis
175
1
0
18 Aug 2025
Designing Object Detection Models for TinyML: Foundations, Comparative Analysis, Challenges, and Emerging Solutions
ACM Computing Surveys (ACM Comput. Surv.), 2025
Christophe El Zeinaty
W. Hamidouche
Glenn Herrou
D. Ménard
ObjD
200
0
0
11 Aug 2025
Slice or the Whole Pie? Utility Control for AI Models
Ye Tao
AAML
134
0
0
06 Aug 2025
ESM: A Framework for Building Effective Surrogate Models for Hardware-Aware Neural Architecture Search
Design Automation Conference (DAC), 2025
Azaz-Ur-Rehman Nasir
Samroz Ahmad Shoaib
Muhammad Abdullah Hanif
Muhammad Shafique
203
0
0
02 Aug 2025
Coflex: Enhancing HW-NAS with Sparse Gaussian Processes for Efficient and Scalable DNN Accelerator Design
Yinhui Ma
Tomomasa Yamasaki
Zhehui Wang
Tao Luo
Bo Wang
307
0
0
31 Jul 2025
Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures
International Symposium on Service Oriented Software Engineering (ISSOSE), 2025
Yashasvi Makin
Rahul Maliakkal
180
1
0
28 Jul 2025
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
Chen Zhu
Wangbo Zhao
Huiwen Zhang
Samir Khaki
Yuhao Zhou
...
Zhihang Yuan
Yuzhang Shang
Xiaojiang Peng
Kai Wang
Dawei Yang
229
4
0
25 Jul 2025
ACME: Adaptive Customization of Large Models via Distributed Systems
IEEE International Conference on Distributed Computing Systems (ICDCS), 2025
Ziming Dai
Chao Qiu
Fei Gao
Yunfeng Zhao
Xiaofei Wang
339
1
0
20 Jul 2025
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
A. Hojjat
Janek Haberer
Soren Pirk
Olaf Landsiedel
LRM
267
3
0
14 Jul 2025
Zero-Shot Neural Architecture Search with Weighted Response Correlation
Kun Jing
Luoyu Chen
Jungang Xu
Jianwei Tai
Yiyu Wang
Shuaimin Li
235
3
0
08 Jul 2025
DANCE: Resource-Efficient Neural Architecture Search with Data-Aware and Continuous Adaptation
Xinjian Zhao
Tianshuo Wei
Sheng Zhang
Ruocheng Guo
Wanyu Wang
Shanshan Ye
Lixin Zou
Xuetao Wei
Xiangyu Zhao
TTA
335
3
0
07 Jul 2025
XTransfer: Modality-Agnostic Few-Shot Model Transfer for Human Sensing at the Edge
Yu Zhang
Xi Zhang
Hualin zhou
Xinyuan Chen
Shang Gao
Hong Jia
Jianfei Yang
Yuankai Qi
Tao Gu
237
0
0
28 Jun 2025
ProARD: progressive adversarial robustness distillation: provide wide range of robust students
Seyedhamidreza Mousavi
Seyedali Mousavi
Masoud Daneshtalab
AAML
315
0
0
09 Jun 2025
Loss Functions for Predictor-based Neural Architecture Search
Han Ji
Yuqi Feng
Jiahao Fan
Yanan Sun
207
0
0
06 Jun 2025
EfficientQuant: An Efficient Post-Training Quantization for CNN-Transformer Hybrid Models on Edge Devices
Shaibal Saha
Lanyu Xu
MQ
268
1
0
05 Jun 2025
FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting
Computer Vision and Pattern Recognition (CVPR), 2025
Hengyu Liu
Yuehao Wang
Chenxin Li
Ruisi Cai
Kevin Wang
Wuyang Li
Pavlo Molchanov
Peihao Wang
Zinan Lin
3DGS
335
4
0
04 Jun 2025
CARL: Causality-guided Architecture Representation Learning for an Interpretable Performance Predictor
Han Ji
Yuqi Feng
Jiahao Fan
Yanan Sun
OOD
CML
263
0
0
04 Jun 2025
1
2
3
4
...
14
15
16
Next
Page 1 of 16
Page
of 16
Go