ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.09791
  4. Cited By
Once-for-All: Train One Network and Specialize it for Efficient
  Deployment

Once-for-All: Train One Network and Specialize it for Efficient Deployment

26 August 2019
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
    OOD
ArXivPDFHTML

Papers citing "Once-for-All: Train One Network and Specialize it for Efficient Deployment"

50 / 239 papers shown
Title
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers
S. Casarin
Sergio Escalera
O. Lanz
32
0
0
12 May 2025
How to Train Your Metamorphic Deep Neural Network
How to Train Your Metamorphic Deep Neural Network
Thomas Sommariva
Simone Calderara
Angelo Porrello
26
0
0
07 May 2025
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
ORXE: Orchestrating Experts for Dynamically Configurable Efficiency
Qingyuan Wang
Guoxin Wang
B. Cardiff
Deepu John
38
0
0
07 May 2025
MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search
MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search
Lotfi Abdelkrim Mecharbat
Ibrahim Almakky
Martin Takac
Mohammad Yaqub
30
0
0
22 Apr 2025
Your Image Generator Is Your New Private Dataset
Your Image Generator Is Your New Private Dataset
Nicolo Resmini
Eugenio Lomurno
Cristian Sbrolli
Matteo Matteucci
26
0
0
06 Apr 2025
Benchmarking Ultra-Low-Power $μ$NPUs
Benchmarking Ultra-Low-Power μμμNPUs
Josh Millar
Yushan Huang
Sarab Sethi
Hamed Haddadi
Anil Madhavapeddy
BDL
56
0
0
28 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
50
0
0
08 Mar 2025
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
Benyamin Jamialahmadi
Parsa Kavehzadeh
Mehdi Rezagholizadeh
Parsa Farinneya
Hossein Rajabzadeh
A. Jafari
Boxing Chen
Marzieh S. Tahaei
42
0
0
06 Mar 2025
Forget the Data and Fine-Tuning! Just Fold the Network to Compress
Forget the Data and Fine-Tuning! Just Fold the Network to Compress
Dong Wang
Haris Šikić
Lothar Thiele
O. Saukh
56
0
0
17 Feb 2025
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
Xingrun Xing
Zheng Liu
Shitao Xiao
Boyan Gao
Yiming Liang
Wanpeng Zhang
Haokun Lin
Guoqi Li
Jiajun Zhang
LRM
61
1
0
10 Feb 2025
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Youpeng Zhao
Ming Lin
Huadong Tang
Qiang Wu
Jun Wang
80
0
0
28 Jan 2025
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts
Danyal Aftab
Steven Davy
ALM
49
0
0
10 Jan 2025
Mixture of Efficient Diffusion Experts Through Automatic Interval and
  Sub-Network Selection
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
32
2
0
23 Sep 2024
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning
Tianyi Chen
Xiaoyi Qu
David Aponte
Colby R. Banbury
Jongwoo Ko
Tianyu Ding
Yong Ma
Vladimir Lyapunov
Ilya Zharkov
Luming Liang
80
1
0
11 Sep 2024
NASH: Neural Architecture and Accelerator Search for
  Multiplication-Reduced Hybrid Models
NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models
Yang Xu
Huihong Shi
Zhongfeng Wang
37
0
0
07 Sep 2024
Combining Neural Architecture Search and Automatic Code Optimization: A
  Survey
Combining Neural Architecture Search and Automatic Code Optimization: A Survey
Inas Bachiri
Hadjer Benmeziane
Smail Niar
Riyadh Baghdadi
Hamza Ouarnoughi
Abdelkrime Aries
40
0
0
07 Aug 2024
Robust and Efficient Transfer Learning via Supernet Transfer in
  Warm-started Neural Architecture Search
Robust and Efficient Transfer Learning via Supernet Transfer in Warm-started Neural Architecture Search
Prabhant Singh
Joaquin Vanschoren
AAML
OOD
32
0
0
26 Jul 2024
DεpS: Delayed ε-Shrinking for Faster Once-For-All
  Training
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala
Alind Khare
Animesh Agrawal
Igor Fedorov
Hugo Latapie
Myungjin Lee
Alexey Tumanov
CLL
42
0
0
08 Jul 2024
Accelerate Intermittent Deep Inference
Accelerate Intermittent Deep Inference
Ziliang Zhang
20
0
0
01 Jul 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
53
4
0
17 Jun 2024
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of
  Image Deblurring
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring
Xintian Mao
Qingli Li
Yan Wang
45
14
0
13 Jun 2024
DAISY: Data Adaptive Self-Supervised Early Exit for Speech
  Representation Models
DAISY: Data Adaptive Self-Supervised Early Exit for Speech Representation Models
T. Lin
Hung-yi Lee
Hao Tang
40
1
0
08 Jun 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at
  Initialization
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
33
2
0
22 Apr 2024
TG-NAS: Leveraging Zero-Cost Proxies with Transformer and Graph
  Convolution Networks for Efficient Neural Architecture Search
TG-NAS: Leveraging Zero-Cost Proxies with Transformer and Graph Convolution Networks for Efficient Neural Architecture Search
Ye Qiao
Haocheng Xu
Sitao Huang
19
0
0
30 Mar 2024
Tiny Models are the Computational Saver for Large Models
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
36
2
0
26 Mar 2024
Anytime Neural Architecture Search on Tabular Data
Anytime Neural Architecture Search on Tabular Data
Naili Xing
Shaofeng Cai
Zhaojing Luo
Bengchin Ooi
Jian Pei
34
1
0
15 Mar 2024
FlatNAS: optimizing Flatness in Neural Architecture Search for
  Out-of-Distribution Robustness
FlatNAS: optimizing Flatness in Neural Architecture Search for Out-of-Distribution Robustness
Matteo Gambella
Fabrizio Pittorino
Manuel Roveri
OOD
40
3
0
29 Feb 2024
Multi-objective Differentiable Neural Architecture Search
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
40
1
0
28 Feb 2024
OmniPred: Language Models as Universal Regressors
OmniPred: Language Models as Universal Regressors
Xingyou Song
Oscar Li
Chansoo Lee
Bangding Yang
Daiyi Peng
Sagi Perel
Yutian Chen
53
14
0
22 Feb 2024
MatchNAS: Optimizing Edge AI in Sparse-Label Data Contexts via
  Automating Deep Neural Network Porting for Mobile Deployment
MatchNAS: Optimizing Edge AI in Sparse-Label Data Contexts via Automating Deep Neural Network Porting for Mobile Deployment
Hongtao Huang
Xiaojun Chang
Wen Hu
Lina Yao
30
0
0
21 Feb 2024
Body-Area Capacitive or Electric Field Sensing for Human Activity
  Recognition and Human-Computer Interaction: A Comprehensive Survey
Body-Area Capacitive or Electric Field Sensing for Human Activity Recognition and Human-Computer Interaction: A Comprehensive Survey
Sizhen Bian
Mengxi Liu
Bo Zhou
P. Lukowicz
Michele Magno
37
12
0
11 Jan 2024
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO
  Guarantees via DNN Re-alignment
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
25
11
0
17 Dec 2023
Accelerating Convolutional Neural Network Pruning via Spatial Aura
  Entropy
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy
Bogdan Musat
Razvan Andonie
13
0
0
08 Dec 2023
Subnetwork-to-go: Elastic Neural Network with Dynamic Training and
  Customizable Inference
Subnetwork-to-go: Elastic Neural Network with Dynamic Training and Customizable Inference
Kai Li
Yi Luo
23
2
0
06 Dec 2023
OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong
  Baseline
OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline
Xianda Guo
Chenming Zhang
Juntao Lu
Yiqi Wang
Yiqun Duan
Tian Yang
Zheng Zhu
Long Chen
3DV
27
16
0
01 Dec 2023
Efficient Stitchable Task Adaptation
Efficient Stitchable Task Adaptation
Haoyu He
Zizheng Pan
Jing Liu
Jianfei Cai
Bohan Zhuang
26
3
0
29 Nov 2023
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource
  Constraints
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints
Francesco Corti
Balz Maag
Joachim Schauer
U. Pferschy
O. Saukh
29
2
0
22 Nov 2023
Masked Autoencoders Are Robust Neural Architecture Search Learners
Masked Autoencoders Are Robust Neural Architecture Search Learners
Yiming Hu
Xiangxiang Chu
Bo-Wen Zhang
OOD
37
0
0
20 Nov 2023
DistDNAS: Search Efficient Feature Interactions within 2 Hours
DistDNAS: Search Efficient Feature Interactions within 2 Hours
Tunhou Zhang
W. Wen
Igor Fedorov
Xi Liu
Buyun Zhang
...
Wen-Yen Chen
Yiping Han
Feng Yan
Hai Helen Li
Yiran Chen
13
1
0
01 Nov 2023
Learning Generalizable Program and Architecture Representations for
  Performance Modeling
Learning Generalizable Program and Architecture Representations for Performance Modeling
Lingda Li
T. Flynn
A. Hoisie
16
1
0
25 Oct 2023
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks
Sweta Priyadarshi
Tianyu Jiang
Hsin-Pai Cheng
S. Rama Krishna
Viswanath Ganapathy
C. Patel
36
0
0
26 Sep 2023
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Dustin Wright
Christian Igel
Gabrielle Samuel
Raghavendra Selvan
29
15
0
05 Sep 2023
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
S. N. Sridhar
Souvik Kundu
Sairam Sundaresan
Maciej Szankin
Anthony Sarah
17
3
0
29 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive
  Language-Image Pre-training
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP
VLM
29
3
0
22 Aug 2023
Resource Constrained Model Compression via Minimax Optimization for
  Spiking Neural Networks
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks
Jue Chen
Huan Yuan
Jianchao Tan
Bin Chen
Chengru Song
Di Zhang
25
3
0
09 Aug 2023
Shrink-Perturb Improves Architecture Mixing during Population Based
  Training for Neural Architecture Search
Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture Search
A. Chebykin
A. Dushatskiy
T. Alderliesten
Peter A. N. Bosman
34
0
0
28 Jul 2023
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural
  Architecture Search
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search
Bhavna Gopal
Arjun Sridhar
Tunhou Zhang
Yiran Chen
16
3
0
06 Jul 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
43
0
0
23 May 2023
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object
  Detection Network for Low Power Microcontrollers
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers
Julian Moosmann
Marco Giordano
Christian Vogt
Michele Magno
MQ
ObjD
13
20
0
22 May 2023
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Chong Yu
Tao Chen
Zhongxue Gan
Jiayuan Fan
MQ
ViT
25
23
0
18 May 2023
12345
Next