1812.03443
Cited By
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
9 December 2018
Bichen Wu
Xiaoliang Dai
Peizhao Zhang
Yanghan Wang
Fei Sun
Yiming Wu
Yuandong Tian
Peter Vajda
Yangqing Jia
Kurt Keutzer
MQ
Papers citing "FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search"
50 / 233 papers shown
Title
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
50
0
0
08 Mar 2025
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
Xingrun Xing
Zheng Liu
Shitao Xiao
Boyan Gao
Yiming Liang
Wanpeng Zhang
Haokun Lin
Guoqi Li
Jiajun Zhang
LRM
61
1
0
10 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
72
0
0
26 Jan 2025
Improving Accuracy and Generalization for Efficient Visual Tracking
Ram J. Zaveri
Shivang Patel
Yu Gu
Gianfranco Doretto
VLM
86
0
0
28 Nov 2024
NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models
Yang Xu
Huihong Shi
Zhongfeng Wang
37
0
0
07 Sep 2024
Combining Neural Architecture Search and Automatic Code Optimization: A Survey
Inas Bachiri
Hadjer Benmeziane
Smail Niar
Riyadh Baghdadi
Hamza Ouarnoughi
Abdelkrime Aries
40
0
0
07 Aug 2024
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala
Alind Khare
Animesh Agrawal
Igor Fedorov
Hugo Latapie
Myungjin Lee
Alexey Tumanov
CLL
37
0
0
08 Jul 2024
P²-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer
Huihong Shi
Xin Cheng
Wendong Mao
Zhongfeng Wang
MQ
40
3
0
30 May 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
40
17
0
29 Mar 2024
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
40
1
0
28 Feb 2024
Masked Autoencoders Are Robust Neural Architecture Search Learners
Yiming Hu
Xiangxiang Chu
Bo-Wen Zhang
OOD
37
0
0
20 Nov 2023
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Jianlei Yang
Jiacheng Liao
Fanding Lei
Meichen Liu
Junyi Chen
Lingkun Long
Han Wan
Bei Yu
Weisheng Zhao
MoE
33
2
0
03 Nov 2023
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks
Sweta Priyadarshi
Tianyu Jiang
Hsin-Pai Cheng
S. Rama Krishna
Viswanath Ganapathy
C. Patel
36
0
0
26 Sep 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
32
2
0
07 Aug 2023
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search
Bhavna Gopal
Arjun Sridhar
Tunhou Zhang
Yiran Chen
16
3
0
06 Jul 2023
Robustifying DARTS by Eliminating Information Bypass Leakage via Explicit Sparse Regularization
Jiuling Zhang
Zhiming Ding
AAML
18
3
0
12 Jun 2023
Performance-optimized deep neural networks are evolving into worse models of inferotemporal visual cortex
Drew Linsley
I. F. Rodriguez
Thomas Fel
Michael Arcaro
Saloni Sharma
Margaret Livingstone
Thomas Serre
35
18
0
06 Jun 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
30
9
0
26 May 2023
Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence
Y. Fu
Yuecheng Li
Chenghui Li
Jason M. Saragih
Peizhao Zhang
Xiaoliang Dai
Yingyan Lin
3DH
37
2
0
24 Apr 2023
ALiSNet: Accurate and Lightweight Human Segmentation Network for Fashion E-Commerce
Amrollah Seifoddini
K. Vernooij
Timon Künzle
A. Canopoli
Malte F. Alf
Anna Volokitin
Reza Shirvany
3DH
23
0
0
15 Apr 2023
ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement
Chaojian Li
Wenwan Chen
Jiayi Yuan
Yingyan Lin
Ashutosh Sabharwal
23
0
0
19 Mar 2023
Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search
Guangliang Cheng
Peng Sun
Ting-Bing Xu
Shuchang Lyu
Peiwen Lin
18
1
0
16 Feb 2023
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
Jared Fernandez
Jacob Kahn
Clara Na
Yonatan Bisk
Emma Strubell
FedML
25
10
0
13 Feb 2023
Oscillation-free Quantization for Low-bit Vision Transformers
Shi Liu
Zechun Liu
Kwang-Ting Cheng
MQ
13
34
0
04 Feb 2023
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits
Simone Sarti
Eugenio Lomurno
Andrea Falanti
Matteo Matteucci
21
3
0
03 Feb 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
34
60
0
26 Jan 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
9
1
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
32
2
0
25 Jan 2023
HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
Jinqi Xiao
Chengming Zhang
Yu Gong
Miao Yin
Yang Sui
Lizhi Xiang
Dingwen Tao
Bo Yuan
16
19
0
20 Jan 2023
Pruning Compact ConvNets for Efficient Inference
Sayan Ghosh
Karthik Prasad
Xiaoliang Dai
Peizhao Zhang
Bichen Wu
Graham Cormode
Peter Vajda
VLM
19
4
0
11 Jan 2023
OVO: One-shot Vision Transformer Search with Online distillation
Zimian Wei
H. Pan
Xin-Yi Niu
Dongsheng Li
ViT
29
1
0
28 Dec 2022
A Study on the Intersection of GPU Utilization and CNN Inference
J. Kosaian
Amar Phanishayee
13
3
0
15 Dec 2022
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
Jiaqi Gu
Ben Keller
Jean Kossaifi
Anima Anandkumar
Brucek Khailany
D. Pan
ViT
30
8
0
30 Nov 2022
GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Chaoting Xu
Yunhe Wang
18
270
0
23 Nov 2022
RepGhost: A Hardware-Efficient Ghost Module via Re-parameterization
Chengpeng Chen
Zichao Guo
Haien Zeng
Pengfei Xiong
Jian Dong
26
37
0
11 Nov 2022
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Aditya Manglik
Minesh Patel
Haiyu Mao
Behzad Salami
Jisung Park
Lois Orosa
O. Mutlu
15
1
0
10 Nov 2022
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Maurizio Denna
Abdelbadie Younes
Ganzorig Gankhuyag
...
Jing Liu
Garas Gendy
Nabil Sabor
J. Hou
Guanghui He
SupR
MQ
20
31
0
07 Nov 2022
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Grigory Malivenko
Radu Timofte
Lukasz Treszczotko
Xin-ke Chang
...
Dongwon Park
Seongmin Hong
Joonhee Lee
Seunggyu Lee
Sengsub Chun
21
17
0
07 Nov 2022
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Radu Timofte
Shuai Liu
Chaoyu Feng
Furui Bai
...
Xin Lou
Wei Zhou
Cong Pang
Haina Qin
Mingxuan Cai
19
23
0
07 Nov 2022
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Muyang Li
Ji Lin
Chenlin Meng
Stefano Ermon
Song Han
Jun-Yan Zhu
DiffM
34
45
0
03 Nov 2022
Adaptive Mask-based Pyramid Network for Realistic Bokeh Rendering
K. Georgiadis
Albert Saà-Garriga
M. K. Yucel
Anastasios Drosou
Bruno Manganelli
3DH
32
7
0
28 Oct 2022
NASA: Neural Architecture Search and Acceleration for Hardware Inspired Hybrid Networks
Huihong Shi
Haoran You
Yang Katie Zhao
Zhongfeng Wang
Yingyan Lin
56
7
0
24 Oct 2022
Deep Model Reassembly
Xingyi Yang
Zhou Daquan
Songhua Liu
Jingwen Ye
Xinchao Wang
MoMe
20
120
0
24 Oct 2022
Pareto-aware Neural Architecture Generation for Diverse Computational Budgets
Yong Guo
Yaofo Chen
Yin Zheng
Qi Chen
P. Zhao
Jian Chen
Junzhou Huang
Mingkui Tan
26
5
0
14 Oct 2022
Latency-aware Spatial-wise Dynamic Networks
Yizeng Han
Zhihang Yuan
Yifan Pu
Chenhao Xue
S. Song
Guangyu Sun
Gao Huang
39
25
0
12 Oct 2022
Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search
Thanh Vu
Yan-Quan Zhou
Chun-Yung Wen
Yueqi Li
Jan-Michael Frahm
32
4
0
04 Oct 2022
DFA: Dynamic Feature Aggregation for Efficient Video Object Detection
Yiming Cui
37
8
0
02 Oct 2022
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
28
7
0
27 Sep 2022
Tiered Pruning for Efficient Differentialble Inference-Aware Neural Architecture Search
Slawomir Kierat
Mateusz Sieniawski
Denys Fridman
Chendi Yu
Szymon Migacz
Pawel M. Morkisz
A. Fit-Florea
3DPC
19
0
0
23 Sep 2022
PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference
Hongwu Peng
Shangli Zhou
Yukui Luo
Shijin Duan
Nuo Xu
...
Tong Geng
Ang Li
Wujie Wen
Xiaolin Xu
Caiwen Ding
21
3
0
20 Sep 2022