1907.04523
Cited By
Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference
10 July 2019
Yue Wang
Jianghao Shen
Ting-Kuei Hu
Pengfei Xu
T. Nguyen
Richard Baraniuk
Zhangyang Wang
Yingyan Lin
Papers citing
"Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference"
18 / 18 papers shown
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Yoojin Jung
Byung Cheol Song
AAML
VLM
MQ
36
0
0
07 Apr 2025
Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE
Florence Regol
Joud Chataoui
Bertrand Charpentier
Mark J. Coates
Pablo Piantanida
Stephan Gunnemann
45
0
0
20 Jun 2024
Model Adaptation for Time Constrained Embodied Control
Jaehyun Song
Minjong Yoo
Honguk Woo
42
0
0
17 Jun 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
37
3
0
25 Mar 2024
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
36
0
0
13 May 2023
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
24
2
0
06 Dec 2022
2-in-1 Accelerator: Enabling Random Precision Switch for Winning Both Adversarial Robustness and Efficiency
Yonggan Fu
Yang Katie Zhao
Qixuan Yu
Chaojian Li
Yingyan Lin
AAML
49
12
0
11 Sep 2021
IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers
Bowen Pan
Rameswar Panda
Yi Ding
Zhangyang Wang
Rogerio Feris
A. Oliva
VLM
ViT
39
153
0
23 Jun 2021
Graceful Degradation and Related Fields
J. Dymond
31
4
0
21 Jun 2021
AppealNet: An Efficient and Highly-Accurate Edge/Cloud Collaborative Architecture for DNN Inference
Min Li
Yu Li
Ye Tian
Li Jiang
Qiang Xu
28
33
0
10 May 2021
InstantNet: Automated Generation and Deployment of Instantaneously Switchable-Precision Networks
Yonggan Fu
Zhongzhi Yu
Yongan Zhang
Yi Ding
Chaojian Li
Yongyuan Liang
Mingchao Jiang
Zhangyang Wang
Yingyan Lin
20
3
0
22 Apr 2021
HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
Chaojian Li
Zhongzhi Yu
Yonggan Fu
Yongan Zhang
Yang Katie Zhao
Haoran You
Qixuan Yu
Yue Wang
Yingyan Lin
50
106
0
19 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
33
199
0
08 Mar 2021
FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training
Y. Fu
Haoran You
Yang Katie Zhao
Yue Wang
Chaojian Li
K. Gopalakrishnan
Zhangyang Wang
Yingyan Lin
MQ
35
32
0
24 Dec 2020
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation
Yang Katie Zhao
Xiaohan Chen
Yue Wang
Chaojian Li
Haoran You
Y. Fu
Yuan Xie
Zhangyang Wang
Yingyan Lin
MQ
32
43
0
07 May 2020
TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators Towards Local and in Time Domain
Weitao Li
Pengfei Xu
Yang Katie Zhao
Haitong Li
Yuan Xie
Yingyan Lin
9
68
0
03 May 2020
DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architectures
Yang Katie Zhao
Chaojian Li
Yue Wang
Pengfei Xu
Yongan Zhang
Yingyan Lin
17
41
0
26 Feb 2020
AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs
Pengfei Xu
Xiaofan Zhang
Cong Hao
Yang Katie Zhao
Yongan Zhang
Yue Wang
Chaojian Li
Zetong Guan
Deming Chen
Yingyan Lin
23
88
0
06 Jan 2020