Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.01686
Cited By
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
6 September 2017
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks"
33 / 133 papers shown
Title
VA-RED
2
^2
2
: Video Adaptive Redundancy Reduction
Bowen Pan
Rameswar Panda
Camilo Luciano Fosco
Chung-Ching Lin
A. Andonian
Yue Meng
Kate Saenko
A. Oliva
Rogerio Feris
15
19
0
15 Feb 2021
It's always personal: Using Early Exits for Efficient On-Device CNN Personalisation
Ilias Leontiadis
Stefanos Laskaridis
Stylianos I. Venieris
Nicholas D. Lane
65
29
0
02 Feb 2021
NetCut: Real-Time DNN Inference Using Layer Removal
Mehrshad Zandigohar
Deniz Erdogmus
G. Schirner
15
5
0
13 Jan 2021
Bringing AI To Edge: From Deep Learning's Perspective
Di Liu
Hao Kong
Xiangzhong Luo
Weichen Liu
Ravi Subramaniam
52
116
0
25 Nov 2020
PAC Confidence Predictions for Deep Neural Network Classifiers
Sangdon Park
Shuo Li
Insup Lee
Osbert Bastani
UQCV
24
25
0
02 Nov 2020
Revisiting Batch Normalization for Training Low-latency Deep Spiking Neural Networks from Scratch
Youngeun Kim
Priyadarshini Panda
25
170
0
05 Oct 2020
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud
Stefanos Laskaridis
Stylianos I. Venieris
Mario Almeida
Ilias Leontiadis
Nicholas D. Lane
28
265
0
14 Aug 2020
HAPI: Hardware-Aware Progressive Inference
Stefanos Laskaridis
Stylianos I. Venieris
Hyeji Kim
Nicholas D. Lane
14
45
0
10 Aug 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
Ji Xin
Raphael Tang
Jaejun Lee
Yaoliang Yu
Jimmy J. Lin
6
363
0
27 Apr 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
29
7
0
24 Apr 2020
FastBERT: a Self-distilling BERT with Adaptive Inference Time
Weijie Liu
Peng Zhou
Zhe Zhao
Zhiruo Wang
Haotang Deng
Qi Ju
31
354
0
05 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,450
0
18 Mar 2020
Resolution Adaptive Networks for Efficient Inference
Le Yang
Yizeng Han
Xi Chen
Shiji Song
Jifeng Dai
Gao Huang
16
215
0
16 Mar 2020
Communication-Efficient Edge AI: Algorithms and Systems
Yuanming Shi
Kai Yang
Tao Jiang
Jun Zhang
Khaled B. Letaief
GNN
17
326
0
22 Feb 2020
Deep regularization and direct training of the inner layers of Neural Networks with Kernel Flows
G. Yoo
H. Owhadi
22
21
0
19 Feb 2020
Project CLAI: Instrumenting the Command Line as a New Environment for AI Agents
Mayank Agarwal
Jorge J. Barroso
Tathagata Chakraborti
Eli M. Dow
Kshitij P. Fadnis
Borja Godoy
Madhavan Pallan
Kartik Talamadupula
14
8
0
31 Jan 2020
Adaptive Anomaly Detection for IoT Data in Hierarchical Edge Computing
Mao V. Ngo
H. Chaouchi
Tie-Mei Luo
Tony Q. S. Quek
19
16
0
10 Jan 2020
S2DNAS:Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search
Zhihang Yuan
Bingzhe Wu
Zheng Liang
Shiwan Zhao
Weichen Bi
Guangyu Sun
25
30
0
16 Nov 2019
ALERT: Accurate Learning for Energy and Timeliness
Chengcheng Wan
M. Santriaji
E. Rogers
H. Hoffmann
Michael Maire
Shan Lu
AI4CE
32
40
0
31 Oct 2019
Depth-Adaptive Transformer
Maha Elbayad
Jiatao Gu
Edouard Grave
Michael Auli
19
188
0
22 Oct 2019
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
4
614
0
04 Oct 2019
Machine Learning at the Network Edge: A Survey
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
30
378
0
31 Jul 2019
Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing
Zhi Zhou
Xu Chen
En Li
Liekang Zeng
Ke Luo
Junshan Zhang
19
1,418
0
24 May 2019
High Frequency Residual Learning for Multi-Scale Image Classification
Bowen Cheng
Rong Xiao
Jianfeng Wang
Thomas Huang
Lei Zhang
26
21
0
07 May 2019
Approximate LSTMs for Time-Constrained Inference: Enabling Fast Reaction in Self-Driving Cars
Alexandros Kouris
Stylianos I. Venieris
Michail Rizakis
C. Bouganis
AI4TS
11
12
0
02 May 2019
MBS: Macroblock Scaling for CNN Model Reduction
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
MQ
16
4
0
18 Sep 2018
SECS: Efficient Deep Stream Processing via Class Skew Dichotomy
Boyuan Feng
Kun Wan
Shu Yang
Yufei Ding
25
4
0
07 Sep 2018
Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy
En Li
Zhi Zhou
Xu Chen
14
325
0
20 Jun 2018
The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer
Jan M. Köhler
Thomas Pfeil
19
16
0
13 Jun 2018
Convolutional Networks with Adaptive Inference Graphs
Andreas Veit
Serge J. Belongie
OOD
GNN
19
382
0
30 Nov 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks
Xin Wang
F. I. F. Richard Yu
Zi-Yi Dou
Trevor Darrell
Joseph E. Gonzalez
25
625
0
26 Nov 2017
Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
Hong-Yu Zhou
Bin-Bin Gao
Jianxin Wu
ObjD
40
28
0
20 Jul 2017
Hard-Aware Deeply Cascaded Embedding
Yuhui Yuan
Kuiyuan Yang
Chao Zhang
30
300
0
17 Nov 2016
Previous
1
2
3