Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.10090
Cited By
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
23 October 2018
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision"
50 / 82 papers shown
Title
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
36
0
0
01 Nov 2024
Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices
Jeho Lee
Chanyoung Jung
Jiwon Kim
Hojung Cha
3DPC
24
1
0
02 Oct 2024
HydraViT: Stacking Heads for a Scalable ViT
Janek Haberer
A. Hojjat
Olaf Landsiedel
26
0
0
26 Sep 2024
ELMS: Elasticized Large Language Models On Mobile Devices
Wangsong Yin
Rongjie Yi
Daliang Xu
Gang Huang
Mengwei Xu
Xuanzhe Liu
29
5
0
08 Sep 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
34
4
0
04 Jul 2024
Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving
Shuyao Shi
Neiwen Ling
Zhehao Jiang
Xuan Huang
Yuze He
...
Chen Bian
Jingfei Xia
Zhenyu Yan
Raymond W. Yeung
Guoliang Xing
16
6
0
21 Apr 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
27
6
0
20 Apr 2024
BRIEDGE: EEG-Adaptive Edge AI for Multi-Brain to Multi-Robot Interaction
Jinhui Ouyang
Mingzhu Wu
Xinglin Li
Hanhui Deng
Di Wu
16
2
0
14 Mar 2024
Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems
Justin Davis
M. E. Belviranli
9
1
0
12 Feb 2024
IoT in the Era of Generative AI: Vision and Challenges
Xin Wang
Zhongwei Wan
Arvin Hekmati
M. Zong
Samiul Alam
Mi Zhang
Bhaskar Krishnamachari
27
15
0
03 Jan 2024
Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI
Kai Huang
Wei Gao
15
35
0
21 Dec 2023
ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation
Zhuang Yan
Zhenzhe Zheng
Yunfeng Shao
Bingshuai Li
Fan Wu
Guihai Chen
17
3
0
18 Nov 2023
Collaborative Inference in DNN-based Satellite Systems with Dynamic Task Streams
Jinglong Guan
Qiyang Zhang
Ilir Murturi
Praveen Kumar Donta
Schahram Dustdar
Shangguang Wang
26
3
0
10 Nov 2023
MOSEL: Inference Serving Using Dynamic Modality Selection
Bodun Hu
Le Xu
Jeongyoon Moon
N. Yadwadkar
Aditya Akella
11
4
0
27 Oct 2023
AdaEvo: Edge-Assisted Continuous and Timely DNN Model Evolution for Mobile Devices
Lehao Wang
Zhiwen Yu
Haoyi Yu
Sicong Liu
Yaxiong Xie
Bin Guo
Yunxin Liu
11
5
0
27 Sep 2023
LLMCad: Fast and Scalable On-device Large Language Model Inference
Daliang Xu
Wangsong Yin
Xin Jin
Y. Zhang
Shiyun Wei
Mengwei Xu
Xuanzhe Liu
17
43
0
08 Sep 2023
RED: A Systematic Real-Time Scheduling Approach for Robotic Environmental Dynamics
Zexin Li
Tao Ren
Xiaoxi He
Cong Liu
21
7
0
29 Aug 2023
SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget
Rui Kong
Yuanchun Li
Qingtian Feng
Weijun Wang
Xiaozhou Ye
Ye Ouyang
L. Kong
Yunxin Liu
MoE
29
8
0
29 Aug 2023
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Yixuan Li
Hao Wen
Yunxin Liu
19
0
0
29 Aug 2023
Federated Learning for Computationally-Constrained Heterogeneous Devices: A Survey
Kilian Pfeiffer
Martin Rapp
R. Khalili
J. Henkel
FedML
14
63
0
18 Jul 2023
Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU
Zhihe Zhao
Neiwen Ling
Nan Guan
Guoliang Xing
15
11
0
10 Jul 2023
Breaking On-device Training Memory Wall: A Systematic Survey
Shitian Li
Chunlin Tian
Kahou Tam
Ruirui Ma
Li Li
21
2
0
17 Jun 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
11
0
0
19 Apr 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
82
29
0
13 Mar 2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Zhichao Lu
Chuntao Ding
Felix Juefei Xu
Vishnu Naresh Boddeti
Shangguang Wang
Yun Yang
19
13
0
15 Feb 2023
DynaMIX: Resource Optimization for DNN-Based Real-Time Applications on a Multi-Tasking System
Minkyoung Cho
K. Shin
24
2
0
03 Feb 2023
SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference
Alind Khare
A. Agrawal
Aditya Annavajjala
Payman Behnam
Myungjin Lee
Hugo Latapie
Alexey Tumanov
FedML
13
2
0
26 Jan 2023
Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network in Edge Computing
Tian Dong
Ziyuan Zhang
Han Qiu
Tianwei Zhang
Hewu Li
T. Wang
AAML
23
6
0
22 Dec 2022
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
29
14
0
01 Dec 2022
Edge Video Analytics: A Survey on Applications, Systems and Enabling Techniques
Renjie Xu
S. Razavi
Rong Zheng
38
15
0
28 Nov 2022
ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy
JunKyu Lee
Blesson Varghese
Hans Vandierendonck
ObjD
23
4
0
28 Oct 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Naresh Boddeti
Yidong Li
Jiannong Cao
19
14
0
20 Jul 2022
A Survey on Collaborative DNN Inference for Edge Intelligence
Weiqing Ren
Yuben Qu
Chao Dong
Yuqian Jing
Hao Sun
Qihui Wu
Song Guo
28
49
0
16 Jul 2022
STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
Liwei Guo
Wonkyo Choe
F. Lin
19
14
0
11 Jul 2022
Smart Multi-tenant Federated Learning
Weiming Zhuang
Yonggang Wen
Shuai Zhang
FedML
23
2
0
09 Jul 2022
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Taeho Kim
Yongin Kwon
Jemin Lee
Taeho Kim
Sangtae Ha
20
2
0
04 Jul 2022
Turbo: Opportunistic Enhancement for Edge Video Analytics
Yan Lu
Shiqi Jiang
Ting Cao
Yuanchao Shu
34
30
0
29 Jun 2022
Boosting DNN Cold Inference on Edge Devices
Rongjie Yi
Ting Cao
Ao Zhou
Xiao Ma
Shangguang Wang
Mengwei Xu
57
6
0
15 Jun 2022
Multi-DNN Accelerators for Next-Generation AI Systems
Stylianos I. Venieris
C. Bouganis
Nicholas D. Lane
21
7
0
19 May 2022
FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics
Md. Adnan Arefeen
Sumaiya Tabassum Nimi
M. Y. S. Uddin
15
10
0
22 Mar 2022
YONO: Modeling Multiple Heterogeneous Neural Networks on Microcontrollers
Young D. Kwon
Jagmohan Chauhan
Cecilia Mascolo
16
13
0
08 Mar 2022
Resource-Efficient Deep Learning: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques
JunKyu Lee
L. Mukhanov
A. S. Molahosseini
U. Minhas
Yang Hua
Jesus Martinez del Rincon
K. Dichev
Cheol-Ho Hong
Hans Vandierendonck
31
29
0
30 Dec 2021
Virtuoso: Video-based Intelligence for real-time tuning on SOCs
Jayoung Lee
Pengcheng Wang
Ran Xu
Venkateswara Dasari
Noah Weston
Yin Li
S. Bagchi
Somali Chaterji
23
2
0
24 Dec 2021
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision
Rui Han
Qinglong Zhang
C. Liu
Guoren Wang
Jian Tang
L. Chen
16
43
0
18 Dec 2021
CANS: Communication Limited Camera Network Self-Configuration for Intelligent Industrial Surveillance
Jingzheng Tu
Qimin Xu
Cailian Chen
24
2
0
13 Sep 2021
SensiX++: Bringing MLOPs and Multi-tenant Model Serving to Sensory Edge Devices
Chulhong Min
Akhil Mathur
Utku Günay Acer
A. Montanari
F. Kawsar
18
11
0
08 Sep 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
W. Dong
Jianbo Shi
27
18
0
30 Aug 2021
Leveraging Transprecision Computing for Machine Vision Applications at the Edge
U. Minhas
L. Mukhanov
G. Karakonstantis
Hans Vandierendonck
Roger Francis Woods
24
5
0
29 Aug 2021
A Field Guide to Federated Optimization
Jianyu Wang
Zachary B. Charles
Zheng Xu
Gauri Joshi
H. B. McMahan
...
Mi Zhang
Tong Zhang
Chunxiang Zheng
Chen Zhu
Wennan Zhu
FedML
173
411
0
14 Jul 2021
How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom Architectures
Stylianos I. Venieris
Ioannis Panopoulos
Ilias Leontiadis
I. Venieris
28
6
0
21 Jun 2021
1
2
Next