Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.04685
Cited By
Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference
9 June 2022
Xiangjie Li
Chen Lou
Zhengping Zhu
Yuchi Chen
Yingtao Shen
Yehan Ma
An Zou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference"
11 / 11 papers shown
Title
HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving
Avinash Kumar
Shashank Nag
Jason Clemons
L. John
Poulami Das
26
0
0
14 Apr 2025
AgroLLM: Connecting Farmers and Agricultural Practices through Large Language Models for Enhanced Knowledge Transfer and Practical Application
Dinesh Jackson Samuel
Inna Skarga-Bandurova
David Sikolia
Muhammad Awais
42
0
0
28 Feb 2025
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference
Nikhil Bhendawade
Mahyar Najibi
Devang Naik
Irina Belousova
MoE
85
0
0
04 Feb 2025
Recall: Empowering Multimodal Embedding for Edge Devices
Dongqi Cai
Shangguang Wang
Chen Peng
Zeling Zhang
Mengwei Xu
27
3
0
09 Sep 2024
Don't Think It Twice: Exploit Shift Invariance for Efficient Online Streaming Inference of CNNs
Christodoulos Kechris
Jonathan Dan
Jose Miranda
David Atienza
AI4TS
22
0
0
06 Aug 2024
Sensor-Aware Classifiers for Energy-Efficient Time Series Applications on IoT Devices
Dina Hussein
Lubah Nelson
Ganapati Bhat
15
1
0
11 Jul 2024
Jointly-Learned Exit and Inference for a Dynamic Neural Network : JEI-DNN
Florence Regol
Joud Chataoui
Mark J. Coates
17
5
0
13 Oct 2023
DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference
Ziyang Zhang
Yang Zhao
Huan Li
Changyao Lin
Jie Liu
25
13
0
02 Jun 2023
NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference
Ruiqi Sun
Siwei Ye
Jie Zhao
Xin He
Yiran Li
An Zou
27
0
0
23 May 2023
FIANCEE: Faster Inference of Adversarial Networks via Conditional Early Exits
Polina Karpikova
Radionova Ekaterina
A. Yaschenko
Andrei A. Spiridonov
Leonid Kostyushko
Riccardo Fabbricatore
Aleksei Ivakhnenko Samsung AI Center
11
3
0
20 Apr 2023
HADAS: Hardware-Aware Dynamic Neural Architecture Search for Edge Performance Scaling
Halima Bouzidi
Mohanad Odema
Hamza Ouarnoughi
Mohammad Abdullah Al Faruque
Smail Niar
22
19
0
06 Dec 2022
1