2104.09286
Learning to Cascade: Confidence Calibration for Improving the Accuracy and Computational Cost of Cascade Inference Systems
AAAI Conference on Artificial Intelligence (AAAI), 2021
15 April 2021
Shohei Enomoto
Takeharu Eda
UQCV
Papers citing "Learning to Cascade: Confidence Calibration for Improving the Accuracy and Computational Cost of Cascade Inference Systems"
13 / 13 papers shown
Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models
IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), 2025
Chengyang He
Wenlong Zhang
Violet Chen
Yue Ning
Ping Wang
205
1
0
03 Apr 2025
Gatekeeper: Improving Model Cascades Through Confidence Tuning
Stephan Rabanser
Nathalie Rauschmayr
Achin Kulshrestha
Petra Poklukar
Wittawat Jitkrittum
Sean Augenstein
Congchao Wang
Federico Tombari
557
1
0
26 Feb 2025
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
918
62
0
10 Sep 2024
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yao-Ching Yu
Chun-Chih Kuo
Ziqi Ye
Yu-Cheng Chang
Yueh-Se Li
250
33
0
18 Jun 2024
Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Daniel Rotem
Michael Hassid
Jonathan Mamou
Roy Schwartz
267
7
0
04 Jun 2023
Perception and Semantic Aware Regularization for Sequential Confidence Calibration
Computer Vision and Pattern Recognition (CVPR), 2023
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
406
4
0
31 May 2023
QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures
Devdhar Patel
H. Siegelmann
OnRL
225
1
0
25 Dec 2022
Turbo: Opportunistic Enhancement for Edge Video Analytics
ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2022
Yan Lu
Shiqi Jiang
Ting Cao
Yuanchao Shu
233
34
0
29 Jun 2022
CLCNet: Rethinking of Ensemble Modeling with Classification Confidence Network
Yaodong Yu
S. Horng
316
0
0
19 May 2022
Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yantao Gong
Cao Liu
Fan Yang
Xunliang Cai
Guanglu Wan
Jiansong Chen
Weipeng Zhang
Houfeng Wang
UQCV
233
4
0
17 Mar 2022
Hidden Heterogeneity: When to Choose Similarity-Based Calibration
K. Wagstaff
Thomas G. Dietterich
314
1
0
03 Feb 2022
Task-Oriented Communication for Multi-Device Cooperative Edge Inference
IEEE Transactions on Wireless Communications (IEEE TWC), 2021
Jiawei Shao
Yuyi Mao
Jun Zhang
333
189
0
01 Sep 2021
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Lei Li
Yankai Lin
Deli Chen
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
285
59
0
29 Dec 2020