Learning to Cascade: Confidence Calibration for Improving the Accuracy and Computational Cost of Cascade Inference Systems

AAAI Conference on Artificial Intelligence (AAAI), 2021

15 April 2021

Papers citing "Learning to Cascade: Confidence Calibration for Improving the Accuracy and Computational Cost of Cascade Inference Systems"

Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models

IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE), 2025

205

03 Apr 2025

Gatekeeper: Improving Model Cascades Through Confidence Tuning

557

26 Feb 2025

What is the Role of Small Models in the LLM Era: A Survey

Lihu Chen

Gaël Varoquaux

ALM

918

10 Sep 2024

Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

250

18 Jun 2024

Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

267

04 Jun 2023

Perception and Semantic Aware Regularization for Sequential Confidence Calibration

Computer Vision and Pattern Recognition (CVPR), 2023

Shuangping Huang

406

31 May 2023

QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures

Devdhar Patel

H. Siegelmann

OnRL

225

25 Dec 2022

Turbo: Opportunistic Enhancement for Edge Video Analytics

ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2022

233

29 Jun 2022

CLCNet: Rethinking of Ensemble Modeling with Classification Confidence Network

Yaodong Yu

S. Horng

316

19 May 2022

Confidence Calibration for Intent Detection via Hyperspherical Space and Rebalanced Accuracy-Uncertainty Loss

AAAI Conference on Artificial Intelligence (AAAI), 2022

Fan Yang

Houfeng Wang

233

17 Mar 2022

Hidden Heterogeneity: When to Choose Similarity-Based Calibration

K. Wagstaff

Thomas G. Dietterich

314

03 Feb 2022

Task-Oriented Communication for Multi-Device Cooperative Edge Inference

IEEE Transactions on Wireless Communications (IEEE TWC), 2021

Jiawei Shao

Yuyi Mao

Jun Zhang

333

189

01 Sep 2021

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Lei Li

Yankai Lin

Deli Chen

Shuhuai Ren

Peng Li

Jie Zhou

Xu Sun

285

29 Dec 2020