2502.04428
Cited By
Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization
6 February 2025
Yu-Neng Chuang
Leisheng Yu
Guanchu Wang
Lizhe Zhang
Zirui Liu
Xuanting Cai
Yang Sui
Vladimir Braverman
Helen Zhou
Papers citing
"Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization"
16 / 16 papers shown
DTS: Enhancing Large Reasoning Models via Decoding Tree Sketching
Zicheng Xu
G. Wang
Yu-Neng Chuang
Guangyao Zheng
A. Szalay
Zirui Liu
Vladimir Braverman
LRM
AI4CE
137
0
0
01 Nov 2025
Can Confidence Estimates Decide When Chain-of-Thought Is Necessary for LLMs?
Samuel Lewis-Lim
Xingwei Tan
Zhixue Zhao
Nikolaos Aletras
LRM
247
1
0
23 Oct 2025
Gold-Switch: Training-Free Superposition of Slow- and Fast- Thinking LLMs
Jaeseong Lee
Dayoung Kwon
Seung-won Hwang
OffRL
LRM
100
0
0
08 Oct 2025
A Greedy PDE Router for Blending Neural Operators and Classical Methods
Sahana Rayan
Yash Patel
Ambuj Tewari
169
0
0
29 Sep 2025
Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models
Kaiyuan Liu
Chen Shen
Zhanwei Zhang
Junjie Liu
Xiaosong Yuan
Jieping Ye
ReLM
LRM
254
11
0
14 Jun 2025
OThink-R1: Intrinsic Fast/Slow Thinking Mode Switching for Over-Reasoning Mitigation
Shengjia Zhang
Junjie Wu
Jiawei Chen
Changwang Zhang
Yudi Wu
Wangchunshu Zhou
Sheng Zhou
Can Wang
Jun Wang
LRM
301
10
0
03 Jun 2025
Self-ensemble: Mitigating Confidence Mis-calibration for Large Language Models
Zicheng Xu
Guanchu Wang
Guangyao Zheng
Yu-Neng Chuang
A. Szalay
Helen Zhou
Vladimir Braverman
299
1
0
02 Jun 2025
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
Feng Luo
Yu-Neng Chuang
Guanchu Wang
Hoang Anh Duy Le
Shaochen Zhong
...
Jiayi Yuan
Yang Sui
Vladimir Braverman
Vipin Chaudhary
Helen Zhou
LRM
243
11
0
28 May 2025
SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
Zheng Li
Qingxiu Dong
Jingyuan Ma
Di Zhang
Kai Jia
Lei Sha
LRM
480
20
0
16 May 2025
A Survey on Collaborative Mechanisms Between Large and Small Language Models
Yi Chen
JiaHao Zhao
HaoHao Han
381
11
0
12 May 2025
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
Ziqing Qiao
Yongheng Deng
Jiali Zeng
Dong Wang
Lai Wei
Fandong Meng
Jie Zhou
Ju Ren
Yaoxue Zhang
LRM
466
23
0
08 May 2025
Dynamic Early Exit in Reasoning Models
Chenxu Yang
Qingyi Si
Yongjie Duan
Zheliang Zhu
Chenyu Zhu
Zheng Lin
Li Cao
Weiping Wang
ReLM
LRM
543
99
0
22 Apr 2025
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLM
LRM
917
41
0
15 Apr 2025
Reasoning Models Can Be Effective Without Thinking
Wenjie Ma
Jingxuan He
Charlie Snell
Tyler Griggs
Sewon Min
Matei A. Zaharia
ReLM
LRM
428
113
1
14 Apr 2025
Efficient Inference for Large Reasoning Models: A Survey
Yi Liu
Jiaying Wu
Yufei He
Hongcheng Gao
Hongyu Chen
...
Xu Cheng
Zhiqi Huang
Bryan Hooi
Stan Z. Li
Keqin Li
LLMAG
LRM
563
50
0
29 Mar 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Andrew Wen
Shaochen Zhong
Hanjie Chen
Helen Zhou
OffRL
ReLM
LRM
750
266
0
20 Mar 2025