1706.04599
Cited By
v1
v2 (latest)
On Calibration of Modern Neural Networks
International Conference on Machine Learning (ICML), 2017
14 June 2017
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
Papers citing
"On Calibration of Modern Neural Networks"
50 / 3,755 papers shown
The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models
Lijun Sheng
Jian Liang
Ran He
Z. Wang
Tieniu Tan
VLM
MLLM
271
1
0
30 Jun 2025
Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Xing Shen
Justin Szeto
Mingyang Li
Hengguan Huang
Tal Arbel
209
1
0
29 Jun 2025
HazeMatching: Dehazing Light Microscopy Images with Guided Conditional Flow Matching
Anirban Ray
Ashesh Ashesh
Florian Jug
257
0
0
27 Jun 2025
Active Inference AI Systems for Scientific Discovery
Karthik Duraisamy
AI4CE
LRM
335
1
0
26 Jun 2025
Precise Bayesian Neural Networks
Carlos Stein Brito
87
0
0
24 Jun 2025
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
Xinyi Liu
Weiguang Wang
Hangfeng He
183
0
0
20 Jun 2025
Keeping Medical AI Healthy and Trustworthy: A Review of Detection and Correction Methods for System Degradation
Hao Guan
D. Bates
Li Zhou
OOD
145
4
0
20 Jun 2025
A Hybrid DeBERTa and Gated Broad Learning System for Cyberbullying Detection in English Text
Devesh Kumar
100
0
0
19 Jun 2025
Spatially-Aware Evaluation of Segmentation Uncertainty
Tal Zeevi
Eléonore V. Lieffrig
Lawrence H. Staib
John Onofrey
147
2
0
19 Jun 2025
One Sample is Enough to Make Conformal Prediction Robust
Soroush H. Zargarbashi
Mohammad Sadegh Akhondzadeh
Aleksandar Bojchevski
151
3
0
19 Jun 2025
Loss-Oriented Ranking for Automated Visual Prompting in LVLMs
Yuan Zhang
Chun-Kai Fan
Tao Huang
Ming Lu
Sicheng Yu
Junwen Pan
Kuan Cheng
Qi She
Shanghang Zhang
VLM
LRM
198
2
0
19 Jun 2025
Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs
Jing Yang Lee
Kong-Aik Lee
Woon-Seng Gan
210
0
0
18 Jun 2025
Uncertainty Estimation by Human Perception versus Neural Models
Pedro Mendes
Paolo Romano
David Garlan
211
0
0
18 Jun 2025
ODD: Overlap-aware Estimation of Model Performance under Distribution Shift
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Aayush Mishra
Anqi Liu
160
1
0
17 Jun 2025
Enclosing Prototypical Variational Autoencoder for Explainable Out-of-Distribution Detection
International Conference on Computer Safety, Reliability, and Security (SAFECOMP), 2025
Conrad Orglmeister
Erik Bochinski
Volker Eiselein
Elvira Fleig
155
0
0
17 Jun 2025
Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs
Gerardo Flores
Alyssa H. Smith
Julia A Fukuyama
Ashia C. Wilson
195
1
0
17 Jun 2025
Uncertainty-Aware Graph Neural Networks: A Multi-Hop Evidence Fusion Approach
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Qingfeng Chen
Shiyuan Li
Yixin Liu
Shirui Pan
Geoffrey I. Webb
Shichao Zhang
EDL
278
8
0
16 Jun 2025
Generative or Discriminative? Revisiting Text Classification in the Era of Transformers
Siva Rajesh Kasa
Karan Gupta
Sumegh Roychowdhury
Ashutosh Kumar
Yaswanth Biruduraju
Santhosh Kumar Kasa
Nikhil Pattisapu
Arindam Bhattacharya
Shailendra Agarwal
Vijay Huddar
164
2
0
13 Jun 2025
Ground Reaction Force Estimation via Time-aware Knowledge Distillation
IEEE Internet of Things Journal (IEEE IoT J.), 2025
Eun Som Jeon
Sinjini Mitra
Jisoo Lee
Omik M. Save
Ankita Shukla
Hyunglae Lee
Pavan Turaga
297
1
0
12 Jun 2025
Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration
Kyohei Atarashi
S. Oyama
Hiromi Arai
H. Kashima
205
0
0
12 Jun 2025
Textual Bayes: Quantifying Uncertainty in LLM-Based Systems
Brendan Leigh Ross
Noël Vouitsis
Atiyeh Ashari Ghomi
Rasa Hosseinzadeh
Ji Xin
...
Yi Sui
Shiyi Hou
Kin Kwan Leung
Gabriel Loaiza-Ganem
Jesse C. Cresswell
292
3
0
11 Jun 2025
Test-Time-Scaling for Zero-Shot Diagnosis with Visual-Language Reasoning
Ji Young Byun
Young-Jin Park
Navid Azizan
Rama Chellappa
LM&MA
LRM
137
1
0
11 Jun 2025
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Andreas Säuberli
Diego Frassinelli
Barbara Plank
AI4Ed
227
2
0
11 Jun 2025
Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks
Achim Hekler
Lukas Kuhn
Florian Buettner
UQCV
231
1
0
11 Jun 2025
Balanced Hyperbolic Embeddings Are Natural Out-of-Distribution Detectors
Tejaswi Kasarla
Max van Spengler
Pascal Mettes
OODD
273
2
0
11 Jun 2025
Know What You Don't Know: Uncertainty Calibration of Process Reward Models
Young-Jin Park
Kristjan Greenewald
Kaveh Alim
Hao Wang
Navid Azizan
LRM
352
3
0
11 Jun 2025
Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal Logic
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Zhenjiang Mao
Artem Bisliouk
Rohith Reddy Nama
Ivan Ruchkin
ReLM
LRM
142
1
0
09 Jun 2025
WWAggr: A Window Wasserstein-based Aggregation for Ensemble Change Point Detection
Alexander Stepikin
Galina Boeva
Alexey Zaytsev
279
0
0
09 Jun 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
Siddartha Devic
Tejas Srinivasan
Jesse Thomason
Willie Neiswanger
183
7
0
09 Jun 2025
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning
Vahid Balazadeh
Hamidreza Kamkari
Valentin Thomas
Benson Li
Junwei Ma
Jesse C. Cresswell
Rahul G. Krishnan
CML
186
5
0
09 Jun 2025
Residual Reweighted Conformal Prediction for Graph Neural Networks
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Zheng Zhang
Jie Bao
Zhixin Zhou
Nicolo Colombo
Lixin Cheng
Rui Luo
194
3
0
09 Jun 2025
AssertBench: A Benchmark for Evaluating Self-Assertion in Large Language Models
Jaeho Lee
Atharv Chowdhary
HILM
161
0
0
08 Jun 2025
Improving Wildlife Out-of-Distribution Detection: Africas Big Five
Mufhumudzi Muthivhi
Jiahao Huo
Fredrik Gustafsson
Terence L van Zyl
OODD
148
1
0
07 Jun 2025
Quantifying Adversarial Uncertainty in Evidential Deep Learning using Conflict Resolution
Charmaine Barker
Daniel Bethell
Simos Gerasimou
AAML
294
0
0
06 Jun 2025
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness
Rongzhe Wei
Peizhi Niu
Hans Hao-Hsun Hsu
Ruihan Wu
Haoteng Yin
...
Vamsi K. Potluru
Eli Chien
Kamalika Chaudhuri
S. Rasoul Etesami
P. Li
MU
KELM
467
6
0
06 Jun 2025
Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness
S. Landgraf
Markus Hillemann
Markus Ulrich
UQCV
216
0
0
06 Jun 2025
Compositional Generalisation for Explainable Hate Speech Detection
Agostina Calabrese
Tom Sherborne
Bjorn Ross
Mirella Lapata
208
1
0
04 Jun 2025
Trustworthy Medical Question Answering: An Evaluation-Centric Survey
Yinuo Wang
Robert E. Mercer
Frank Rudzicz
Sudipta Singha Roy
Pengjie Ren
Zhumin Chen
Xindi Wang
ELM
208
2
0
04 Jun 2025
Trusted Fake Audio Detection Based on Dirichlet Distribution
Chi Ding
Junxiao Xue
Cong Wang
Hao Zhou
168
0
0
03 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Jinyuan Luo
Zhen Fang
Shouqing Yang
Seongheon Park
Ling Chen
AAML
HILM
213
0
0
03 Jun 2025
Self-ensemble: Mitigating Confidence Mis-calibration for Large Language Models
Zicheng Xu
Guanchu Wang
Guangyao Zheng
Yu-Neng Chuang
A. Szalay
Helen Zhou
Vladimir Braverman
242
1
0
02 Jun 2025
AIMSCheck: Leveraging LLMs for AI-Assisted Review of Modern Slavery Statements Across Jurisdictions
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Adriana Eufrosina Bora
Akshatha Arodi
Duoyi Zhang
Jordan Bannister
Mirko Bronzi
Arsène Fansi Tchango
M. A. Bashar
R. Nayak
Kerrie Mengersen
108
2
0
02 Jun 2025
Mispronunciation Detection Without L2 Pronunciation Dataset in Low-Resource Setting: A Case Study in Finland Swedish
Nhan Phan
Mikko Kuronen
Maria Kautonen
Riikka Ullakonoja
Anna von Zansen
Yaroslav Getman
Ekaterina Voskoboinik
Tamás Grósz
M. Kurimo
108
0
0
01 Jun 2025
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
Gabrielle Kaili-May Liu
Gal Yona
Avi Caciularu
Idan Szpektor
Tim G. J. Rudner
Arman Cohan
265
2
0
30 May 2025
Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization
C. Tan
Yubo Zhou
Haishan Ye
Guang Dai
Junmin Liu
Zengjie Song
Jiangshe Zhang
Zixiang Zhao
Yunda Hao
Yong Xu
AAML
234
0
0
29 May 2025
Network Inversion for Uncertainty-Aware Out-of-Distribution Detection
Pirzada Suhail
Rehna Afroz
Amit Sethi
OODD
UQCV
253
1
0
29 May 2025
Revisiting Reweighted Risk for Calibration: AURC, Focal, and Inverse Focal Loss
Han Zhou
Sebastian G. Gruber
Teodora Popordanoska
Matthew B. Blaschko
436
0
0
29 May 2025
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhitao He
Sandeep Polisetty
Zhiyuan Fan
Yuchen Huang
Shujin Wu
Yi R.
LRM
405
9
0
29 May 2025
Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention
Stephan Rabanser
Ali Shahin Shamsabadi
Olive Franzese
Xiao Wang
Adrian Weller
Nicolas Papernot
168
2
0
29 May 2025
Revisiting Uncertainty Estimation and Calibration of Large Language Models
Linwei Tao
Yi-Fan Yeh
Minjing Dong
Tao Huang
Philip Torr
Chang Xu
205
4
0
29 May 2025