Papers citing 'On Calibration of Modern Neural Networks'

Title
The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models Lijun Sheng Jian Liang Ran He Z. Wang Tieniu Tan VLM MLLM 271 1 0 30 Jun 2025
Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 Xing Shen Justin Szeto Mingyang Li Hengguan Huang Tal Arbel 209 1 0 29 Jun 2025
HazeMatching: Dehazing Light Microscopy Images with Guided Conditional Flow Matching Anirban Ray Ashesh Ashesh Florian Jug 257 0 0 27 Jun 2025
Active Inference AI Systems for Scientific Discovery Karthik Duraisamy AI4CE LRM 335 1 0 26 Jun 2025
Precise Bayesian Neural Networks Carlos Stein Brito 87 0 0 24 Jun 2025
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models Xinyi Liu Weiguang Wang Hangfeng He 183 0 0 20 Jun 2025
Keeping Medical AI Healthy and Trustworthy: A Review of Detection and Correction Methods for System Degradation Hao Guan D. Bates Li Zhou OOD 145 4 0 20 Jun 2025
A Hybrid DeBERTa and Gated Broad Learning System for Cyberbullying Detection in English Text Devesh Kumar 100 0 0 19 Jun 2025
Spatially-Aware Evaluation of Segmentation Uncertainty Tal Zeevi Eléonore V. Lieffrig Lawrence H. Staib John Onofrey 147 2 0 19 Jun 2025
One Sample is Enough to Make Conformal Prediction Robust Soroush H. Zargarbashi Mohammad Sadegh Akhondzadeh Aleksandar Bojchevski 151 3 0 19 Jun 2025
Loss-Oriented Ranking for Automated Visual Prompting in LVLMs Yuan Zhang Chun-Kai Fan Tao Huang Ming Lu Sicheng Yu Junwen Pan Kuan Cheng Qi She Shanghang Zhang VLM LRM 198 2 0 19 Jun 2025
Modeling the One-to-Many Property in Open-Domain Dialogue with LLMs Jing Yang Lee Kong-Aik Lee Woon-Seng Gan 210 0 0 18 Jun 2025
Uncertainty Estimation by Human Perception versus Neural Models Pedro Mendes Paolo Romano David Garlan 211 0 0 18 Jun 2025
ODD: Overlap-aware Estimation of Model Performance under Distribution Shift. Conference on Uncertainty in Artificial Intelligence (UAI), 2025 Aayush Mishra Anqi Liu 160 1 0 17 Jun 2025
Enclosing Prototypical Variational Autoencoder for Explainable Out-of-Distribution Detection. International Conference on Computer Safety, Reliability, and Security (SAFECOMP), 2025 Conrad Orglmeister Erik Bochinski Volker Eiselein Elvira Fleig 155 0 0 17 Jun 2025
Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs Gerardo Flores Alyssa H. Smith Julia A Fukuyama Ashia C. Wilson 195 1 0 17 Jun 2025
Uncertainty-Aware Graph Neural Networks: A Multi-Hop Evidence Fusion Approach. IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025 Qingfeng Chen Shiyuan Li Yixin Liu Shirui Pan Geoffrey I. Webb Shichao Zhang EDL 278 8 0 16 Jun 2025
Generative or Discriminative? Revisiting Text Classification in the Era of Transformers Siva Rajesh Kasa Karan Gupta Sumegh Roychowdhury Ashutosh Kumar Yaswanth Biruduraju Santhosh Kumar Kasa Nikhil Pattisapu Arindam Bhattacharya Shailendra Agarwal Vijay Huddar 164 2 0 13 Jun 2025
Ground Reaction Force Estimation via Time-aware Knowledge Distillation. IEEE Internet of Things Journal (IEEE IoT J.), 2025 Eun Som Jeon Sinjini Mitra Jisoo Lee Omik M. Save Ankita Shukla Hyunglae Lee Pavan Turaga 297 1 0 12 Jun 2025
Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration Kyohei Atarashi S. Oyama Hiromi Arai H. Kashima 205 0 0 12 Jun 2025
Textual Bayes: Quantifying Uncertainty in LLM-Based Systems Brendan Leigh Ross Noël Vouitsis Atiyeh Ashari Ghomi Rasa Hosseinzadeh Ji Xin ... Yi Sui Shiyi Hou Kin Kwan Leung Gabriel Loaiza-Ganem Jesse C. Cresswell 292 3 0 11 Jun 2025
Test-Time-Scaling for Zero-Shot Diagnosis with Visual-Language Reasoning Ji Young Byun Young-Jin Park Navid Azizan Rama Chellappa LM&MA LRM 137 1 0 11 Jun 2025
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments? Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025 Andreas Säuberli Diego Frassinelli Barbara Plank AI4Ed 227 2 0 11 Jun 2025
Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks Achim Hekler Lukas Kuhn Florian Buettner UQCV 231 1 0 11 Jun 2025
Balanced Hyperbolic Embeddings Are Natural Out-of-Distribution Detectors Tejaswi Kasarla Max van Spengler Pascal Mettes OODD 273 2 0 11 Jun 2025
Know What You Don't Know: Uncertainty Calibration of Process Reward Models Young-Jin Park Kristjan Greenewald Kaveh Alim Hao Wang Navid Azizan LRM 352 3 0 11 Jun 2025
Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal Logic. Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025 Zhenjiang Mao Artem Bisliouk Rohith Reddy Nama Ivan Ruchkin ReLM LRM 142 1 0 09 Jun 2025
WWAggr: A Window Wasserstein-based Aggregation for Ensemble Change Point Detection Alexander Stepikin Galina Boeva Alexey Zaytsev 279 0 0 09 Jun 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered Siddartha Devic Tejas Srinivasan Jesse Thomason Willie Neiswanger 183 7 0 09 Jun 2025
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning Vahid Balazadeh Hamidreza Kamkari Valentin Thomas Benson Li Junwei Ma Jesse C. Cresswell Rahul G. Krishnan CML 186 5 0 09 Jun 2025
Residual Reweighted Conformal Prediction for Graph Neural Networks. Conference on Uncertainty in Artificial Intelligence (UAI), 2025 Zheng Zhang Jie Bao Zhixin Zhou Nicolo Colombo Lixin Cheng Rui Luo 194 3 0 09 Jun 2025
AssertBench: A Benchmark for Evaluating Self-Assertion in Large Language Models Jaeho Lee Atharv Chowdhary HILM 161 0 0 08 Jun 2025
Improving Wildlife Out-of-Distribution Detection: Africas Big Five Mufhumudzi Muthivhi Jiahao Huo Fredrik Gustafsson Terence L van Zyl OODD 148 1 0 07 Jun 2025
Quantifying Adversarial Uncertainty in Evidential Deep Learning using Conflict Resolution Charmaine Barker Daniel Bethell Simos Gerasimou AAML 294 0 0 06 Jun 2025
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness Rongzhe Wei Peizhi Niu Hans Hao-Hsun Hsu Ruihan Wu Haoteng Yin ... Vamsi K. Potluru Eli Chien Kamalika Chaudhuri S. Rasoul Etesami P. Li MU KELM 467 6 0 06 Jun 2025
Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness S. Landgraf Markus Hillemann Markus Ulrich UQCV 216 0 0 06 Jun 2025
Compositional Generalisation for Explainable Hate Speech Detection Agostina Calabrese Tom Sherborne Bjorn Ross Mirella Lapata 208 1 0 04 Jun 2025
Trustworthy Medical Question Answering: An Evaluation-Centric Survey Yinuo Wang Robert E. Mercer Frank Rudzicz Sudipta Singha Roy Pengjie Ren Zhumin Chen Xindi Wang ELM 208 2 0 04 Jun 2025
Trusted Fake Audio Detection Based on Dirichlet Distribution Chi Ding Junxiao Xue Cong Wang Hao Zhou 168 0 0 03 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations Jinyuan Luo Zhen Fang Shouqing Yang Seongheon Park Ling Chen AAML HILM 213 0 0 03 Jun 2025
Self-ensemble: Mitigating Confidence Mis-calibration for Large Language Models Zicheng Xu Guanchu Wang Guangyao Zheng Yu-Neng Chuang A. Szalay Helen Zhou Vladimir Braverman 242 1 0 02 Jun 2025
AIMSCheck: Leveraging LLMs for AI-Assisted Review of Modern Slavery Statements Across Jurisdictions. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Adriana Eufrosina Bora Akshatha Arodi Duoyi Zhang Jordan Bannister Mirko Bronzi Arsène Fansi Tchango M. A. Bashar R. Nayak Kerrie Mengersen 108 2 0 02 Jun 2025
Mispronunciation Detection Without L2 Pronunciation Dataset in Low-Resource Setting: A Case Study in Finland Swedish Nhan Phan Mikko Kuronen Maria Kautonen Riikka Ullakonoja Anna von Zansen Yaroslav Getman Ekaterina Voskoboinik Tamás Grósz M. Kurimo 108 0 0 01 Jun 2025
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs Gabrielle Kaili-May Liu Gal Yona Avi Caciularu Idan Szpektor Tim G. J. Rudner Arman Cohan 265 2 0 30 May 2025
Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization C. Tan Yubo Zhou Haishan Ye Guang Dai Junmin Liu Zengjie Song Jiangshe Zhang Zixiang Zhao Yunda Hao Yong Xu AAML 234 0 0 29 May 2025
Network Inversion for Uncertainty-Aware Out-of-Distribution Detection Pirzada Suhail Rehna Afroz Amit Sethi OODD UQCV 253 1 0 29 May 2025
Revisiting Reweighted Risk for Calibration: AURC, Focal, and Inverse Focal Loss Han Zhou Sebastian G. Gruber Teodora Popordanoska Matthew B. Blaschko 436 0 0 29 May 2025
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 Zhitao He Sandeep Polisetty Zhiyuan Fan Yuchen Huang Shujin Wu Yi R. LRM 405 9 0 29 May 2025
Confidential Guardian: Cryptographically Prohibiting the Abuse of Model Abstention Stephan Rabanser Ali Shahin Shamsabadi Olive Franzese Xiao Wang Adrian Weller Nicolas Papernot 168 2 0 29 May 2025
Revisiting Uncertainty Estimation and Calibration of Large Language Models Linwei Tao Yi-Fan Yeh Minjing Dong Tao Huang Philip Torr Chang Xu 205 4 0 29 May 2025