ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.04599
  4. Cited By
On Calibration of Modern Neural Networks
v1v2 (latest)

On Calibration of Modern Neural Networks

International Conference on Machine Learning (ICML), 2017
14 June 2017
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
    UQCV
ArXiv (abs)PDFHTML

Papers citing "On Calibration of Modern Neural Networks"

50 / 3,745 papers shown
Title
Enhancing Multi-Agent Debate System Performance via Confidence Expression
Enhancing Multi-Agent Debate System Performance via Confidence Expression
Zijie Lin
Bryan Hooi
LLMAG
91
1
0
17 Sep 2025
Prompt Stability in Code LLMs: Measuring Sensitivity across Emotion- and Personality-Driven Variations
Prompt Stability in Code LLMs: Measuring Sensitivity across Emotion- and Personality-Driven Variations
Wei Ma
Y. Yang
Jingquan Ge
Xiaofei Xie
Lingxiao Jiang
84
0
0
17 Sep 2025
Post-Hoc Split-Point Self-Consistency Verification for Efficient, Unified Quantification of Aleatoric and Epistemic Uncertainty in Deep Learning
Post-Hoc Split-Point Self-Consistency Verification for Efficient, Unified Quantification of Aleatoric and Epistemic Uncertainty in Deep Learning
Zhizhong Zhao
Ke Chen
UQCV
245
0
0
16 Sep 2025
The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs
The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs
Asif Azad
Mohammad Sadat Hossain
MD Sadik Hossain Shanto
M. Saifur Rahman
Md. Rizwan Parvez
185
0
0
16 Sep 2025
Similarity-Distance-Magnitude Activations
Similarity-Distance-Magnitude Activations
Allen Schmaltz
124
1
0
16 Sep 2025
Selective Risk Certification for LLM Outputs via Information-Lift Statistics: PAC-Bayes, Robustness, and Skeleton Design
Selective Risk Certification for LLM Outputs via Information-Lift Statistics: PAC-Bayes, Robustness, and Skeleton Design
Sanjeda Akter
Ibne Farabi Shihab
Anuj Sharma
168
0
0
16 Sep 2025
Enhancing Dual Network Based Semi-Supervised Medical Image Segmentation with Uncertainty-Guided Pseudo-Labeling
Enhancing Dual Network Based Semi-Supervised Medical Image Segmentation with Uncertainty-Guided Pseudo-LabelingKnowledge-Based Systems (KBS), 2025
Yunyao Lu
Yihang Wu
Ahmad Chaddad
Tareef Daqqaq
R. Kateb
93
0
0
16 Sep 2025
Op-Fed: Opinion, Stance, and Monetary Policy Annotations on FOMC Transcripts Using Active Learning
Op-Fed: Opinion, Stance, and Monetary Policy Annotations on FOMC Transcripts Using Active Learning
Alisa Kanganis
Katherine A. Keith
97
0
0
16 Sep 2025
E2-BKI: Evidential Ellipsoidal Bayesian Kernel Inference for Uncertainty-aware Gaussian Semantic Mapping
E2-BKI: Evidential Ellipsoidal Bayesian Kernel Inference for Uncertainty-aware Gaussian Semantic Mapping
Junyoung Kim
Minsik Jeon
Jihong Min
K. Kwak
Junwon Seo
68
1
0
15 Sep 2025
Protected Probabilistic Classification Library
Protected Probabilistic Classification Library
Ivan Petej
85
0
0
14 Sep 2025
The Impact of Skin Tone Label Granularity on the Performance and Fairness of AI Based Dermatology Image Classification Models
The Impact of Skin Tone Label Granularity on the Performance and Fairness of AI Based Dermatology Image Classification Models
Partha Shah
Durva Sankhe
Maariyah Rashid
Zakaa Khaled
Esther Puyol-Antón
Tiarna Lee
Maram Alqarni
Sweta Rai
Andrew P. King
48
0
0
14 Sep 2025
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
Iván Vicente Moreno Cencerrado
Arnau Padrés Masdemont
Anton Gonzalvez Hawthorne
David Demitri Africa
Lorenzo Pacchiardi
ELM
119
2
0
12 Sep 2025
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Zhiyuan Huang
Jiahao Chen
Yurou Liu
Bing Su
145
0
0
12 Sep 2025
Proximity-Based Evidence Retrieval for Uncertainty-Aware Neural Networks
Proximity-Based Evidence Retrieval for Uncertainty-Aware Neural Networks
Hassan Gharoun
M. S. Khorshidi
Kasra Ranjbarigderi
Fang Chen
Amir H. Gandomi
EDL
178
0
0
11 Sep 2025
Model-Agnostic Open-Set Air-to-Air Visual Object Detection for Reliable UAV Perception
Model-Agnostic Open-Set Air-to-Air Visual Object Detection for Reliable UAV Perception
Spyridon Loukovitis
Anastasios Arsenos
Vasileios Karampinis
Athanasios Voulodimos
72
1
0
11 Sep 2025
GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
Zhaohan Zhang
Ziquan Liu
Ioannis Patras
132
2
0
11 Sep 2025
Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation
Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation
Anjie Qiao
Zhen Wang
Chuan Chen
Defu Lian
Tong Xu
DiffM
132
0
0
11 Sep 2025
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic CalibrationAAAI Conference on Artificial Intelligence (AAAI), 2025
Hyeonseok Kim
Byeongkeun Kang
Yeejin Lee
3DPC
112
0
0
10 Sep 2025
Too Helpful, Too Harmless, Too Honest or Just Right?
Too Helpful, Too Harmless, Too Honest or Just Right?
Gautam Siddharth Kashyap
Mark Dras
Usman Naseem
MoE
179
1
0
10 Sep 2025
Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics
Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics
Dikshant Sagar
Kaiwen Yu
A. Yankelevich
J. Bian
Pierre Baldi
124
0
0
10 Sep 2025
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Eric Slyman
Mehrab Tanjim
Kushal Kafle
Stefan Lee
113
0
0
10 Sep 2025
Instance-level Performance Prediction for Long-form Generation Tasks
Instance-level Performance Prediction for Long-form Generation Tasks
Chi-Yang Hsu
Alexander Braylan
Yiheng Su
Omar Alonso
Matthew Lease
88
0
0
09 Sep 2025
Measuring Uncertainty in Transformer Circuits with Effective Information Consistency
Measuring Uncertainty in Transformer Circuits with Effective Information Consistency
Anatoly A. Krasnovsky
59
0
0
08 Sep 2025
Statistical Methods in Generative AI
Statistical Methods in Generative AI
Edgar Dobriban
269
3
0
08 Sep 2025
Performance of Conformal Prediction in Capturing Aleatoric Uncertainty
Performance of Conformal Prediction in Capturing Aleatoric Uncertainty
Misgina Tsighe Hagos
Claes Lundström
114
0
0
06 Sep 2025
3DPillars: Pillar-based two-stage 3D object detection
3DPillars: Pillar-based two-stage 3D object detectionExpert systems with applications (ESWA), 2025
Jongyoun Noh
J. Lee
Hyekang Park
Bumsub Ham
3DPC
104
0
0
06 Sep 2025
Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation
Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation
Svetlana Pavlitska
Beyza Keskin
Alwin Faßbender
Christian Hubschneider
Johann Marius Zöllner
UQCVMoE
190
2
0
05 Sep 2025
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
Ibne Farabi Shihab
Sanjeda Akter
Anuj Sharma
OffRL
95
0
0
04 Sep 2025
Variational Gaussian Mixture Manifold Models for Client-Specific Federated Personalization
Variational Gaussian Mixture Manifold Models for Client-Specific Federated Personalization
Sai Puppala
Ismail Hossain
Md. jahangir Alam
Sajedul Talukder
FedML
64
0
0
04 Sep 2025
Revisiting (Un)Fairness in Recourse by Minimizing Worst-Case Social Burden
Revisiting (Un)Fairness in Recourse by Minimizing Worst-Case Social Burden
Ainhize Barrainkua
Giovanni De Toni
Jose A. Lozano
Novi Quadrianto
FaML
303
0
0
04 Sep 2025
StableSleep: Source-Free Test-Time Adaptation for Sleep Staging with Lightweight Safety Rails
StableSleep: Source-Free Test-Time Adaptation for Sleep Staging with Lightweight Safety Rails
Hritik Arasu
Faisal R Jahangiri
TTA
181
0
0
03 Sep 2025
The Protocol Genome A Self Supervised Learning Framework from DICOM Headers
The Protocol Genome A Self Supervised Learning Framework from DICOM Headers
Jimmy Joseph
MedIm
26
0
0
03 Sep 2025
The distribution of calibrated likelihood functions on the probability-likelihood Aitchison simplex
The distribution of calibrated likelihood functions on the probability-likelihood Aitchison simplex
Paul-Gauthier Noé
A. Nautsch
D. Matrouf
Pierre-Michel Bousquet
J. Bonastre
138
0
0
03 Sep 2025
Calibration Prediction Interval for Non-parametric Regression and Neural Networks
Calibration Prediction Interval for Non-parametric Regression and Neural Networks
Kejin Wu
D. Politis
72
0
0
02 Sep 2025
Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence Calibration and Risk-Controlled Refusal
Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence Calibration and Risk-Controlled Refusal
Markus Oehri
Giulia Conti
Kaviraj Pather
Alexandre Rossi
Laia Serra
Adrian Parody
Rogvi Johannesen
Aviaja Petersen
Arben Krasniqi
116
0
0
01 Sep 2025
Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
Zeguan Xiao
Diyang Dou
Boya Xiong
Yun-Nung Chen
Guanhua Chen
65
0
0
01 Sep 2025
Multimodal Generative Flows for LHC Jets
Multimodal Generative Flows for LHC Jets
Darius A. Faroughy
Manfred Opper
C. Ojeda
128
0
0
01 Sep 2025
REVELIO -- Universal Multimodal Task Load Estimation for Cross-Domain Generalization
REVELIO -- Universal Multimodal Task Load Estimation for Cross-Domain Generalization
Maximilian P. Oppelt
Andreas Foltyn
Nadine R. Lang-Richter
Bjoern M. Eskofier
108
0
0
01 Sep 2025
Energy Landscapes Enable Reliable Abstention in Retrieval-Augmented Large Language Models for Healthcare
Energy Landscapes Enable Reliable Abstention in Retrieval-Augmented Large Language Models for Healthcare
Ravi Shankar
Sheng Wong
Lin Li
Magdalena Bachmann
Alex Silverthorne
Beth Albert
Gabriel Davis Jones
140
0
0
31 Aug 2025
Game Theoretic Resilience Recommendation Framework for CyberPhysical Microgrids Using Hypergraph MetaLearning
Game Theoretic Resilience Recommendation Framework for CyberPhysical Microgrids Using Hypergraph MetaLearning
S Krishna Niketh
P. Panigrahi
V Vignesh
M. Pal
56
0
0
30 Aug 2025
Failure Prediction Is a Better Performance Proxy for Early-Exit Networks Than Calibration
Failure Prediction Is a Better Performance Proxy for Early-Exit Networks Than Calibration
Piotr Kubaty
Filip Szatkowski
Metod Jazbec
Bartosz Wójcik
159
0
0
29 Aug 2025
Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks
Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks
Amirhossein Nazeri
Wael Hafez
AAML
76
0
0
29 Aug 2025
Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?
Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?
Yang Nan
Pengfei He
Ravi Tandon
Han Xu
76
0
0
28 Aug 2025
Gradient Rectification for Robust Calibration under Distribution Shift
Gradient Rectification for Robust Calibration under Distribution Shift
Yilin Zhang
Cai Xu
Y. Wu
Ziyu Guan
Wei Zhao
156
0
0
27 Aug 2025
ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation
ALSA: Anchors in Logit Space for Out-of-Distribution Accuracy Estimation
Chenzhi Liu
Mahsa Baktashmotlagh
Yanran Tang
Zi Huang
Ruihong Qiu
100
0
0
27 Aug 2025
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
Zhixiang Chi
Yanan Wu
Li Gu
Huan Liu
Ziqiang Wang
Yang Zhang
Yang Wang
Konstantinos N. Plataniotis
VLM
92
0
0
27 Aug 2025
ATMS-KD: Adaptive Temperature and Mixed Sample Knowledge Distillation for a Lightweight Residual CNN in Agricultural Embedded Systems
ATMS-KD: Adaptive Temperature and Mixed Sample Knowledge Distillation for a Lightweight Residual CNN in Agricultural Embedded Systems
Mohamed Ohamouddou
Said Ohamouddou
A. E. Afia
Rafik Lasri
80
0
0
27 Aug 2025
The Role of Teacher Calibration in Knowledge Distillation
The Role of Teacher Calibration in Knowledge DistillationIEEE Access (IEEE Access), 2025
Suyoung Kim
Seonguk Park
Junhoo Lee
Nojun Kwak
64
0
0
27 Aug 2025
Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap
Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap
Jun Wang
Ninglun Gu
Kailai Zhang
Zijiao Zhang
Yelun Bao
...
Liwei Liu
Yihuan Liu
Pengyong Li
Gary G. Yen
Junchi Yan
ALMELM
180
0
0
26 Aug 2025
ConfTuner: Training Large Language Models to Express Their Confidence Verbally
ConfTuner: Training Large Language Models to Express Their Confidence Verbally
Yibo Li
Miao Xiong
Jiaying Wu
Bryan Hooi
144
7
0
26 Aug 2025
Previous
123456...737475
Next