arXiv:2003.07892

Calibration of Pre-trained Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

17 March 2020

Papers citing "Calibration of Pre-trained Transformers"

50 / 243 papers shown

Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs

Srivarshinee Sridhar

Raghav Kaushik Ravi

Kripabandhu Ghosh

80

0

0

27 Nov 2025

Open the Oyster: Empirical Evaluation and Improvement of Code Reasoning Confidence in LLMs

198

0

0

04 Nov 2025

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

...

180

2

0

31 Oct 2025

Efficient semantic uncertainty quantification in language models via diversity-steered sampling

174

0

0

24 Oct 2025

Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models

Christian Hobelsberger

Andreas Nawroth

Oliver Mitevski

Anna-Carolina Haensch

166

2

0

23 Oct 2025

Annotation-Efficient Universal Honesty Alignment

257

1

0

20 Oct 2025

Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models

AAAI Conference on Artificial Intelligence (AAAI), 2025

176

5

0

19 Oct 2025

Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default

305

4

0

11 Oct 2025

SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models

D. Bhattacharjya

Katsiaryna Mirylenka

Michael R. Glass

157

0

0

10 Oct 2025

Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B

Muhammad Imran Zaman

175

0

0

08 Oct 2025

SPECTRA: Revealing the Full Spectrum of User Preferences via Distributional LLM Inference

253

0

0

29 Sep 2025

Calibration Meets Reality: Making Machine Learning Predictions Trustworthy

Kristina P. Sinaga

141

2

0

28 Sep 2025

Less Precise Can Be More Reliable: A Systematic Evaluation of Quantization's Impact on CLIP Beyond Accuracy

Aymen Bouguerra

Alexandra Gomez-Villa

432

0

0

25 Sep 2025

Confidence Calibration in Large Language Model-Based Entity Matching

Juan Cardenas-Cartagena

Floris van Beers

Gineke ten Holt

Tsegaye Misikir Tashu

Matias Valdenegro-Toro

139

0

0

23 Sep 2025

Uncertainty Quantification of Large Language Models using Approximate Bayesian Computation

Zaneta D' Souza

Samira Abbasgholizadeh Rahimi

Sreenath Madathil

173

0

0

19 Sep 2025

LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora

Viviana Luccioli

Rithika Iyengar

Flora Haberkorn

48

0

0

17 Sep 2025

GrACE: A Generative Approach to Better Confidence Elicitation and Efficient Test-Time Scaling in Large Language Models

201

3

0

11 Sep 2025

Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation

202

1

0

08 Sep 2025

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury

Sinead Williamson

Freddie Bickford-Smith

Michael Kirchhof

253

5

0

28 Aug 2025

Do LVLMs Know What They Know? A Systematic Study of Knowledge Boundary Perception in LVLMs

110

1

0

26 Aug 2025

How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models

Abdelrahman Abdallah

Jamshid Mozafari

211

5

0

22 Aug 2025

Towards Universal Neural Likelihood Inference

Shreyas Bhat Brahmavar

Junier B. Oliva

Shashank Srivastava

225

0

0

12 Aug 2025

Towards Transparent AI Grading: Semantic Entropy as a Signal for Human-AI Disagreement

Prasanna Pendse

135

0

0

06 Aug 2025

Shapley Uncertainty in Natural Language Generation

203

0

0

29 Jul 2025

SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models

Taegkeun Whangbo

265

2

0

24 Jul 2025

Theoretical Foundations and Mitigation of Hallucination in Large Language Models

202

4

0

20 Jul 2025

LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Ari Holtzman

322

3

0

22 Jun 2025

Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal Logic

Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025

Rohith Reddy Nama

209

7

0

09 Jun 2025

Trustworthy Medical Question Answering: An Evaluation-Centric Survey

Robert E. Mercer

Sudipta Singha Roy

289

6

0

04 Jun 2025

Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision

127

0

0

04 Jun 2025

MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs

Gabrielle Kaili-May Liu

Tim G. J. Rudner

375

4

0

30 May 2025

Pretrained LLMs Learn Multiple Types of Uncertainty

399

1

0

27 May 2025

InFact: Informativeness Alignment for Improved LLM Factuality

273

1

0

26 May 2025

How Knowledge Popularity Influences and Enhances LLM Knowledge Boundary Perception

344

3

0

23 May 2025

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

Christopher Potts

498

4

0

17 May 2025

Eliminating Hallucination-Induced Errors in LLM Code Generation with Functional Clustering

Chaitanya Ravuri

Saman Amarasinghe

170

4

0

16 May 2025

Always Tell Me The Odds: Fine-grained Conditional Probability Estimation

Zhengping Jiang

Benjamin Van Durme

492

4

0

02 May 2025

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review

Toghrul Abbasli

Kentaroh Toyoda

Muhammad Asif Ali

687

2

0

25 Apr 2025

Feeding LLM Annotations to BERT Classifiers at Your Own Risk

356

1

0

21 Apr 2025

Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection

466

11

0

13 Apr 2025

ML For Hardware Design Interpretability: Challenges and Opportunities

Raymond Baartmans

Andrew Ensinger

Victor Agostinelli

233

2

0

11 Apr 2025

Confidence Regularized Masked Language Modeling using Text Length

433

0

0

08 Apr 2025

Token-Level Uncertainty-Aware Objective for Language Model Post-Training

Ari S. Benjamin

Anthony M. Zador

289

1

0

15 Mar 2025

Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

322

19

0

17 Feb 2025

Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches

366

0

0

13 Feb 2025

The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration

1.0K

3

0

03 Feb 2025

A statistically consistent measure of semantic uncertainty using Language Models

416

0

0

01 Feb 2025

Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models

1.2K

2

0

29 Jan 2025

Reliable Text-to-SQL with Adaptive Abstention

371

16

0

18 Jan 2025

I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

Neural Information Processing Systems (NeurIPS), 2024

Konstantin Dobler

558

22

0

09 Dec 2024
