ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.07892
  4. Cited By
Calibration of Pre-trained Transformers
v1v2v3 (latest)

Calibration of Pre-trained Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
17 March 2020
Shrey Desai
Greg Durrett
    UQLM
ArXiv (abs)PDFHTML

Papers citing "Calibration of Pre-trained Transformers"

50 / 243 papers shown
Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs
Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs
Srivarshinee Sridhar
Raghav Kaushik Ravi
Kripabandhu Ghosh
80
0
0
27 Nov 2025
Open the Oyster: Empirical Evaluation and Improvement of Code Reasoning Confidence in LLMs
Open the Oyster: Empirical Evaluation and Improvement of Code Reasoning Confidence in LLMs
Shufan Wang
Xing Hu
Junkai Chen
Zhiyuan Pan
Xin Xia
LRM
198
0
0
04 Nov 2025
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration
Shaojie Zhang
Pei Fu
Ruoceng Zhang
Jiahui Yang
Anan Du
...
S. Wang
Ying Huang
Bin Qin
Zhenbo Luo
Jian Luan
180
2
0
31 Oct 2025
Efficient semantic uncertainty quantification in language models via diversity-steered sampling
Efficient semantic uncertainty quantification in language models via diversity-steered sampling
Ji Won Park
K. Cho
174
0
0
24 Oct 2025
Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
Christian Hobelsberger
Theresa Winner
Andreas Nawroth
Oliver Mitevski
Anna-Carolina Haensch
ELM
166
2
0
23 Oct 2025
Annotation-Efficient Universal Honesty Alignment
Annotation-Efficient Universal Honesty Alignment
Shiyu Ni
Keping Bi
Jiafeng Guo
Minghao Tang
Jingtong Wu
Zengxin Han
Xueqi Cheng
HILM
257
1
0
20 Oct 2025
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2025
K. Cox
Jiawei Xu
Yikun Han
Rong Xu
Tianhao Li
Chi-Yang Hsu
Tianlong Chen
Walter Gerych
Ying Ding
176
5
0
19 Oct 2025
Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default
Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default
Jiaqi Liu
Tong Wang
Su Liu
Xin Hu
Ran Tong
Lanruo Wang
Jiexi Xu
305
4
0
11 Oct 2025
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models
D. Bhattacharjya
Balaji Ganesan
Junkyu Lee
Radu Marinescu
Katsiaryna Mirylenka
Michael R. Glass
Xiao Shou
157
0
0
10 Oct 2025
Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B
Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B
Nisar Ahmed
Muhammad Imran Zaman
Gulshan Saleem
Ali Hassan
LRM
175
0
0
08 Oct 2025
SPECTRA: Revealing the Full Spectrum of User Preferences via Distributional LLM Inference
SPECTRA: Revealing the Full Spectrum of User Preferences via Distributional LLM Inference
Luyang Zhang
Siyuan Peng
Jialu Wang
Shichao Zhu
Beibei Li
Zhongcun Wang
Guangmou Pan
253
0
0
29 Sep 2025
Calibration Meets Reality: Making Machine Learning Predictions Trustworthy
Calibration Meets Reality: Making Machine Learning Predictions Trustworthy
Kristina P. Sinaga
Arjun S. Nair
141
2
0
28 Sep 2025
Less Precise Can Be More Reliable: A Systematic Evaluation of Quantization's Impact on CLIP Beyond Accuracy
Less Precise Can Be More Reliable: A Systematic Evaluation of Quantization's Impact on CLIP Beyond Accuracy
Aymen Bouguerra
Daniel Montoya
Alexandra Gomez-Villa
Fabio Arnez
Chokri Mraidha
UQCV
432
0
0
25 Sep 2025
Confidence Calibration in Large Language Model-Based Entity Matching
Confidence Calibration in Large Language Model-Based Entity Matching
Iris Kamsteeg
Juan Cardenas-Cartagena
Floris van Beers
Gineke ten Holt
Tsegaye Misikir Tashu
Matias Valdenegro-Toro
139
0
0
23 Sep 2025
Uncertainty Quantification of Large Language Models using Approximate Bayesian Computation
Uncertainty Quantification of Large Language Models using Approximate Bayesian Computation
Mridul Sharma
Adeetya Patel
Zaneta D' Souza
Samira Abbasgholizadeh Rahimi
Siva Reddy
Sreenath Madathil
173
0
0
19 Sep 2025
LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora
LLM on a Budget: Active Knowledge Distillation for Efficient Classification of Large Text Corpora
Viviana Luccioli
Rithika Iyengar
Ryan Panley
Flora Haberkorn
Xiaoyu Ge
Leland Crane
Nitish Sinha
Seung Jung Lee
48
0
0
17 Sep 2025
GrACE: A Generative Approach to Better Confidence Elicitation and Efficient Test-Time Scaling in Large Language Models
GrACE: A Generative Approach to Better Confidence Elicitation and Efficient Test-Time Scaling in Large Language Models
Zhaohan Zhang
Ziquan Liu
Ioannis Patras
ELM
201
3
0
11 Sep 2025
Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation
Rule-Based Moral Principles for Explaining Uncertainty in Natural Language Generation
Zahra Atf
Peter Lewis
202
1
0
08 Sep 2025
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
Deepro Choudhury
Sinead Williamson
Adam Goliñski
Ning Miao
Freddie Bickford-Smith
Michael Kirchhof
Yizhe Zhang
Tom Rainforth
253
5
0
28 Aug 2025
Do LVLMs Know What They Know? A Systematic Study of Knowledge Boundary Perception in LVLMs
Do LVLMs Know What They Know? A Systematic Study of Knowledge Boundary Perception in LVLMs
Zhikai Ding
Shiyu Ni
Keping Bi
110
1
0
26 Aug 2025
How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models
How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models
Abdelrahman Abdallah
Bhawna Piryani
Jamshid Mozafari
Mohammed Ali
Adam Jatowt
LRM
211
5
0
22 Aug 2025
Towards Universal Neural Likelihood Inference
Towards Universal Neural Likelihood Inference
Shreyas Bhat Brahmavar
Yang Li
Junier B. Oliva
Shashank Srivastava
Junier Oliva
OOD
225
0
0
12 Aug 2025
Towards Transparent AI Grading: Semantic Entropy as a Signal for Human-AI Disagreement
Towards Transparent AI Grading: Semantic Entropy as a Signal for Human-AI Disagreement
Karrtik Iyer
M. R
Prasanna Pendse
Shayan Mohanty
135
0
0
06 Aug 2025
Shapley Uncertainty in Natural Language Generation
Shapley Uncertainty in Natural Language Generation
Meilin Zhu
Gaojie Jin
Xiaowei Huang
Lijun Zhang
203
0
0
29 Jul 2025
SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models
SCOPE: Stochastic and Counterbiased Option Placement for Evaluating Large Language Models
Wonjun Jeong
Dongseok Kim
Taegkeun Whangbo
265
2
0
24 Jul 2025
Theoretical Foundations and Mitigation of Hallucination in Large Language Models
Theoretical Foundations and Mitigation of Hallucination in Large Language Models
Esmail Gumaan
HILM
202
4
0
20 Jul 2025
LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
Chenghao Yang
Ari Holtzman
Ari Holtzman
322
3
0
22 Jun 2025
Temporalizing Confidence: Evaluation of Chain-of-Thought Reasoning with Signal Temporal LogicWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Zhenjiang Mao
Artem Bisliouk
Rohith Reddy Nama
Ivan Ruchkin
ReLMLRM
209
7
0
09 Jun 2025
Trustworthy Medical Question Answering: An Evaluation-Centric Survey
Trustworthy Medical Question Answering: An Evaluation-Centric Survey
Yinuo Wang
Robert E. Mercer
Frank Rudzicz
Sudipta Singha Roy
Sudipta Singha Roy
Pengjie Ren
Zhumin Chen
Xindi Wang
ELM
289
6
0
04 Jun 2025
Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision
Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision
Chaeyun Jang
Moonseok Choi
Yegon Kim
Hyungi Lee
Juho Lee
ReLMLRM
127
0
0
04 Jun 2025
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
Gabrielle Kaili-May Liu
Gal Yona
Avi Caciularu
Idan Szpektor
Tim G. J. Rudner
Arman Cohan
375
4
0
30 May 2025
Pretrained LLMs Learn Multiple Types of Uncertainty
Pretrained LLMs Learn Multiple Types of Uncertainty
Roi Cohen
Omri Fahn
Gerard de Melo
399
1
0
27 May 2025
InFact: Informativeness Alignment for Improved LLM Factuality
InFact: Informativeness Alignment for Improved LLM Factuality
Roi Cohen
Russa Biswas
Gerard de Melo
273
1
0
26 May 2025
How Knowledge Popularity Influences and Enhances LLM Knowledge Boundary Perception
How Knowledge Popularity Influences and Enhances LLM Knowledge Boundary Perception
Shiyu Ni
Keping Bi
Jiafeng Guo
Xueqi Cheng
344
3
0
23 May 2025
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors
Jing Huang
Junyi Tao
Thomas Icard
Diyi Yang
Christopher Potts
OODD
498
4
0
17 May 2025
Eliminating Hallucination-Induced Errors in LLM Code Generation with Functional Clustering
Eliminating Hallucination-Induced Errors in LLM Code Generation with Functional Clustering
Chaitanya Ravuri
Saman Amarasinghe
170
4
0
16 May 2025
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Liaoyaqi Wang
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
492
4
0
02 May 2025
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Toghrul Abbasli
Kentaroh Toyoda
Yuan Wang
Leon Witt
Muhammad Asif Ali
Yukai Miao
Dan Li
Qingsong Wei
UQCVHILM
687
2
0
25 Apr 2025
Feeding LLM Annotations to BERT Classifiers at Your Own Risk
Feeding LLM Annotations to BERT Classifiers at Your Own Risk
Yucheng Lu
Kazimier Smith
356
1
0
21 Apr 2025
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection
MingShan Liu
Shi Bo
LRM
466
11
0
13 Apr 2025
ML For Hardware Design Interpretability: Challenges and Opportunities
ML For Hardware Design Interpretability: Challenges and Opportunities
Raymond Baartmans
Andrew Ensinger
Victor Agostinelli
Lizhong Chen
233
2
0
11 Apr 2025
Confidence Regularized Masked Language Modeling using Text Length
Confidence Regularized Masked Language Modeling using Text Length
Seunghyun Ji
Soowon Lee
433
0
0
08 Apr 2025
Token-Level Uncertainty-Aware Objective for Language Model Post-Training
Token-Level Uncertainty-Aware Objective for Language Model Post-Training
Tingkai Liu
Ari S. Benjamin
Anthony M. Zador
289
1
0
15 Mar 2025
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary PerceptionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Shiyu Ni
Keping Bi
Jiafeng Guo
Lulu Yu
Baolong Bi
Xueqi Cheng
322
19
0
17 Feb 2025
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
D. Elbaz
Oren Salzman
OffRL
366
0
0
13 Feb 2025
The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration
The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration
Wei Yao
Wenkai Yang
Liang Luo
Yankai Lin
Yong Liu
Yong Liu
ELM
1.0K
3
0
03 Feb 2025
A statistically consistent measure of semantic uncertainty using Language Models
A statistically consistent measure of semantic uncertainty using Language Models
Yi Liu
416
0
0
01 Feb 2025
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models
Behraj Khan
T. Syed
1.2K
2
0
29 Jan 2025
Reliable Text-to-SQL with Adaptive Abstention
Reliable Text-to-SQL with Adaptive Abstention
Kaiwen Chen
Yueting Chen
Xiaohui Yu
Nick Koudas
RALM
371
16
0
18 Jan 2025
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] TokenNeural Information Processing Systems (NeurIPS), 2024
Roi Cohen
Konstantin Dobler
Eden Biran
Gerard de Melo
558
22
0
09 Dec 2024
12345
Next
Page 1 of 5