ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.13677
  4. Cited By
Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results
v1v2 (latest)

Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

Annual Meeting of the Association for Computational Linguistics (ACL), 2025
18 April 2025
Andrea Santilli
Adam Goliñski
Michael Kirchhof
Federico Danieli
Arno Blaas
Miao Xiong
Luca Zappella
Sinead Williamson
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results"

6 / 6 papers shown
Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Mykyta Ielanskyi
Kajetan Schweighofer
L. Aichberger
Sepp Hochreiter
HILM
217
0
0
02 Oct 2025
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity
Benjamin Feuer
Chiung-Yi Tseng
Astitwa Sarthak Lathe
Oussama Elachqar
John P. Dickerson
ELM
249
1
0
24 Sep 2025
The Geometries of Truth Are Orthogonal Across Tasks
Waiss Azizian
Michael Kirchhof
Eugène Ndiaye
Louis Béthune
Stephen Zhang
Pierre Ablin
Marco Cuturi
233
0
0
10 Jun 2025
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Yinong Oliver Wang
N. Sivakumar
Falaah Arif Khan
Rin Metcalf Susa
Adam Goliñski
Natalie Mackraz
B. Theobald
Luca Zappella
N. Apostoloff
290
1
0
29 May 2025
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
Michael Kirchhof
Luca Füger
Adam Goliñski
Eeshan Gunesh Dhekane
Arno Blaas
Seong Joon Oh
Sinead Williamson
UQLMELM
584
3
2
26 May 2025
UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models
UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models
Roman Vashurin
Maiya Goloburda
Preslav Nakov
Maxim Panov
236
0
0
25 May 2025
1