v1v2 (latest)

Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

18 April 2025

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results"

6 / 6 papers shown

Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation

217

02 Oct 2025

When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity

Benjamin Feuer

Chiung-Yi Tseng

Astitwa Sarthak Lathe

Oussama Elachqar

John P. Dickerson

ELM

249

24 Sep 2025

The Geometries of Truth Are Orthogonal Across Tasks

233

10 Jun 2025

Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs

290

29 May 2025

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?

Michael Kirchhof

Luca Füger

Adam Goliñski

Eeshan Gunesh Dhekane

584

26 May 2025

UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models

236

25 May 2025