v1v2 (latest)

AmbigQA: Answering Ambiguous Open-domain Questions

22 April 2020

Luke Zettlemoyer

Papers citing "AmbigQA: Answering Ambiguous Open-domain Questions"

50 / 275 papers shown

When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering

378

04 Dec 2025

Learning Steerable Clarification Policies with Collaborative Self-play

248

03 Dec 2025

Fantastic Bugs and Where to Find Them in AI Benchmarks

...

167

20 Nov 2025

Reasoning about Intent for Ambiguous Requests

Irina Saparina

Mirella Lapata

AI4CE

207

13 Nov 2025

The Illusion of Certainty: Uncertainty Quantification for LLMs Fails under Ambiguity

206

06 Nov 2025

KGFR: A Foundation Retriever for Generalized Knowledge Graph Question Answering

321

06 Nov 2025

Beyond Single Embeddings: Capturing Diverse Targets with Multi-Query Retrieval

153

04 Nov 2025

DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness

159

03 Nov 2025

ChessQA: Evaluating Large Language Models for Chess Understanding

237

28 Oct 2025

Efficient semantic uncertainty quantification in language models via diversity-steered sampling

Ji Won Park

K. Cho

176

24 Oct 2025

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

635

19 Oct 2025

ESI: Epistemic Uncertainty Quantification via Semantic-preserving Intervention for Large Language Models

211

15 Oct 2025

Teaching Language Models to Faithfully Express their Uncertainty

216

14 Oct 2025

Generation Space Size: Understanding and Calibrating Open-Endedness of LLM Generations

258

14 Oct 2025

VeriCite: Towards Reliable Citations in Retrieval-Augmented Generation via Rigorous Verification

164

13 Oct 2025

RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models

Aashiq Muhamed

Leonardo F. R. Ribeiro

Markus Dreyer

Virginia Smith

Mona Diab

155

12 Oct 2025

Trace Length is a Simple Uncertainty Signal in Reasoning Models

201

12 Oct 2025

ConDABench: Interactive Evaluation of Language Models for Data Analysis

242

10 Oct 2025

^2

Search: Ambiguity-Aware Question Answering with Reinforcement Learning

150

09 Oct 2025

QGraphLIME - Explaining Quantum Graph Neural Networks

293

07 Oct 2025

BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions

...

Yannis Papakonstantinou

Reynold Cheng

LMTD VLM

333

06 Oct 2025

Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models

Sina J. Semnani

Jirayu Burapacheep

Arpandeep Khatua

Thanawan Atchariyachanvanit

Zheng Wang

M. Lam

KELM

176

27 Sep 2025

MARCH: Evaluating the Intersection of Ambiguity Interpretation and Multi-hop Inference

201

26 Sep 2025

Fine-Grained Uncertainty Decomposition in Large Language Models: A Spectral ApproachIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2025

595

26 Sep 2025

Unsupervised Conformal Inference: Bootstrapping and Alignment to Control LLM Uncertainty

150

26 Sep 2025

It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge

Lukas Ellinger

Georg Groh

154

19 Sep 2025

Relevance to Utility: Process-Supervised Rewrite for RAG

261

19 Sep 2025

Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs

147

17 Sep 2025

Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?

145

28 Aug 2025

Identifying and Answering Questions with False Assumptions: An Interpretable Approach

Zijie Wang

Eduardo Blanco

HILM

264

21 Aug 2025

Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering

183

17 Aug 2025

Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information

258

15 Aug 2025

TRAIL: Joint Inference and Refinement of Knowledge Graphs with Large Language Models

145

06 Aug 2025

MAO-ARAG: Multi-Agent Orchestration for Adaptive Retrieval-Augmented Generation

183

01 Aug 2025

Which LLMs Get the Joke? Probing Non-STEM Reasoning Abilities with HumorBench

Reuben Narad

Siddharth Suresh

Jiayi Chen

Pine S.L. Dysart-Bricken

225

29 Jul 2025

PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation

146

23 Jul 2025

Awakening LLMs' Reasoning Potential: A Fine-Grained Pipeline to Evaluate and Mitigate Vague Perception

522

22 Jul 2025

Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual QuestionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

240

18 Jul 2025

Read the Docs Before Rewriting: Equip Rewriter with Domain Knowledge via Continual Pre-training

259

01 Jul 2025

Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis

325

27 Jun 2025

MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models

...

316

20 Jun 2025

The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models

Xinyi Liu

Weiguang Wang

Hangfeng He

308

20 Jun 2025

Physics vs Distributions: Pareto Optimal Flow Matching with Physics Constraints

366

10 Jun 2025

DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs

291

10 Jun 2025

From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered

241

09 Jun 2025

ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation

253

01 Jun 2025

Do not Abstain! Identify and Solve the UncertaintyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

356

01 Jun 2025

Trick or Neat: Adversarial Ambiguity and Language Model EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

196

01 Jun 2025

Inter-Passage Verification for Multi-evidence Multi-answer QAAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

216

31 May 2025

Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents

381

28 May 2025