ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.10645
  4. Cited By
AmbigQA: Answering Ambiguous Open-domain Questions
v1v2 (latest)

AmbigQA: Answering Ambiguous Open-domain Questions

22 April 2020
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
ArXiv (abs)PDFHTML

Papers citing "AmbigQA: Answering Ambiguous Open-domain Questions"

50 / 275 papers shown
When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
Tao Wu
Chuhao Zhou
Guangyu Zhao
Haozhi Cao
Yewen Pu
J. Yang
378
0
0
04 Dec 2025
Learning Steerable Clarification Policies with Collaborative Self-play
Learning Steerable Clarification Policies with Collaborative Self-play
Jonathan Berant
Maximillian Chen
Adam Fisch
Reza Aghajani
Fantine Huot
Mirella Lapata
Jacob Eisenstein
248
3
0
03 Dec 2025
Fantastic Bugs and Where to Find Them in AI Benchmarks
Fantastic Bugs and Where to Find Them in AI Benchmarks
Sang Truong
Yuheng Tu
Michael Hardy
Anka Reuel
Zeyu Tang
...
Jonathan Perera
Chibuike Uwakwe
Ben Domingue
Nick Haber
Sanmi Koyejo
167
6
0
20 Nov 2025
Reasoning about Intent for Ambiguous Requests
Reasoning about Intent for Ambiguous Requests
Irina Saparina
Mirella Lapata
AI4CE
207
1
0
13 Nov 2025
The Illusion of Certainty: Uncertainty Quantification for LLMs Fails under Ambiguity
The Illusion of Certainty: Uncertainty Quantification for LLMs Fails under Ambiguity
Tim Tomov
Dominik Fuchsgruber
Tom Wollschlager
Stephan Günnemann
206
10
0
06 Nov 2025
KGFR: A Foundation Retriever for Generalized Knowledge Graph Question Answering
KGFR: A Foundation Retriever for Generalized Knowledge Graph Question Answering
Yuanning Cui
Zequn Sun
Wei Hu
Zhangjie Fu
RALM
321
0
0
06 Nov 2025
Beyond Single Embeddings: Capturing Diverse Targets with Multi-Query Retrieval
Beyond Single Embeddings: Capturing Diverse Targets with Multi-Query Retrieval
Hung-Ting Chen
Xiang Liu
Shauli Ravfogel
Eunsol Choi
153
3
0
04 Nov 2025
DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness
DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness
Jiabao Ji
Min Li
Priyanshu Kumar
Shiyu Chang
Saloni Potdar
159
3
0
03 Nov 2025
ChessQA: Evaluating Large Language Models for Chess Understanding
ChessQA: Evaluating Large Language Models for Chess Understanding
Qianfeng Wen
Zhenwei Tang
Ashton Anderson
ELMLRM
237
2
0
28 Oct 2025
Efficient semantic uncertainty quantification in language models via diversity-steered sampling
Efficient semantic uncertainty quantification in language models via diversity-steered sampling
Ji Won Park
K. Cho
176
0
0
24 Oct 2025
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Minhua Lin
Zongyu Wu
Zhichao Xu
Hui Liu
Xianfeng Tang
Qi He
Charu C. Aggarwal
Hui Liu
Xiang Zhang
Suhang Wang
AI4TSLRM
635
9
0
19 Oct 2025
ESI: Epistemic Uncertainty Quantification via Semantic-preserving Intervention for Large Language Models
ESI: Epistemic Uncertainty Quantification via Semantic-preserving Intervention for Large Language Models
Mingda Li
Xinyu Li
Weinan Zhang
Longxuan Ma
211
0
0
15 Oct 2025
Teaching Language Models to Faithfully Express their Uncertainty
Teaching Language Models to Faithfully Express their Uncertainty
Bryan Eikema
Evgenia Ilia
José G. C. de Souza
Chrysoula Zerva
Wilker Aziz
HILM
216
1
0
14 Oct 2025
Generation Space Size: Understanding and Calibrating Open-Endedness of LLM Generations
Generation Space Size: Understanding and Calibrating Open-Endedness of LLM Generations
Sunny Yu
Ahmad Jabbar
Robert Hawkins
Dan Jurafsky
Myra Cheng
258
1
0
14 Oct 2025
VeriCite: Towards Reliable Citations in Retrieval-Augmented Generation via Rigorous Verification
VeriCite: Towards Reliable Citations in Retrieval-Augmented Generation via Rigorous Verification
Haosheng Qian
Yixing Fan
Jiafeng Guo
Ruqing Zhang
Qi Chen
Dawei Yin
Xueqi Cheng
RALM
164
4
0
13 Oct 2025
RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
Aashiq Muhamed
Leonardo F. R. Ribeiro
Markus Dreyer
Virginia Smith
Mona Diab
155
2
0
12 Oct 2025
Trace Length is a Simple Uncertainty Signal in Reasoning Models
Trace Length is a Simple Uncertainty Signal in Reasoning Models
Siddartha Devic
Charlotte Peale
Arwen Bradley
Sinead Williamson
Preetum Nakkiran
Aravind Gollakota
LRM
201
7
0
12 Oct 2025
ConDABench: Interactive Evaluation of Language Models for Data Analysis
ConDABench: Interactive Evaluation of Language Models for Data Analysis
Avik Dutta
Priyanshu Gupta
Hosein Hasanbeig
Rahul Pratap Singh
Harshit Nigam
Sumit Gulwani
Arjun Radhakrishna
Gustavo Soares
A. Tiwari
LMTD
242
1
0
10 Oct 2025
A$^2$Search: Ambiguity-Aware Question Answering with Reinforcement Learning
A2^22Search: Ambiguity-Aware Question Answering with Reinforcement Learning
Fengji Zhang
Xinyao Niu
Chengyang Ying
Guancheng Lin
Zhongkai Hao
Zhou Fan
Chengen Huang
J. Keung
B. Chen
Junyang Lin
150
0
0
09 Oct 2025
QGraphLIME - Explaining Quantum Graph Neural Networks
QGraphLIME - Explaining Quantum Graph Neural Networks
Haribandhu Jena
Jyotirmaya Shivottam
Subhankar Mishra
FAtt
293
5
0
07 Oct 2025
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
Nan Huo
Xiaohan Xu
Jinyang Li
Per Jacobsson
Shipei Lin
...
Hongyu Liu
Chenhao Ma
Fatma Ozcan
Yannis Papakonstantinou
Reynold Cheng
LMTDVLM
333
7
0
06 Oct 2025
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
Sina J. Semnani
Jirayu Burapacheep
Arpandeep Khatua
Thanawan Atchariyachanvanit
Zheng Wang
M. Lam
KELM
176
3
0
27 Sep 2025
MARCH: Evaluating the Intersection of Ambiguity Interpretation and Multi-hop Inference
MARCH: Evaluating the Intersection of Ambiguity Interpretation and Multi-hop Inference
Jeonghyun Park
Ingeol Baek
Seunghyun Yoon
Haeun Jang
Aparna Garimella
Akriti Jain
Nedim Lipka
Hwanhee Lee
LRM
201
0
0
26 Sep 2025
Fine-Grained Uncertainty Decomposition in Large Language Models: A Spectral Approach
Fine-Grained Uncertainty Decomposition in Large Language Models: A Spectral ApproachIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2025
Nassim Walha
Sebastian G. Gruber
Thomas Decker
Yinchong Yang
Alireza Javanmardi
Eyke Hüllermeier
Florian Buettner
UQCVUDPER
595
2
0
26 Sep 2025
Unsupervised Conformal Inference: Bootstrapping and Alignment to Control LLM Uncertainty
Unsupervised Conformal Inference: Bootstrapping and Alignment to Control LLM Uncertainty
Lingyou Pang
Daigang Xu
Jianyu Lin
Tianyu Wang
Akira Horiguchi
Alexander Aue
Carey E. Priebe
150
1
0
26 Sep 2025
It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge
It Depends: Resolving Referential Ambiguity in Minimal Contexts with Commonsense Knowledge
Lukas Ellinger
Georg Groh
154
1
0
19 Sep 2025
Relevance to Utility: Process-Supervised Rewrite for RAG
Relevance to Utility: Process-Supervised Rewrite for RAG
Jaeyoung Kim
Jongho Kim
Seung-won Hwang
Seoho Song
Young-In Song
261
0
0
19 Sep 2025
Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Zhuoxuan Zhang
Jinhao Duan
Edward Kim
Kaidi Xu
147
1
0
17 Sep 2025
Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?
Can Multiple Responses from an LLM Reveal the Sources of Its Uncertainty?
Yang Nan
Pengfei He
Ravi Tandon
Han Xu
145
3
0
28 Aug 2025
Identifying and Answering Questions with False Assumptions: An Interpretable Approach
Identifying and Answering Questions with False Assumptions: An Interpretable Approach
Zijie Wang
Eduardo Blanco
HILM
264
0
0
21 Aug 2025
Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
Eviatar Nachshoni
Arie Cattan
Shmuel Amar
Ori Shapira
Ido Dagan
AAML
183
1
0
17 Aug 2025
Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information
Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information
Youcheng Huang
Bowen Qin
Chen Huang
Duanyu Feng
Xi Yang
Wenqiang Lei
ReLMELMLRM
258
0
0
15 Aug 2025
TRAIL: Joint Inference and Refinement of Knowledge Graphs with Large Language Models
TRAIL: Joint Inference and Refinement of Knowledge Graphs with Large Language Models
Xinkui Zhao
Haode Li
Yifan Zhang
Guanjie Cheng
Yueshen Xu
KELMLRM
145
1
0
06 Aug 2025
MAO-ARAG: Multi-Agent Orchestration for Adaptive Retrieval-Augmented Generation
MAO-ARAG: Multi-Agent Orchestration for Adaptive Retrieval-Augmented Generation
Yiqun Chen
Erhan Zhang
Lingyong Yan
Shuaiqiang Wang
J. Huang
D. Yin
Jiaxin Mao
183
5
0
01 Aug 2025
Which LLMs Get the Joke? Probing Non-STEM Reasoning Abilities with HumorBench
Which LLMs Get the Joke? Probing Non-STEM Reasoning Abilities with HumorBench
Reuben Narad
Siddharth Suresh
Jiayi Chen
Pine S.L. Dysart-Bricken
Bob Mankoff
R. Nowak
Jifan Zhang
Lalit P. Jain
LRM
225
3
0
29 Jul 2025
PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation
PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation
ZheHao Tan
YiHan Jiao
Dan Yang
Lei Liu
Jie Feng
DuoLin Sun
Yue Shen
Jian Wang
Peng Wei
Jinjie Gu
146
3
0
23 Jul 2025
Awakening LLMs' Reasoning Potential: A Fine-Grained Pipeline to Evaluate and Mitigate Vague Perception
Awakening LLMs' Reasoning Potential: A Fine-Grained Pipeline to Evaluate and Mitigate Vague Perception
Zipeng Ling
Yuehao Tang
Qi Zheng
Junqi Yang
Shenghong Fu
Chen Huang
Kejia Huang
Yao Wan
Zhichao Hou
Xuming Hu
LRM
522
2
0
22 Jul 2025
Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual Questions
Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual QuestionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Pu Jian
Donglei Yu
Wen Yang
Shuo Ren
Jiajun Zhang
240
13
0
18 Jul 2025
Read the Docs Before Rewriting: Equip Rewriter with Domain Knowledge via Continual Pre-training
Read the Docs Before Rewriting: Equip Rewriter with Domain Knowledge via Continual Pre-training
Qi Wang
Yixuan Cao
Yifan Liu
Jiangtao Zhao
Ping Luo
RALM
259
1
0
01 Jul 2025
Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis
Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis
Rafi Al Attrach
Pedro Moreira
Rajna Fani
Renato Umeton
Amelia Fiske
Leo Anthony Celi
325
4
0
27 Jun 2025
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models
Xiaolong Wang
Zhaolu Kang
Wangyuxuan Zhai
Xinyue Lou
Yunghwei Lai
...
Yawen Wang
Kaiyu Huang
Yile Wang
Peng Li
Wenshu Fan
316
0
0
20 Jun 2025
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
Xinyi Liu
Weiguang Wang
Hangfeng He
308
0
0
20 Jun 2025
Physics vs Distributions: Pareto Optimal Flow Matching with Physics Constraints
Giacomo Baldan
Qiang Liu
Alberto Guardone
Nils Thuerey
AI4CE
366
14
0
10 Jun 2025
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs
Arie Cattan
Alon Jacovi
Ori Ram
Jonathan Herzig
Roee Aharoni
Sasha Goldshtein
E. Ofek
Idan Szpektor
Avi Caciularu
291
12
0
10 Jun 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
Siddartha Devic
Tejas Srinivasan
Jesse Thomason
Willie Neiswanger
Willie Neiswanger
241
14
0
09 Jun 2025
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation
Xinyi Liu
Lipeng Ma
Yixuan Li
Weidong Yang
Qingyuan Zhou
Jiayi Song
Shuhao Li
Ben Fei
LRM
253
1
0
01 Jun 2025
Do not Abstain! Identify and Solve the Uncertainty
Do not Abstain! Identify and Solve the UncertaintyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jingyu Liu
Jingquan Peng
xiaopeng Wu
Xubin Li
Bo Xiao
Bo Zheng
Yong Liu
356
6
0
01 Jun 2025
Trick or Neat: Adversarial Ambiguity and Language Model Evaluation
Trick or Neat: Adversarial Ambiguity and Language Model EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Antonia Karamolegkou
Oliver Eberle
Phillip Rust
Carina Kauf
Anders Søgaard
196
2
0
01 Jun 2025
Inter-Passage Verification for Multi-evidence Multi-answer QA
Inter-Passage Verification for Multi-evidence Multi-answer QAAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Bingsen Chen
Shengjie Wang
Xi Ye
Chen Zhao
RALM
216
0
0
31 May 2025
Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents
Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents
Michael Kirchhof
Gjergji Kasneci
Enkelejda Kasneci
LLMAG
381
22
0
28 May 2025
123456
Next
Page 1 of 6