ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.10645
  4. Cited By
AmbigQA: Answering Ambiguous Open-domain Questions

AmbigQA: Answering Ambiguous Open-domain Questions

22 April 2020
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
ArXivPDFHTML

Papers citing "AmbigQA: Answering Ambiguous Open-domain Questions"

50 / 212 papers shown
Title
Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models
Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models
Zhengliang Shi
Lingyong Yan
Weiwei Sun
Yue Feng
Pengjie Ren
Xinyu Ma
Shuaiqiang Wang
D. Yin
Maarten de Rijke
Z. Ren
RALM
43
0
0
05 May 2025
Conflicts in Texts: Data, Implications and Challenges
Conflicts in Texts: Data, Implications and Challenges
Siyi Liu
Dan Roth
105
0
0
28 Apr 2025
CORG: Generating Answers from Complex, Interrelated Contexts
CORG: Generating Answers from Complex, Interrelated Contexts
Hyunji Lee
Franck Dernoncourt
Trung H. Bui
Seunghyun Yoon
19
0
0
25 Apr 2025
Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
Ziqiao Ma
Jing Ding
Xuejun Zhang
Dezhi Luo
Jiahe Ding
Sihan Xu
Yuchen Huang
Run Peng
Joyce Chai
49
0
0
22 Apr 2025
Retrieval-Augmented Generation with Conflicting Evidence
Retrieval-Augmented Generation with Conflicting Evidence
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
RALM
66
1
0
17 Apr 2025
Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation
Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation
Anfu Tang
Laure Soulier
Vincent Guigue
LRM
77
0
0
16 Apr 2025
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding
Aniket Pal
Sanket Biswas
Alloy Das
Ayush Lodh
Priyanka Banerjee
Soumitri Chattopadhyay
Dimosthenis Karatzas
Josep Lladós
C. V. Jawahar
VLM
32
0
0
12 Apr 2025
TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models
TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models
Sher Badshah
Ali Emami
Hassan Sajjad
LLMAG
ELM
43
0
0
10 Apr 2025
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?
Belinda Z. Li
Been Kim
Z. Wang
LRM
38
2
0
28 Mar 2025
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey
Xiaoou Liu
Tiejin Chen
Longchao Da
Chacha Chen
Zhen Lin
Hua Wei
HILM
62
3
0
20 Mar 2025
Navigating Rifts in Human-LLM Grounding: Study and Benchmark
Navigating Rifts in Human-LLM Grounding: Study and Benchmark
Omar Shaikh
Hussein Mozannar
Gagan Bansal
Adam Fourney
Eric Horvitz
71
2
0
18 Mar 2025
Benchmarking Failures in Tool-Augmented Language Models
Benchmarking Failures in Tool-Augmented Language Models
Eduardo Treviño
Hugo Contant
James Ngai
Graham Neubig
Zora Zhiruo Wang
67
0
0
18 Mar 2025
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering
Sher Badshah
Hassan Sajjad
60
1
0
11 Mar 2025
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai
Yijie Xu
Jinhui Ye
Hao Liu
Hui Xiong
3DV
RALM
76
2
0
03 Mar 2025
Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution
K. Li
Tianhua Zhang
Yunxiang Li
Hongyin Luo
Abdalla Moustafa
Xixin Wu
James Glass
H. Meng
61
0
0
03 Mar 2025
Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs
Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs
Xiaomin Li
Zhou Yu
Ziji Zhang
Yingying Zhuang
S.
Narayanan Sadagopan
Anurag Beniwal
HILM
58
0
0
28 Feb 2025
Program Synthesis Dialog Agents for Interactive Decision-Making
Program Synthesis Dialog Agents for Interactive Decision-Making
Matthew Toles
Nikhil Balwani
Rattandeep Singh
Valentina Giulia Sartori Rodriguez
Zhou Yu
60
0
0
26 Feb 2025
Disambiguate First Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing
Disambiguate First Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing
Irina Saparina
Mirella Lapata
41
0
0
25 Feb 2025
Improving Consistency in Large Language Models through Chain of Guidance
Improving Consistency in Large Language Models through Chain of Guidance
Harsh Raj
Vipul Gupta
Domenic Rosati
Subhabrata Majumdar
LLMAG
LRM
63
3
0
21 Feb 2025
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Abdelrahman Abdallah
Bhawna Piryani
Jamshid Mozafari
Mohammed Ali
Adam Jatowt
81
1
0
21 Feb 2025
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
Wanqi Yang
Y. Li
Meng Fang
L. Chen
59
1
0
09 Feb 2025
CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering
CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering
Zongxi Li
Y. Li
Haoran Xie
S. J. Qin
66
0
0
03 Feb 2025
Accounting for Focus Ambiguity in Visual Questions
Chongyan Chen
Yu-Yun Tseng
Zhuoheng Li
Anush Venkatesh
Danna Gurari
36
0
0
04 Jan 2025
A review of faithfulness metrics for hallucination assessment in Large Language Models
Ben Malin
Tatiana Kalganova
Nikoloas Boulgouris
HILM
59
2
0
03 Jan 2025
RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in
  Visual LLMs
RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
A. Testoni
Barbara Plank
Raquel Fernández
64
0
0
18 Dec 2024
DMQR-RAG: Diverse Multi-Query Rewriting for RAG
DMQR-RAG: Diverse Multi-Query Rewriting for RAG
Zhicong Li
Jiahao Wang
Zhishu Jiang
Hangyu Mao
Zhongxia Chen
Jiazhen Du
Yuanxing Zhang
Fuzheng Zhang
Di Zhang
Yong Liu
135
3
0
20 Nov 2024
Do LLMs Understand Ambiguity in Text? A Case Study in Open-world
  Question Answering
Do LLMs Understand Ambiguity in Text? A Case Study in Open-world Question Answering
Aryan Keluskar
Amrita Bhattacharjee
Huan Liu
66
2
0
19 Nov 2024
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice
Who's Who: Large Language Models Meet Knowledge Conflicts in Practice
Quang Hieu Pham
Hoang Ngo
Anh Tuan Luu
Dat Quoc Nguyen
RALM
HILM
21
4
0
21 Oct 2024
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions
Michael J.Q. Zhang
W. Bradley Knox
Eunsol Choi
48
3
0
17 Oct 2024
Open Domain Question Answering with Conflicting Contexts
Open Domain Question Answering with Conflicting Contexts
Siyi Liu
Qiang Ning
Kishaloy Halder
Wei Xiao
Zheng Qi
...
Yi Zhang
Neha Anna John
Bonan Min
Yassine Benajiba
Dan Roth
LLMAG
63
2
0
16 Oct 2024
Retrieving Contextual Information for Long-Form Question Answering using
  Weak Supervision
Retrieving Contextual Information for Long-Form Question Answering using Weak Supervision
Philipp Christmann
Svitlana Vakulenko
Ionut Teodor Sorodoc
Bill Byrne
Adria de Gispert
RALM
29
0
0
11 Oct 2024
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield
  Better Language Models
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models
Yanjun Chen
Dawei Zhu
Yirong Sun
Xinghao Chen
Wei Zhang
Xiaoyu Shen
ALM
26
1
0
09 Oct 2024
Do great minds think alike? Investigating Human-AI Complementarity in
  Question Answering with CAIMIRA
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA
Maharshi Gor
Hal Daumé III
Tianyi Zhou
Jordan Boyd-Graber
ELM
AI4MH
LRM
20
1
0
09 Oct 2024
Adaptive Question Answering: Enhancing Language Model Proficiency for
  Addressing Knowledge Conflicts with Source Citations
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations
Sagi Shaier
Ari Kobren
Philip Ogren
HILM
29
5
0
05 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
ECon: On the Detection and Resolution of Evidence Conflicts
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
36
1
0
05 Oct 2024
Detecting Temporal Ambiguity in Questions
Detecting Temporal Ambiguity in Questions
Bhawna Piryani
Abdelrahman Abdallah
Jamshid Mozafari
Adam Jatowt
31
0
0
25 Sep 2024
"I Never Said That": A dataset, taxonomy and baselines on response
  clarity classification
"I Never Said That": A dataset, taxonomy and baselines on response clarity classification
Konstantinos Thomas
Giorgos Filandrianos
Maria Lymperaiou
Chrysoula Zerva
Giorgos Stamou
31
0
0
20 Sep 2024
IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question
  Answering
IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering
Ruosen Li
Barry Wang
Ruochen Li
Xinya Du
ELM
33
5
0
24 Aug 2024
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty
Yongjin Yang
Haneul Yoo
Hwaran Lee
60
1
0
13 Aug 2024
FastFiD: Improve Inference Efficiency of Open Domain Question Answering
  via Sentence Selection
FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection
Yufei Huang
Xu Han
Maosong Sun
26
0
0
12 Aug 2024
Chain of Condition: Construct, Verify and Solve Conditions for
  Conditional Question Answering
Chain of Condition: Construct, Verify and Solve Conditions for Conditional Question Answering
Jiuheng Lin
Yuxuan Lai
Yansong Feng
LRM
21
0
0
10 Aug 2024
Citekit: A Modular Toolkit for Large Language Model Citation Generation
Citekit: A Modular Toolkit for Large Language Model Citation Generation
Jiajun Shen
Tong Zhou
Suifeng Zhao
Yubo Chen
Kang Liu
HILM
KELM
33
7
0
06 Aug 2024
DebateQA: Evaluating Question Answering on Debatable Knowledge
DebateQA: Evaluating Question Answering on Debatable Knowledge
Rongwu Xu
Xuan Qi
Zehan Qi
Wei Xu
Zhijiang Guo
ELM
41
5
0
02 Aug 2024
Improving Retrieval Augmented Language Model with Self-Reasoning
Improving Retrieval Augmented Language Model with Self-Reasoning
Yuan Xia
Jingbo Zhou
Zhenhui Shi
Jun Chen
Hai-ting Huang
AIFin
LRM
ReLM
KELM
36
8
0
29 Jul 2024
Know Your Limits: A Survey of Abstention in Large Language Models
Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen
Jihan Yao
Shangbin Feng
Chenjun Xu
Yulia Tsvetkov
Bill Howe
Lucy Lu Wang
49
5
0
25 Jul 2024
I Could've Asked That: Reformulating Unanswerable Questions
I Could've Asked That: Reformulating Unanswerable Questions
Wenting Zhao
Ge Gao
Claire Cardie
Alexander M. Rush
ELM
30
1
0
24 Jul 2024
Continual Learning for Temporal-Sensitive Question Answering
Continual Learning for Temporal-Sensitive Question Answering
Wanqi Yang
Yunqiu Xu
Yanda Li
Kunze Wang
Binbin Huang
Ling-Hao Chen
CLL
27
3
0
17 Jul 2024
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for
  Improved Quality and Efficiency in RAG Systems
Enhancing Retrieval and Managing Retrieval: A Four-Module Synergy for Improved Quality and Efficiency in RAG Systems
Yunxiao Shi
Xing Zi
Zijing Shi
Haimin Zhang
Qiang Wu
Min Xu
34
7
0
15 Jul 2024
The Art of Saying No: Contextual Noncompliance in Language Models
The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman
Sachin Kumar
Vidhisha Balachandran
Pradeep Dasigi
Valentina Pyatkin
...
Jack Hessel
Yulia Tsvetkov
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
65
20
0
02 Jul 2024
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database
  Queries
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries
Irina Saparina
Mirella Lapata
39
10
0
27 Jun 2024
12345
Next