Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.05179
Cited By
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
18 April 2017
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Kyunghyun Cho
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine"
50 / 281 papers shown
Title
HalluLens: LLM Hallucination Benchmark
Yejin Bang
Ziwei Ji
Alan Schelten
Anthony Hartshorn
Tara Fowler
Cheng Zhang
Nicola Cancedda
Pascale Fung
HILM
92
0
0
24 Apr 2025
aiXamine: Simplified LLM Safety and Security
Fatih Deniz
Dorde Popovic
Yazan Boshmaf
Euisuh Jeong
M. Ahmad
Sanjay Chawla
Issa M. Khalil
ELM
80
0
0
21 Apr 2025
Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation
Alexandre Misrahi
Nadezhda Chirkova
Maxime Louis
Vassilina Nikoulina
RALM
85
0
0
03 Apr 2025
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
Haolong Yan
Kaijun Tan
Yeqing Shen
Xin Huang
Zheng Ge
Xiangyu Zhang
Si Li
Daxin Jiang
VLM
40
0
0
27 Mar 2025
Dynamic Task Vector Grouping for Efficient Multi-Task Prompt Tuning
Pieyi Zhang
Richong Zhang
Zhijie Nie
VLM
65
0
0
23 Mar 2025
Granite Embedding Models
Parul Awasthy
Aashka Trivedi
Yulong Li
Mihaela A. Bornea
David D. Cox
...
Sukriti Sharma
Avirup Sil
Kate Soule
Arafat Sultan
Radu Florian
RALM
62
1
0
27 Feb 2025
PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning
Pengcheng Huang
Zhenghao Liu
Yukun Yan
Xiaoyuan Yi
Hao Chen
Zhiyuan Liu
Maosong Sun
Tong Xiao
Ge Yu
Chenyan Xiong
98
1
0
24 Feb 2025
Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals
Linda Zeng
Rithwik Gupta
Divij Motwani
Diji Yang
Yi Zhang
AAML
41
1
0
22 Feb 2025
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models
Yanwen Huang
Yong Zhang
Ning Cheng
Zhitao Li
Shaojun Wang
Jing Xiao
86
0
0
02 Jan 2025
Building a Rich Dataset to Empower the Persian Question Answering Systems
Mohsen Yazdinejad
Marjan Kaedi
26
0
0
31 Dec 2024
DRS: Deep Question Reformulation With Structured Output
Zhecheng Li
Y. Wang
Bryan Hooi
Yujun Cai
Nanyun Peng
Kai-Wei Chang
KELM
71
0
0
27 Nov 2024
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework
Yew Ken Chia
Liying Cheng
Hou Pong Chan
Chaoqun Liu
Maojia Song
Sharifah Mahani Aljunied
Soujanya Poria
Lidong Bing
RALM
VLM
43
4
0
09 Nov 2024
VERITAS: A Unified Approach to Reliability Evaluation
Rajkumar Ramamurthy
Meghana Arakkal Rajeev
Oliver Molenschot
James Y. Zou
Nazneen Rajani
HILM
47
1
0
05 Nov 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
59
1
0
30 Oct 2024
RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs
Jiatan Huang
Mingchen Li
Zonghai Yao
Zhichao Yang
Yongkang Xiao
Feiyun Ouyang
Xiaohan Li
Shuo Han
Hong-ye Yu
RALM
25
3
0
17 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
27
1
0
10 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq R. Joty
HILM
110
16
0
30 Sep 2024
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
Wanqi Yang
Yanda Li
Meng Fang
Ling Chen
24
4
0
25 Sep 2024
SFR-RAG: Towards Contextually Faithful LLMs
Xuan-Phi Nguyen
Shrey Pandit
Senthil Purushwalkam
Austin Xu
Hailin Chen
Yifei Ming
Zixuan Ke
Silvio Savarese
Caiming Xong
Shafiq Joty
RALM
88
7
0
16 Sep 2024
WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain
Rounak Meyur
Hung Phan
S. Wagle
Jan Strube
M. Halappanavar
Sameera Horawalavithana
Anurag Acharya
Sai Munikoti
25
1
0
21 Aug 2024
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Guangyuan Ma
Yongliang Ma
Xing Wu
Zhenpeng Su
Ming Zhou
Songlin Hu
OOD
41
2
0
20 Aug 2024
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Omer Goldman
Alon Jacovi
Aviv Slobodkin
Aviya Maimon
Ido Dagan
Reut Tsarfaty
62
10
0
29 Jun 2024
SEC-QA: A Systematic Evaluation Corpus for Financial QA
Viet Dac Lai
Michael Krumdick
Charles Lovering
Varshini Reddy
Craig W. Schmidt
Chris Tanner
48
3
0
20 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
40
1
0
19 Jun 2024
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox
Yijun Liu
Yuan Meng
Fang Wu
Shenhao Peng
Hang Yao
Chaoyu Guan
Chen Tang
Xinzhu Ma
Zhi Wang
Wenwu Zhu
MQ
55
7
0
15 Jun 2024
Benchmark Data Contamination of Large Language Models: A Survey
Cheng Xu
Shuhao Guan
Derek Greene
Mohand-Tahar Kechadi
ELM
ALM
38
38
0
06 Jun 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
51
3
0
04 Jun 2024
INDUS: Effective and Efficient Language Models for Scientific Applications
Bishwaranjan Bhattacharjee
Aashka Trivedi
Masayasu Muraoka
Muthukumaran Ramasubramanian
Takuma Udagawa
...
Peter W. J. Staar
S. Vahidinia
Ryan McGranaghan
A. Mehrabian
Tsendgar Lee
AI4CE
23
5
0
17 May 2024
Improving Long Text Understanding with Knowledge Distilled from Summarization Model
Yan Liu
Yazheng Yang
Xiaokang Chen
VLM
RALM
35
1
0
08 May 2024
Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts
Evgenii Kortukov
Alexander Rubinstein
Elisa Nguyen
Seong Joon Oh
RALM
433
5
2
24 Apr 2024
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
Shirley Wu
Shiyu Zhao
Michihiro Yasunaga
Kexin Huang
Kaidi Cao
Qian Huang
V. Ioannidis
Karthik Subbian
James Y. Zou
J. Leskovec
39
17
0
19 Apr 2024
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization
Zixuan Zhang
R. Reddy
Kevin Small
Tong Zhang
Heng Ji
34
1
0
02 Apr 2024
Automatic Question-Answer Generation for Long-Tail Knowledge
Rohan Kumar
Youngmin Kim
Sunitha Ravi
Haitian Sun
Christos Faloutsos
Ruslan Salakhutdinov
Minji Yoon
23
8
0
03 Mar 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
37
10
0
27 Feb 2024
PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning
Zhisheng Lin
Han Fu
Chenghao Liu
Zhuo Li
Jianling Sun
MoE
MoMe
30
5
0
23 Feb 2024
Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Haeju Lee
Minchan Jeong
SeYoung Yun
Kee-Eung Kim
AAML
VPVLM
53
2
0
13 Feb 2024
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings
Logan Hallee
Rohan Kapur
Arjun Patel
Jason P. Gleghorn
Bohdan B. Khomtchouk
MoE
17
3
0
28 Jan 2024
Graph Guided Question Answer Generation for Procedural Question-Answering
Hai X. Pham
Isma Hadji
Xinnuo Xu
Ziedune Degutyte
Jay Rainey
Evangelos Kazakos
Afsaneh Fazly
Georgios Tzimiropoulos
Brais Martínez
18
1
0
24 Jan 2024
DocFinQA: A Long-Context Financial Reasoning Dataset
Varshini Reddy
Rik Koncel-Kedziorski
Viet Dac Lai
Michael Krumdick
Charles Lovering
Chris Tanner
RALM
27
15
0
12 Jan 2024
Building Efficient and Effective OpenQA Systems for Low-Resource Languages
Emrah Budur
Riza Ozccelik
Dilara Soylu
Omar Khattab
Tunga Güngör
Christopher Potts
30
1
0
07 Jan 2024
PCoQA: Persian Conversational Question Answering Dataset
Hamed Hematian Hemati
Atousa Toghyani
Atena Souri
Sayed Hesam Alavian
Hossein Sameti
Hamid Beigy
22
3
0
07 Dec 2023
Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
Inderjeet Nair
Shwetha Somasundaram
Apoorv Saxena
Koustava Goswami
RALM
29
8
0
22 Nov 2023
Adapting Pre-trained Generative Models for Extractive Question Answering
Prabir Mallick
Tapas Nayak
Indrajit Bhattacharya
11
2
0
06 Nov 2023
NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain
Anurag Acharya
Sai Munikoti
Aaron Hellinger
Sara Smith
S. Wagle
Sameera Horawalavithana
ELM
25
4
0
17 Oct 2023
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
Xiusi Chen
Jyun-Yu Jiang
Wei-Cheng Chang
Cho-Jui Hsieh
Hsiang-Fu Yu
Wei Wang
19
11
0
08 Oct 2023
DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning
Zhengxiang Shi
Aldo Lipani
VLM
26
30
0
11 Sep 2023
A Massive Scale Semantic Similarity Dataset of Historical English
Emily Silcock
Melissa Dell
39
5
0
30 Jun 2023
Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Thomas Mensink
J. Uijlings
Lluis Castrejon
A. Goel
Felipe Cadar
Howard Zhou
Fei Sha
A. Araújo
V. Ferrari
34
37
0
15 Jun 2023
Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering
Hai Ye
Qizhe Xie
Hwee Tou Ng
40
8
0
11 Jun 2023
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Cheng-Han Chiang
Yung-Sung Chuang
James R. Glass
Hung-yi Lee
AI4TS
21
3
0
08 Jun 2023
1
2
3
4
5
6
Next