Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.03551
Cited By
v1
v2 (latest)
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 2,187 papers shown
Title
What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study
Xiaoran Fan
Zhichao Sun
Yangfan Gao
Jingfei Xiong
Hang Yan
...
Shaokang Dong
Changzhi Sun
Tao Gui
Qi Zhang
Xuanjing Huang
174
1
0
14 Jun 2025
MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination
Ao Jia
Haiming Wu
Guohui Yao
D. Song
Songkun Ji
Yazhou Zhang
167
0
0
14 Jun 2025
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation
Zhuocheng Zhang
Yang Feng
Min Zhang
178
3
0
14 Jun 2025
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Dongwei Jiang
Alvin Zhang
Andrew Wang
Nicholas Andrews
Daniel Khashabi
LRM
175
4
0
13 Jun 2025
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models
Yuwen Tan
Boqing Gong
MU
187
1
0
12 Jun 2025
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Craig Macdonald
Jinyuan Fang
Andrew Parry
Zaiqiao Meng
AI4TS
243
3
0
12 Jun 2025
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song
Ruihan Ji
Naichen Shi
Fan Lai
Raed Al Kontar
262
1
0
11 Jun 2025
DrVoice: Parallel Speech-Text Voice Conversation Model via Dual-Resolution Speech Representations
Chao-Hong Tan
Qian Chen
Wen Wang
Chong Deng
Qinglin Zhang
...
Yafeng Chen
Hui Wang
Jiaqing Liu
Jieping Ye
Jieping Ye
AuLLM
184
0
0
11 Jun 2025
Query-Level Uncertainty in Large Language Models
Lihu Chen
Gerard de Melo
Fabian M. Suchanek
Gaël Varoquaux
311
1
0
11 Jun 2025
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
Yiran Peng
Jingze Shi
Yifan Wu
Nan Tang
Yuyu Luo
282
3
0
11 Jun 2025
PropMEND: Hypernetworks for Knowledge Propagation in LLMs
Zeyu Leo Liu
Greg Durrett
Eunsol Choi
KELM
109
0
0
10 Jun 2025
The Geometries of Truth Are Orthogonal Across Tasks
Waiss Azizian
Michael Kirchhof
Eugène Ndiaye
Louis Béthune
Stephen Zhang
Pierre Ablin
Marco Cuturi
166
0
0
10 Jun 2025
Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models
Wentao Shi
Yiqing Shen
KELM
LRM
127
4
0
10 Jun 2025
Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-
k
k
k
Chihiro Taguchi
Seiji Maekawa
Nikita Bhutani
RALM
183
2
0
10 Jun 2025
Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation
Giacomo Baldan
Qiang Liu
Alberto Guardone
Nils Thuerey
AI4CE
129
6
0
10 Jun 2025
Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
Haoyu Wang
Peihao Wang
Mufei Li
Shikun Liu
Siqi Miao
Zinan Lin
P. Li
112
1
0
09 Jun 2025
LLM Unlearning Should Be Form-Independent
Xiaotian Ye
Mengqi Zhang
Shu Wu
MU
193
0
0
09 Jun 2025
LEANN: A Low-Storage Vector Index
Yichuan Wang
Shu Liu
Zhifei Li
Yongji Wu
Ron Yifeng Wang
...
Yang Zhou
Eric Liang
Sewon Min
Matei A. Zaharia
Joseph E. Gonzalez
202
1
0
09 Jun 2025
GaRAGe: A Benchmark with Grounding Annotations for RAG Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ionut Teodor Sorodoc
Leonardo F. R. Ribeiro
Rexhina Blloshmi
Christopher Davis
Adria de Gispert
97
3
0
09 Jun 2025
From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered
Siddartha Devic
Tejas Srinivasan
Jesse Thomason
Willie Neiswanger
Willie Neiswanger
163
7
0
09 Jun 2025
Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge
Yi Sui
Chaozhuo Li
Chen Zhang
D. Song
Qiuchi Li
122
1
0
06 Jun 2025
dots.llm1 Technical Report
Bi Huo
Bin Tu
Cheng Qin
Da Zheng
Debing Zhang
...
Yuqiu Ji
Ze Wen
Zhenhai Liu
Zichao Li
Zilong Liao
MoE
171
3
0
06 Jun 2025
ECoRAG: Evidentiality-guided Compression for Long Context RAG
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yeonseok Jeong
Jinsu Kim
Dohyeon Lee
S. Hwang
342
0
0
05 Jun 2025
Multidimensional Analysis of Specific Language Impairment Using Unsupervised Learning Through PCA and Clustering
IEEE International Conference on Healthcare Informatics (ICHI), 2025
Niruthiha Selvanayagam
152
0
0
05 Jun 2025
From Understanding to Generation: An Efficient Shortcut for Evaluating Language Models
Viktor Hangya
Fabian Küch
Darina Gold
ELM
205
0
0
04 Jun 2025
RedDebate: Safer Responses through Multi-Agent Red Teaming Debates
Ali Asad
Stephen Obadinma
Radin Shayanfar
Xiaodan Zhu
AAML
LLMAG
149
2
0
04 Jun 2025
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model
Haibin Wu
Yuxuan Hu
Ruchao Fan
Xiaofei Wang
K. Kumatani
...
J. Yu
Heng Lu
Lijuan Wang
Y. Qian
Jinyu Li
AuLLM
183
1
0
04 Jun 2025
R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
Qingfei Zhao
Ruobing Wang
Dingling Xu
Daren Zha
Limin Liu
AI4TS
KELM
LRM
188
12
0
04 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Jinyuan Luo
Zhen Fang
Shouqing Yang
Seongheon Park
Ling Chen
AAML
HILM
181
0
0
03 Jun 2025
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data
Bo Peng
Zhiheng Wang
Heyang Gong
Chaochao Lu
167
0
0
03 Jun 2025
SOVA-Bench: Benchmarking the Speech Conversation Ability for LLM-based Voice Assistant
Yixuan Hou
Heyang Liu
Yuhao Wang
Ziyang Cheng
Ronghua Wu
Qunshan Gu
Yanfeng Wang
Yu Wang
AuLLM
184
4
0
03 Jun 2025
Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Meng Li
Michael Vrazitulis
David Schlangen
181
0
0
02 Jun 2025
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
Pasunuti Prasanjith
Prathmesh B More
Anoop Kunchukuttan
Mary Dabre
RALM
218
0
0
02 Jun 2025
Reconsidering LLM Uncertainty Estimation Methods in the Wild
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yavuz Faruk Bakman
D. Yaldiz
Sungmin Kang
Tuo Zhang
Baturalp Buyukates
Salman Avestimehr
Sai Praneeth Karimireddy
173
4
0
01 Jun 2025
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Zhengwen Feng
Hao Peng
Jianwei Yin
HILM
125
3
0
01 Jun 2025
Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs
Sungjae Lee
Hoyoung Kim
Jeongyeon Hwang
Eunhyeok Park
Jungseul Ok
LRM
125
0
0
31 May 2025
RLAE: Reinforcement Learning-Assisted Ensemble for LLMs
Y. Fu
Yuanheng Zhu
Jiajun Chai
Guojun Yin
Wei Lin
Qichao Zhang
Dongbin Zhao
129
10
0
31 May 2025
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation
Hao Chen
Shi Yu
Sen Mei
Wanxiang Che
Zhenghao Liu
...
Yuchun Fan
Pengcheng Huang
Qiushi Xiong
Zhiyuan Liu
Maosong Sun
LRM
256
0
0
30 May 2025
HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Qing Li
Fauzan Farooqui
Zongxiong Chen
Derui Zhu
Yuxia Wang
Congbo Ma
Chenyang Lyu
Fakhri Karray
194
2
0
30 May 2025
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
Gabrielle Kaili-May Liu
Gal Yona
Avi Caciularu
Idan Szpektor
Tim G. J. Rudner
Arman Cohan
237
2
0
30 May 2025
Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Dang Nguyen
Ali Payani
Baharan Mirzasoleiman
124
4
0
30 May 2025
UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chuanyuan Tan
Wenbiao Shao
Hao Xiong
Tong Zhu
Zhenhua Liu
Kai Shi
Wenliang Chen
148
1
0
29 May 2025
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation
Caiqi Zhang
Xiaochen Zhu
Chengzu Li
Nigel Collier
Andreas Vlachos
OffRL
HILM
223
7
0
29 May 2025
SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA
Minrui Luo
Fuhang Kuang
Yu Wang
Zirui Liu
Tianxing He
CLL
193
0
0
29 May 2025
Mis-prompt: Benchmarking Large Language Models for Proactive Error Handling
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiayi Zeng
Yizhe Feng
Mengliang He
Wenhui Lei
Wei Zhang
Zeming Liu
Xiaoming Shi
Aimin Zhou
LRM
127
0
0
29 May 2025
Revisiting Uncertainty Estimation and Calibration of Large Language Models
Linwei Tao
Yi-Fan Yeh
Minjing Dong
Tao Huang
Philip Torr
Chang Xu
177
4
0
29 May 2025
From Chat Logs to Collective Insights: Aggregative Question Answering
Wentao Zhang
Woojeong Kim
Yuntian Deng
LMTD
162
0
0
29 May 2025
Are Reasoning Models More Prone to Hallucination?
Zijun Yao
Y. Liu
Yanxu Chen
Jianhui Chen
Junfeng Fang
Lei Hou
Juanzi Li
Tat-Seng Chua
ReLM
HILM
LRM
239
25
0
29 May 2025
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
Jakub Podolak
Rajeev Verma
ReLM
LRM
243
1
0
28 May 2025
ChatPD: An LLM-driven Paper-Dataset Networking System
Anjie Xu
Ruiqing Ding
Leye Wang
141
2
0
28 May 2025
Previous
1
2
3
...
7
8
9
...
42
43
44
Next