Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.03551
Cited By
v1
v2 (latest)
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 2,188 papers shown
Title
HACK: Hallucinations Along Certainty and Knowledge Axes
Adi Simhi
Jonathan Herzig
Itay Itzhak
Dana Arad
Zorik Gekhman
Roi Reichart
Fazl Barez
Gabriel Stanovsky
Idan Szpektor
Yonatan Belinkov
108
0
0
28 Oct 2025
Repurposing Synthetic Data for Fine-grained Search Agent Supervision
Yida Zhao
Kuan Li
Xixi Wu
Liwen Zhang
Dingchu Zhang
...
Xinyu Wang
Kewei Tu
Pengjun Xie
Jingren Zhou
Yong Jiang
96
0
0
28 Oct 2025
Robust Uncertainty Quantification for Self-Evolving Large Language Models via Continual Domain Pretraining
Xiaofan Zhou
Lu Cheng
CLL
365
0
0
27 Oct 2025
Multi-Agent Evolve: LLM Self-Improve through Co-evolution
Yixing Chen
Yiding Wang
Siqi Zhu
Haofei Yu
Tao Feng
Muhan Zhang
M. Patwary
Jiaxuan You
LLMAG
LRM
263
4
0
27 Oct 2025
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
Kaitong Cai
Jusheng Zhang
Yijia Fan
Jing Yang
Keze Wang
LRM
96
1
0
26 Oct 2025
NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge
Hanyu Zhu
Lance Fiondella
Jiawei Yuan
K. Zeng
Long Jiao
SILM
AAML
KELM
225
0
0
24 Oct 2025
Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation
Dhrupad Bhardwaj
Julia Kempe
Tim G. J. Rudner
HILM
208
1
0
24 Oct 2025
Redefining Retrieval Evaluation in the Era of LLMs
Giovanni Trappolini
Florin Cuconasu
Simone Filice
Y. Maarek
Fabrizio Silvestri
57
0
0
24 Oct 2025
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile
Valentino Maiorca
Diego Doimo
Francesco Locatello
Alberto Cazzaniga
93
0
0
24 Oct 2025
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
Tian Lan
Bin Zhu
Qianghuai Jia
Junyang Ren
Haijun Li
Longyue Wang
Zhao Xu
Weihua Luo
Kaifu Zhang
49
0
0
23 Oct 2025
Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models
Christian Hobelsberger
Theresa Winner
Andreas Nawroth
Oliver Mitevski
Anna-Carolina Haensch
ELM
120
0
0
23 Oct 2025
HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement
Danying Ge
Jianhua Gao
Yixue Yang
Weixing Ji
121
0
0
23 Oct 2025
Simple Context Compression: Mean-Pooling and Multi-Ratio Training
Yair Feldman
Yoav Artzi
72
1
0
23 Oct 2025
ARC-Encoder: learning compressed text representations for large language models
Hippolyte Pilchen
Edouard Grave
P. Pérez
LLMAG
RALM
AI4CE
113
0
0
23 Oct 2025
Neural Diversity Regularizes Hallucinations in Language Models
Kushal Chakrabarti
Nirmal Balachundhar
84
0
0
23 Oct 2025
Data-Centric Lessons To Improve Speech-Language Pretraining
Vishaal Udandarao
Zhiyun Lu
Xuankai Chang
Yongqiang Wang
Violet Z. Yao
Albin Madapally Jose
Fartash Faghri
Josh Gardner
Chung-Cheng Chiu
120
0
0
22 Oct 2025
Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search
Howard Yen
Ashwin Paranjape
Mengzhou Xia
Thejas Venkatesh
Jack Hessel
Danqi Chen
Yuhao Zhang
96
2
0
21 Oct 2025
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning
Xiaohan Qin
Xiaoxing Wang
Ning Liao
Cancheng Zhang
Xiangdong Zhang
Mingquan Feng
Jingzhi Wang
Junchi Yan
122
0
0
21 Oct 2025
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
Guanzhong He
Zhen Yang
Jinxin Liu
Bin Xu
Lei Hou
Juanzi Li
88
0
0
21 Oct 2025
See the Text: From Tokenization to Visual Reading
Ling Xing
Alex Jinpeng Wang
Rui Yan
Hongyu Qu
Zechao Li
Jinhui Tang
VLM
116
0
0
21 Oct 2025
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
Hongliang Lu
Yuhang Wen
Pengyu Cheng
Ruijin Ding
Haotian Xu
Jiaqi Guo
Chutian Wang
Haonan Chen
Xiaoxi Jiang
Guanjun Jiang
LRM
76
2
0
21 Oct 2025
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Ling Team
Anqi Shen
B. Li
Bin Hu
Bin Jing
...
Z. Pan
Longxiang Zhang
Zhenzhong Lan
Zhiqiang Ding
Zhiqiang Zhang
ALM
ReLM
LRM
216
2
0
21 Oct 2025
Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference
Siyuan Yan
Guo-qing Jiang
Y. Zhang
Xiaoxing Ma
Ran Zhu
Chun Cao
Jingwei Xu
OffRL
128
0
0
21 Oct 2025
Rethinking On-policy Optimization for Query Augmentation
Zhichao Xu
Shengyao Zhuang
Xueguang Ma
Bingsen Chen
Yijun Tian
Fengran Mo
Jie Cao
Vivek Srikumar
RALM
LRM
115
0
0
20 Oct 2025
Annotation-Efficient Universal Honesty Alignment
Shiyu Ni
Keping Bi
Jiafeng Guo
Minghao Tang
Jingtong Wu
Zengxin Han
Xueqi Cheng
HILM
124
0
0
20 Oct 2025
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Minhua Lin
Zongyu Wu
Zhichao Xu
Hui Liu
Xianfeng Tang
Qi He
Charu C. Aggarwal
Hui Liu
Xiang Zhang
Suhang Wang
AI4TS
LRM
438
1
0
19 Oct 2025
Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic
Yuval Reif
Guy Kaplan
Roy Schwartz
KELM
145
0
0
19 Oct 2025
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
Qiusi Zhan
Angeline Budiman-Chan
Abdelrahman Zayed
Xingzhi Guo
Daniel Kang
Joo-Kyung Kim
LLMAG
KELM
AI4TS
ELM
225
0
0
19 Oct 2025
Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2025
K. Cox
Jiawei Xu
Yikun Han
Rong Xu
Tianhao Li
Chi-Yang Hsu
Tianlong Chen
Walter Gerych
Ying Ding
90
1
0
19 Oct 2025
End-to-end Listen, Look, Speak and Act
Siyin Wang
Wenyi Yu
Xianzhao Chen
Xiaohai Tian
Jun Zhang
Lu Lu
C. Zhang
AuLLM
156
0
0
19 Oct 2025
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold
Yi Wan
Jiuqi Wang
Liam Li
Jinsong Liu
Ruihao Zhu
Zheqing Zhu
OffRL
RALM
AI4TS
LRM
265
0
0
17 Oct 2025
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
Rong Wu
Xiaoman Wang
Jianbiao Mei
Pinlong Cai
Daocheng Fu
...
Licheng Wen
Xuemeng Yang
Yufan Shen
Yuxin Wang
Botian Shi
72
2
0
17 Oct 2025
Compressing Many-Shots in In-Context Learning
Devvrit Khatri
Pranamya Kulkarni
Nilesh Gupta
Yerram Varun
Liqian Peng
...
Cho-Jui Hsieh
Alec Go
Inderjit Dhillon
Aditya Kusupati
Prateek Jain
OffRL
81
0
0
17 Oct 2025
Cost-Aware Retrieval-Augmentation Reasoning Models with Adaptive Retrieval Depth
Helia Hashemi
Victor Rühle
Saravan Rajmohan
RALM
3DV
LRM
122
0
0
17 Oct 2025
Kelle: Co-design KV Caching and eDRAM for Efficient LLM Serving in Edge Computing
Tianhua Xia
Sai Qian Zhang
56
0
0
16 Oct 2025
Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning
Junlin Wu
Xianrui Zhong
Jiashuo Sun
Bolian Li
Bowen Jin
Jiawei Han
Qingkai Zeng
OffRL
AI4TS
LRM
77
0
0
16 Oct 2025
Finding Answers in Thought Matters: Revisiting Evaluation on Large Language Models with Reasoning
Hwiyeol Jo
Joosung Lee
J. H. Lee
Sang-Woo Lee
Joonsuk Park
Kang Min Yoo
ReLM
LRM
89
0
0
16 Oct 2025
Towards Agentic Self-Learning LLMs in Search Environment
Wangtao Sun
Xiang Cheng
Jialin Fan
Yao Xu
Xing Yu
Shizhu He
Jun Zhao
Kang Liu
78
0
0
16 Oct 2025
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
Guoqing Wang
Sunhao Dai
Guangze Ye
Zeyu Gan
Wei Yao
Yong Deng
Xiaofeng Wu
ZhenZhe Ying
OffRL
120
1
0
16 Oct 2025
Beyond Correctness: Rewarding Faithful Reasoning in Retrieval-Augmented Generation
Zhichao Xu
Zongyu Wu
Yun Zhou
Aosong Feng
Kang Zhou
...
Yijun Tian
Xuan Qi
Weikang Qiu
Lin Lee Cheong
Haibo Ding
OffRL
RALM
LRM
96
0
0
15 Oct 2025
Document Intelligence in the Era of Large Language Models: A Survey
Weishi Wang
Hengchang Hu
Zhijie Zhang
Zhaochen Li
Hongxin Shao
Daniel Dahlmeier
AI4TS
128
0
0
15 Oct 2025
ESI: Epistemic Uncertainty Quantification via Semantic-preserving Intervention for Large Language Models
Mingda Li
Xinyu Li
Weinan Zhang
Longxuan Ma
92
0
0
15 Oct 2025
Taming the Fragility of KV Cache Eviction in LLM Inference
Yuan Feng
Haoyu Guo
Junlin Lv
S.Kevin Zhou
Xike Xie
84
1
0
15 Oct 2025
Classifying and Addressing the Diversity of Errors in Retrieval-Augmented Generation Systems
Kin Kwan Leung
Mouloud Belbahri
Yi Sui
Alex Labach
Xueying Zhang
Stephen Rose
Jesse C. Cresswell
72
0
0
15 Oct 2025
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue
Wenwen Tong
Hewei Guo
Dongchuan Ran
Jiangnan Chen
Jiefan Lu
...
Dinghao Zhou
Guiping Zhong
Ken Zheng
Shiyin Kang
Lewei Lu
MLLM
AuLLM
VGen
VLM
380
3
0
15 Oct 2025
Uncertainty Quantification for Hallucination Detection in Large Language Models: Foundations, Methodology, and Future Directions
Sungmin Kang
Yavuz Faruk Bakman
D. Yaldiz
Baturalp Buyukates
Salman Avestimehr
HILM
182
3
0
14 Oct 2025
Who's Asking? Evaluating LLM Robustness to Inquiry Personas in Factual Question Answering
Nil-Jana Akpinar
Chia-Jung Lee
Vanessa Murdock
Pietro Perona
84
0
0
14 Oct 2025
Teaching Language Models to Faithfully Express their Uncertainty
Bryan Eikema
Evgenia Ilia
José G. C. de Souza
Chrysoula Zerva
Wilker Aziz
HILM
132
0
0
14 Oct 2025
APLOT: Robust Reward Modeling via Adaptive Preference Learning with Optimal Transport
Z. Li
Yuege Feng
Dandan Guo
Jinpeng Hu
Anningzhe Gao
Xiang Wan
100
0
0
13 Oct 2025
The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers
Saad Obaid ul Islam
Anne Lauscher
Goran Glavaš
HILM
158
0
0
13 Oct 2025
Previous
1
2
3
4
5
...
42
43
44
Next