Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1705.03551
Cited By
v1
v2 (latest)
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
9 May 2017
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension"
50 / 2,195 papers shown
Title
F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data
Ziyin Zhang
Zihan Liao
Hang Yu
Peng Di
Rui Wang
134
1
0
02 Oct 2025
Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Mykyta Ielanskyi
Kajetan Schweighofer
L. Aichberger
Sepp Hochreiter
HILM
209
0
0
02 Oct 2025
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Xingjian Zhao
Zhe Xu
Qinyuan Cheng
Zhaoye Fei
Luozhijie Jin
...
Yitian Gong
Yuanfan Xu
Yaqian Zhou
Xuanjing Huang
Xipeng Qiu
AuLLM
230
2
0
01 Oct 2025
Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector
Thong Nguyen
Yibin Lei
Jia-Huei Ju
Eugene Yang
Andrew Yates
84
0
0
01 Oct 2025
ReSeek: A Self-Correcting Framework for Search Agents with Instructive Rewards
Shiyu Li
Yang Tang
Yifan Wang
P. Li
Xi Chen
KELM
LRM
167
1
0
01 Oct 2025
Pay-Per-Search Models are Abstention Models
Mustafa Omer Gul
Claire Cardie
Tanya Goyal
88
0
0
01 Oct 2025
Eyes-on-Me: Scalable RAG Poisoning through Transferable Attention-Steering Attractors
Yen-Shan Chen
Sian-Yao Huang
Cheng-Lin Yang
Yun-Nung Chen
AAML
112
0
0
01 Oct 2025
Are Robust LLM Fingerprints Adversarially Robust?
Anshul Nasery
Edoardo Contente
Alkin Kaz
Pramod Viswanath
Sewoong Oh
AAML
173
2
0
30 Sep 2025
RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning
Gang Li
Yulei Qin
Xiaoyu Tan
Dingkang Yang
Yuchen Shi
Zihan Xu
Xiang Li
Xing Sun
Ke Li
OffRL
ReLM
LRM
238
0
0
30 Sep 2025
Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT
Guy Bar-Shalom
Fabrizio Frasca
Yaniv Galron
Yftah Ziser
Haggai Maron
MLLM
115
0
0
30 Sep 2025
From Factoid Questions to Data Product Requests: Benchmarking Data Product Discovery over Tables and Text
L. Zhang
Nandana Mihindukulasooriya
Niharika S. D'Souza
Sola S. Shirai
Sarthak Dash
Yao Ma
Horst Samulowitz
LMTD
229
1
0
30 Sep 2025
Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training
Yein Park
Minbyul Jeong
Jaewoo Kang
LRM
1.5K
0
0
30 Sep 2025
Accelerating LLM Inference with Precomputed Query Storage
Jay H. Park
Youngju Cho
Choungsol Lee
Moonwook Oh
Euiseong Seo
RALM
28
0
0
30 Sep 2025
TraceDet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models
Shenxu Chang
Junchi Yu
Weixing Wang
Yongqiang Chen
Jialin Yu
Philip Torr
Jindong Gu
HILM
116
0
0
30 Sep 2025
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
Weiyu Huang
Yuezhou Hu
Jun Zhu
Jianfei Chen
CLL
88
0
0
30 Sep 2025
RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection
Daocheng Fu
Jianbiao Mei
Licheng Wen
Xuemeng Yang
Cheng Yang
...
Xinyu Cai
Pinlong Cai
Ding Wang
Yong-Jin Liu
Yu Qiao
LLMAG
LRM
160
0
0
30 Sep 2025
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
Guibin Zhang
Muxin Fu
Shuicheng Yan
LLMAG
370
2
0
29 Sep 2025
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns
Hanqi Xiao
Vaidehi Patil
Hyunji Lee
Elias Stengel-Eskin
Mohit Bansal
164
1
0
29 Sep 2025
Short window attention enables long-term memorization
Loic Cabannes
Maximilian Beck
Gergely Szilvasy
Matthijs Douze
Maria Lomeli
Jade Copet
Pierre-Emmanuel Mazaré
Gabriel Synnaeve
Hervé Jégou
120
1
0
29 Sep 2025
Calibrating Verbalized Confidence with Self-Generated Distractors
Victor Wang
Elias Stengel-Eskin
116
0
0
29 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C Thomas
Michael Kirchhof
Oncel Tuzel
RALM
KELM
215
1
0
29 Sep 2025
Hybrid Reward Normalization for Process-supervised Non-verifiable Agentic Tasks
Peiran Xu
Ruoyao Xiao
Xiaoying Xing
Guannan Zhang
Debiao Li
Kunyu Shi
OffRL
LRM
92
1
0
29 Sep 2025
Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie
Thomas Lukasiewicz
96
0
0
28 Sep 2025
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
Shreyas Singh
Kunal Singh
Pradeep Moturi
LLMAG
LRM
81
1
0
28 Sep 2025
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning
Shaobo Wang
Jiaming Wang
Jiajun Zhang
C. Wang
Yue Min
...
Fei Huang
Huiqiang Jiang
Junyang Lin
Dayiheng Liu
Linfeng Zhang
129
5
0
28 Sep 2025
ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search
Zeyu Shen
Basileal Imana
Tong Wu
Chong Xiang
Prateek Mittal
Aleksandra Korolova
AAML
86
1
0
27 Sep 2025
Tracing the Representation Geometry of Language Models from Pretraining to Post-training
Melody Zixuan Li
Kumar Krishna Agrawal
Arna Ghosh
Komal Kumar Teru
Adam Santoro
Guillaume Lajoie
Blake A. Richards
152
1
0
27 Sep 2025
MoE-PHDS: One MoE checkpoint for flexible runtime sparsity
Lauren Hannah
Soheil Zibakhsh
K. Nishu
Arnav Kundu
Mohammad Samragh Razlighi
Mehrdad Farajtabar
Minsik Cho
MoE
76
0
0
27 Sep 2025
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
Sina J. Semnani
Jirayu Burapacheep
Arpandeep Khatua
Thanawan Atchariyachanvanit
Zheng Wang
M. Lam
KELM
104
1
0
27 Sep 2025
Fine-Grained Uncertainty Decomposition in Large Language Models: A Spectral Approach
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2025
Nassim Walha
Sebastian G. Gruber
Thomas Decker
Yinchong Yang
Alireza Javanmardi
Eyke Hüllermeier
Florian Buettner
UQCV
UD
PER
463
0
0
26 Sep 2025
Do LLM Agents Know How to Ground, Recover, and Assess? A Benchmark for Epistemic Competence in Information-Seeking Agents
Jiaqi Shao
Yuxiang Lin
Munish Prasad Lohani
Yufeng Miao
B. Luo
RALM
ELM
76
0
0
26 Sep 2025
Stochastic activations
Maria Lomeli
Matthijs Douze
Gergely Szilvasy
Loic Cabannes
Jade Copet
Sainbayar Sukhbaatar
Jason Weston
Gabriel Synnaeve
Pierre-Emmanuel Mazaré
Hervé Jégou
LLMSV
172
0
0
26 Sep 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
343
0
0
26 Sep 2025
What Matters More For In-Context Learning under Matched Compute Budgets: Pretraining on Natural Text or Incorporating Targeted Synthetic Examples?
Mohammed Sabry
Anya Belz
79
0
0
26 Sep 2025
Semantic Agreement Enables Efficient Open-Ended LLM Cascades
Duncan Soiffer
Steven Kolawole
Virginia Smith
190
0
0
26 Sep 2025
Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts
Naibin Gu
Zhenyu Zhang
Yuchen Feng
Yilong Chen
Peng Fu
...
Shuohuan Wang
Yu Sun
Hua Wu
Weiping Wang
Haifeng Wang
MoE
77
0
0
26 Sep 2025
Multidimensional Uncertainty Quantification via Optimal Transport
Nikita Kotelevskii
Maiya Goloburda
Vladimir Kondratyev
Alexander Fishkov
Mohsen Guizani
Eric Moulines
Maxim Panov
169
1
0
26 Sep 2025
Predicting LLM Reasoning Performance with Small Proxy Model
Woosung Koh
Juyoung Suk
Sungjun Han
Se-Young Yun
Jay Shin
LRM
AI4CE
218
0
0
25 Sep 2025
Tree Search for LLM Agent Reinforcement Learning
Yuxiang Ji
Ziyu Ma
Yong Wang
Guanhua Chen
Xiangxiang Chu
Liaoni Wu
128
3
0
25 Sep 2025
Hallucination reduction with CASAL: Contrastive Activation Steering For Amortized Learning
Wannan Yang
Xinchi Qiu
L. Yu
Yuchen Zhang
Oliver Aobo Yang
Narine Kokhlikyan
Nicola Cancedda
Diego Garcia-Olano
Diego Garcia-Olano
170
0
0
25 Sep 2025
Mamba Modulation: On the Length Generalization of Mamba
Peng Lu
Jerry Huang
Qiuhao Zeng
X. Wang
Boxing Wang
Philippe Langlais
Yufei Cui
Mamba
277
0
0
23 Sep 2025
Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation
Nandakishor M
HILM
88
0
0
23 Sep 2025
Prior-based Noisy Text Data Filtering: Fast and Strong Alternative For Perplexity
Yeongbin Seo
Gayoung Kim
Jaehyung Kim
Jinyoung Yeo
119
0
0
23 Sep 2025
SilentStriker:Toward Stealthy Bit-Flip Attacks on Large Language Models
Haotian Xu
Qingsong Peng
Jie Shi
Huadi Zheng
Yu Li
Cheng Zhuo
AAML
171
1
0
22 Sep 2025
Semantic Reformulation Entropy for Robust Hallucination Detection in QA Tasks
Chaodong Tong
Qi Zhang
Lei Jiang
Y. Liu
Nannan Sun
Wei Li
HILM
248
0
0
22 Sep 2025
Dynamic Expert Specialization: Towards Catastrophic Forgetting-Free Multi-Domain MoE Adaptation
Junzhuo Li
Bo Wang
Xiuze Zhou
Xuming Hu
MoMe
CLL
MoE
152
0
0
21 Sep 2025
Influence Guided Context Selection for Effective Retrieval-Augmented Generation
Jiale Deng
Yanyan Shen
Ziyuan Pei
Youmin Chen
Linpeng Huang
298
1
0
21 Sep 2025
PruneCD: Contrasting Pruned Self Model to Improve Decoding Factuality
Byeongho Yu
Changhun Lee
Jungyu Jin
Eunhyeok Park
SyDa
118
0
0
20 Sep 2025
Decoding Uncertainty: The Impact of Decoding Strategies for Uncertainty Estimation in Large Language Models
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
92
1
0
20 Sep 2025
Sparse-Autoencoder-Guided Internal Representation Unlearning for Large Language Models
Tomoya Yamashita
Akira Ito
Yuuki Yamanaka
Masanori Yamada
Takayuki Miura
Toshiki Shibahara
MU
KELM
84
1
0
19 Sep 2025
Previous
1
2
3
4
5
...
42
43
44
Next