Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1803.05355
Cited By
v1
v2
v3 (latest)
FEVER: a large-scale dataset for Fact Extraction and VERification
North American Chapter of the Association for Computational Linguistics (NAACL), 2018
14 March 2018
James Thorne
Andreas Vlachos
Christos Christodoulopoulos
Arpit Mittal
HILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FEVER: a large-scale dataset for Fact Extraction and VERification"
50 / 1,133 papers shown
AlignCheck: a Semantic Open-Domain Metric for Factual Consistency Assessment
Ahmad Aghaebrahimian
HILM
163
0
0
03 Dec 2025
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
Weihang Su
Jianming Long
Changyue Wang
Shiyu Lin
Jingyan Xu
Ziyi Ye
Qingyao Ai
Yiqun Liu
HILM
119
0
0
02 Dec 2025
HealthContradict: Evaluating Biomedical Knowledge Conflicts in Language Models
Boya Zhang
Alban Bornet
Rui Yang
Nan Liu
Douglas Teodoro
147
0
0
02 Dec 2025
Trification: A Comprehensive Tree-based Strategy Planner and Structural Verification for Fact-Checking
Anab Maulana Barik
Shou Ziyi
Yang Kaiwen
Yang Qi
Shen Xin
41
0
0
29 Nov 2025
Can LLMs extract human-like fine-grained evidence for evidence-based fact-checking?
Antonín Jarolím
Martin Fajčík
Lucia Makaiová
136
0
0
26 Nov 2025
Large Language Models Require Curated Context for Reliable Political Fact-Checking -- Even with Reasoning and Web Search
Matthew R. Deverna
Kai-Cheng Yang
Harry Yaojun Yan
Filippo Menczer
KELM
HILM
LRM
ELM
228
0
0
24 Nov 2025
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
Y. Zhang
Yizheng Zhao
Chen-Hao Hu
Binxing Jiao
Daxin Jiang
Ruihang Miao
Cam-Tu Nguyen
194
0
0
21 Nov 2025
ConInstruct: Evaluating Large Language Models on Conflict Detection and Resolution in Instructions
Xingwei He
Qianru Zhang
Pengfei Chen
Guanhua Chen
Linlin Yu
Yuan Yuan
Siu-Ming Yiu
217
0
0
18 Nov 2025
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
Raavi Gupta
Pranav Hari Panicker
S. Bhatia
Ganesh Ramakrishnan
HILM
142
2
0
15 Nov 2025
Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks
Yauhen Babakhin
Radek Osmulski
Ronay Ak
Gabriel de Souza P. Moreira
Mengyao Xu
Benedikt Schifferer
Bo Liu
Even Oldridge
135
6
0
10 Nov 2025
Wikipedia-based Datasets in Russian Information Retrieval Benchmark RusBEIR
Grigory Kovalev
Natalia Loukachevitch
M. Tikhomirov
Olga Babina
Pavel Mamaev
106
0
0
07 Nov 2025
Hybrid Fact-Checking that Integrates Knowledge Graphs, Large Language Models, and Search-Based Retrieval Agents Improves Interpretable Claim Verification
Shaghayegh Kolli
Richard Rosenbaum
Timo Cavelius
Lasse Strothe
Andrii Lata
Jana Diesner
KELM
137
1
0
05 Nov 2025
TSVer: A Benchmark for Fact Verification Against Time-Series Evidence
Marek Strong
Andreas Vlachos
AI4TS
146
1
0
02 Nov 2025
RzenEmbed: Towards Comprehensive Multimodal Retrieval
Weijian Jian
Yajun Zhang
Dawei Liang
Chunyu Xie
Yixiao He
Dawei Leng
Yuhui Yin
133
1
0
31 Oct 2025
CausalGuard: A Smart System for Detecting and Preventing False Information in Large Language Models
Piyushkumar Patel
HILM
LRM
100
0
0
30 Oct 2025
Layer of Truth: Probing Belief Shifts under Continual Pre-Training Poisoning
S. Churina
Niranjan Chebrolu
Kokil Jaidka
KELM
HILM
CLL
365
0
0
29 Oct 2025
HACK: Hallucinations Along Certainty and Knowledge Axes
Adi Simhi
Jonathan Herzig
Itay Itzhak
Dana Arad
Zorik Gekhman
Roi Reichart
Fazl Barez
Gabriel Stanovsky
Idan Szpektor
Yonatan Belinkov
190
1
0
28 Oct 2025
MERGE: Minimal Expression-Replacement GEneralization Test for Natural Language Inference
Mădălina Zgreabăn
Tejaswini Deoskar
Lasha Abzianidze
118
0
0
28 Oct 2025
ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents
Zhenyu Zhang
Tianyi Chen
Weiran Xu
Alex Pentland
Jiaxin Pei
LLMAG
LRM
93
1
0
27 Oct 2025
Multi-Modal Fact-Verification Framework for Reducing Hallucinations in Large Language Models
Piyushkumar Patel
HILM
176
0
0
26 Oct 2025
A Comprehensive Dataset for Human vs. AI Generated Text Detection
Rajarshi Roy
Nasrin Imanpour
Ashhar Aziz
Shashwat Bajpai
Gurpreet Singh
...
Vasu Sharma
Aishwarya N. Reganti
Vinija Jain
Aman Chadha
Amitava Das
DeLMO
520
1
0
26 Oct 2025
A Benchmark for Open-Domain Numerical Fact-Checking Enhanced by Claim Decomposition
Venktesh V
Deepali Prabhu
Avishek Anand
HILM
174
0
0
24 Oct 2025
Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples
Shiva Sreeram
Alaa Maalouf
Pratyusha Sharma
Daniela Rus
118
0
0
23 Oct 2025
Rethinking On-policy Optimization for Query Augmentation
Zhichao Xu
Shengyao Zhuang
Xueguang Ma
Bingsen Chen
Yijun Tian
Fengran Mo
Jie Cao
Vivek Srikumar
RALM
LRM
183
0
0
20 Oct 2025
A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications
Minhua Lin
Zongyu Wu
Zhichao Xu
Hui Liu
Xianfeng Tang
Qi He
Charu C. Aggarwal
Hui Liu
Xiang Zhang
Suhang Wang
AI4TS
LRM
564
2
0
19 Oct 2025
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
Chih-Kai Yang
Yen-Ting Piao
Tzu-wen Hsu
Szu-Wei Fu
Zhehuai Chen
...
Sung-Feng Huang
Chao-Han Huck Yang
Y. Wang
Yun-Nung Chen
Hung-yi Lee
KELM
AuLLM
184
0
0
19 Oct 2025
Stable but Miscalibrated: A Kantian View on Overconfidence from Filters to Large Language Models
Akira Okutomi
LRM
208
0
0
16 Oct 2025
Retrofitting Small Multilingual Models for Retrieval: Matching 7B Performance with 300M Parameters
Lifu Tu
Yingbo Zhou
Semih Yavuz
LRM
82
0
0
16 Oct 2025
Putting on the Thinking Hats: A Survey on Chain of Thought Fine-tuning from the Perspective of Human Reasoning Mechanism
Xiaoshu Chen
Sihang Zhou
Ke Liang
Duanyang Yuan
Haoyuan Chen
Xiaoyu Sun
Linyuan Meng
Xinwang Liu
ReLM
LRM
226
0
0
15 Oct 2025
When Embedding Models Meet: Procrustes Bounds and Applications
Lucas Maystre
Alvaro Ortega Gonzalez
Charles Park
Rares Dolga
Tudor Berariu
Yu Zhao
K. Ciosek
167
0
0
15 Oct 2025
The Role of Parametric Injection-A Systematic Study of Parametric Retrieval-Augmented Generation
Minghao Tang
Shiyu Ni
Jingtong Wu
Zengxin Han
Keping Bi
101
0
0
14 Oct 2025
Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation
Linfeng Gao
Baolong Bi
Zheng Yuan
Le Wang
Zerui Chen
Zhimin Wei
Shenghua Liu
Qinggang Zhang
Jinsong Su
RALM
200
0
0
14 Oct 2025
LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation
Hengran Zhang
Keping Bi
Jiafeng Guo
Jiaming Zhang
Shuaiqiang Wang
Dawei Yin
Xueqi Cheng
RALM
145
0
0
13 Oct 2025
Attacks by Content: Automated Fact-checking is an AI Security Issue
Michael Schlichtkrull
AAML
116
0
0
13 Oct 2025
Discrepancy Detection at the Data Level: Toward Consistent Multilingual Question Answering
Lorena Calvo-Bartolomé
Valérie Aldana
Karla Cantarero
Alonso Madroñal de Mesa
Jerónimo Arenas-García
Jordan L. Boyd-Graber
HILM
190
0
0
13 Oct 2025
FactAppeal: Identifying Epistemic Factual Appeals in News Media
Guy Mor-Lan
Tamir Sheafer
Shaul R. Shenhav
HILM
133
0
0
12 Oct 2025
You're Not Gonna Believe This: A Computational Analysis of Factual Appeals and Sourcing in Partisan News
Guy Mor-Lan
Tamir Sheafer
Shaul R. Shenhav
80
0
0
12 Oct 2025
ADMIT: Few-shot Knowledge Poisoning Attacks on RAG-based Fact Checking
Yutao Wu
Xiao Liu
Y. Li
Yifeng Gao
Yifan Ding
Jiale Ding
Xiang Zheng
Xingjun Ma
AAML
KELM
154
0
0
11 Oct 2025
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
Cheng Yang
X. J. Yang
Licheng Wen
Daocheng Fu
Jianbiao Mei
...
Yufan Shen
Nianchen Deng
Ding Wang
Yu Qiao
Haifeng Li
LLMAG
RALM
157
2
0
09 Oct 2025
Text2Stories: Evaluating the Alignment Between Stakeholder Interviews and Generated User Stories
Francesco Dente
Fabiano Dalpiaz
Paolo Papotti
83
0
0
08 Oct 2025
GRACE: Generative Representation Learning via Contrastive Policy Optimization
Jiashuo Sun
Shixuan Liu
Zhaochen Su
Xianrui Zhong
Pengcheng Jiang
Sara Szymkuć
Peiran Li
Weijia Shi
Jiawei Han
95
1
0
06 Oct 2025
Equipping Retrieval-Augmented Large Language Models with Document Structure Awareness
Lingnan Xu
Chong Feng
Kaiyuan Zhang
Liu Zhengyong
Wenqiang Xu
Fanqing Meng
RALM
133
0
0
05 Oct 2025
Contrastive Retrieval Heads Improve Attention-Based Re-Ranking
Linh Tran
Yulong Li
Radu Florian
Wei-Ju Sun
129
0
0
02 Oct 2025
F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data
Ziyin Zhang
Zihan Liao
Hang Yu
Peng Di
Rui Wang
144
1
0
02 Oct 2025
Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Qi He
Cheng Qian
Xiusi Chen
Bingxiang He
Yi R.
Fung
OffRL
LRM
189
2
0
02 Oct 2025
Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector
Thong Nguyen
Yibin Lei
Jia-Huei Ju
Eugene Yang
Andrew Yates
127
2
0
01 Oct 2025
MuPlon: Multi-Path Causal Optimization for Claim Verification through Controlling Confounding
Hanghui Guo
Shimin Di
Pasquale De Meo
Zhangze Chen
Jia Zhu
147
0
0
30 Sep 2025
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
Guibin Zhang
Muxin Fu
Shuicheng Yan
LLMAG
389
9
0
29 Sep 2025
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu
Yuchen Zhuang
Zihan Dong
Jonathan Wang
Yue Yu
Joyce C. Ho
Linjun Zhang
Haoyu Wang
W. Shi
Carl Yang
RALM
ReLM
KELM
LRM
148
3
0
29 Sep 2025
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models
Sina J. Semnani
Jirayu Burapacheep
Arpandeep Khatua
Thanawan Atchariyachanvanit
Zheng Wang
M. Lam
KELM
129
2
0
27 Sep 2025
1
2
3
4
...
21
22
23
Next