Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers

15 November 2023
Yuxia Wang
Revanth Gangi Reddy
Zain Muhammad Mujahid
Arnav Arora
Aleksandr Rubashevskii
Jiahui Geng
Osama Mohammed Afzal
Liangming Pan
Nadav Borenstein
Aditya Pillai
Isabelle Augenstein
Iryna Gurevych
Preslav Nakov
    HILM

Papers citing "Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers"

31 / 31 papers shown
Fact-checking AI-generated news reports: Can LLMs catch their own lies?
Jiayi Yao
Haibo Sun
Nianwen Xue
HILM
52
0
0
24 Mar 2025
Optimizing Decomposition for Optimal Claim Verification
Yining Lu
Noah Ziems
Hy Dang
Meng-Long Jiang
56
0
0
19 Mar 2025
Evaluating open-source Large Language Models for automated fact-checking
Nicoló Fontana
Francesco Corso
Enrico Zuccolotto
Francesco Pierri
HILM
57
0
0
07 Mar 2025
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
84
1
0
26 Feb 2025
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking
Yuxuan Liu
Hongda Sun
Wenya Guo
Xinyan Xiao
Cunli Mao
Zhengtao Yu
Rui Yan
63
2
0
22 Feb 2025
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
Qing Zong
Z. Wang
Tianshi Zheng
Xiyu Ren
Y. Song
57
1
0
31 Dec 2024
A Reality Check on Context Utilisation for Retrieval-Augmented Generation
Lovisa Hagström
Sara Vera Marjanović
Haeun Yu
Arnav Arora
Christina Lioma
Maria Maistro
Pepa Atanasova
Isabelle Augenstein
75
0
0
22 Dec 2024
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts
Tobias Braun
Mark Rothermel
Marcus Rohrbach
Anna Rohrbach
83
1
0
13 Dec 2024
Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024
Christopher Malon
LRM
27
1
0
08 Nov 2024
FactLens: Benchmarking Fine-Grained Fact Verification
Kushan Mitra
Dan Zhang
Sajjadur Rahman
Estevam R. Hruschka
HILM
38
1
0
08 Nov 2024
FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation
Farima Fatahi Bayat
Lechen Zhang
Sheza Munir
Lu Wang
HILM
39
3
0
29 Oct 2024
Enhancing Answer Attribution for Faithful Text Generation with Large Language Models
Juraj Vladika
Luca Mülln
Florian Matthes
23
0
0
22 Oct 2024
Loki: An Open-Source Tool for Fact Verification
Haonan Li
Xudong Han
Hao Wang
Yuxia Wang
Minghan Wang
Rui Xing
Yilin Geng
Zenan Zhai
Preslav Nakov
Timothy Baldwin
SyDa
HILM
62
3
0
02 Oct 2024
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
28
0
0
29 Aug 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
44
8
0
26 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
34
0
0
07 Aug 2024
Generative Large Language Models in Automated Fact-Checking: A Survey
Ivan Vykopal
Matúš Pikuliak
Simon Ostermann
Marián Simko
HILM
38
5
0
02 Jul 2024
Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification
Anisha Gunjal
Greg Durrett
HILM
44
13
0
28 Jun 2024
Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs
D. Yaldiz
Yavuz Faruk Bakman
Baturalp Buyukates
Chenyang Tao
Anil Ramakrishna
Dimitrios Dimitriadis
Jieyu Zhao
Salman Avestimehr
39
2
0
17 Jun 2024
Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking
Xiaokang Zhang
Zijun Yao
Jing Zhang
Kaifeng Yun
Jifan Yu
Juan-Zi Li
Jie Tang
HILM
32
2
0
10 Apr 2024
Enhancing Security of AI-Based Code Synthesis with GitHub Copilot via Cheap and Efficient Prompt-Engineering
Jakub Res
I. Homoliak
Martin Peresíni
A. Smrčka
K. Malinka
P. Hanáček
29
3
0
19 Mar 2024
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors
Guanghua Li
Wensheng Lu
Wei Zhang
Defu Lian
Kezhong Lu
Rui Mao
Kai Shu
Hao Liao
HILM
14
4
0
14 Mar 2024
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Ekaterina Fadeeva
Aleksandr Rubashevskii
Artem Shelmanov
Sergey Petrakov
Haonan Li
...
Gleb Kuzmin
Alexander Panchenko
Timothy Baldwin
Preslav Nakov
Maxim Panov
HILM
40
38
0
07 Mar 2024
A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization
Tharindu Kumarage
Garima Agrawal
Paras Sheth
Raha Moraffah
Amanat Chadha
Joshua Garland
Huan Liu
DeLMO
34
11
0
02 Mar 2024
Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education
Rui Yang
Boming Yang
Sixun Ouyang
Tianwei She
Aosong Feng
Yuang Jiang
Freddy Lecue
Jinghui Lu
Irene Z Li
AI4Ed
24
5
0
22 Feb 2024
LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation
Keyang Xuan
Li Yi
Fan Yang
Ruochen Wu
Yi Ren Fung
Heng Ji
37
11
0
19 Feb 2024
Fact-checking based fake news detection: a review
Yuzhou Yang
Yangming Zhou
Qichao Ying
Zhenxing Qian
Dan Zeng
Liang Liu
EgoV
50
0
0
03 Jan 2024
How Language Model Hallucinations Can Snowball
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
78
252
0
22 May 2023
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
235
257
0
21 Mar 2022
Towards Faithfulness in Open Domain Table-to-text Generation from an Entity-centric View
Tianyu Liu
Xin Zheng
Baobao Chang
Zhifang Sui
114
35
0
17 Feb 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
245
671
0
06 Jan 2021