ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.03325
  4. Cited By
CoverBench: A Challenging Benchmark for Complex Claim Verification

CoverBench: A Challenging Benchmark for Complex Claim Verification

6 August 2024
Alon Jacovi
Moran Ambar
Eyal Ben-David
Uri Shaham
Amir Feder
Mor Geva
Dror Marcus
Avi Caciularu
    LMTD
ArXivPDFHTML

Papers citing "CoverBench: A Challenging Benchmark for Complex Claim Verification"

7 / 7 papers shown
Title
The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input
Alon Jacovi
Andrew Wang
Chris Alberti
Connie Tao
Jon Lipovetz
...
Rachana Fellinger
Rui Wang
Zizhao Zhang
Sasha Goldshtein
Dipanjan Das
HILM
ALM
77
11
0
06 Jan 2025
FactLens: Benchmarking Fine-Grained Fact Verification
FactLens: Benchmarking Fine-Grained Fact Verification
Kushan Mitra
Dan Zhang
Sajjadur Rahman
Estevam R. Hruschka
HILM
25
1
0
08 Nov 2024
TACT: Advancing Complex Aggregative Reasoning with Information
  Extraction Tools
TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools
Avi Caciularu
Alon Jacovi
Eyal Ben-David
Sasha Goldshtein
Tal Schuster
Jonathan Herzig
G. Elidan
Amir Globerson
LMTD
22
3
0
05 Jun 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine
  Inference Gaps in Entailment Verification
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Soumya Sanyal
Tianyi Xiao
Jiacheng Liu
Wenya Wang
Xiang Ren
LRM
ReLM
24
11
0
06 Feb 2024
ContractNLI: A Dataset for Document-level Natural Language Inference for
  Contracts
ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts
Yuta Koreeda
Christopher D. Manning
AILaw
84
96
0
05 Oct 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit
  Reasoning Strategies
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
242
460
0
06 Jan 2021
PubMedQA: A Dataset for Biomedical Research Question Answering
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
196
791
0
13 Sep 2019
1