ResearchTrend.AI

SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables

22 May 2023
Xinyuan Lu
Liangming Pan
Qian Liu
Preslav Nakov
Min-Yen Kan
    LMTD

Papers citing "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"

20 / 20 papers shown
Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Weiqi Wang
Jiefu Ou
Y. Song
Benjamin Van Durme
Daniel Khashabi
LMTD
33
0
0
14 Apr 2025
CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Jiefu Ou
William Gantt Walden
Kate Sanders
Zhengping Jiang
Kaiser Sun
...
Weiqi Wang
Chandler May
Hannah Recknor
Daniel Khashabi
Benjamin Van Durme
44
0
0
27 Mar 2025
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
Zekun Zhou
Xiaocheng Feng
L. Huang
Xiachong Feng
Ziyun Song
...
Baoxin Wang
Dayong Wu
Guoping Hu
Ting Liu
Bing Qin
AI4TS
66
0
0
03 Mar 2025
Step-by-Step Fact Verification System for Medical Claims with Explainable Reasoning
Juraj Vladika
Ivana Hacajová
Florian Matthes
LRM
90
0
0
21 Feb 2025
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Xuanliang Zhang
Dingzirui Wang
Baoxin Wang
Longxu Dou
Xinyuan Lu
Keyan Xu
Dayong Wu
Qingfu Zhu
Wanxiang Che
LMTD
115
1
0
16 Dec 2024
FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Yilun Zhao
Yitao Long
Yuru Jiang
Chengye Wang
Weiyuan Chen
Hongjun Liu
Yiming Zhang
Xiangru Tang
Chen Zhao
Arman Cohan
VLM
28
1
0
08 Nov 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
44
8
0
26 Aug 2024
CHECKWHY: Causal Fact Verification via Argument Structure
Jiasheng Si
Yibo Zhao
Yingjie Zhu
Haiyang Zhu
Wenpeng Lu
Deyu Zhou
CML
HILM
LRM
27
1
0
20 Aug 2024
CoverBench: A Challenging Benchmark for Complex Claim Verification
Alon Jacovi
Moran Ambar
Eyal Ben-David
Uri Shaham
Amir Feder
Mor Geva
Dror Marcus
Avi Caciularu
LMTD
45
3
0
06 Aug 2024
On the Robustness of Language Models for Tabular Question Answering
Kushal Raj Bhandari
Sixue Xing
Soham Dan
Jianxi Gao
LMTD
44
3
0
18 Jun 2024
Missci: Reconstructing Fallacies in Misrepresented Science
Max Glockner
Yufang Hou
Preslav Nakov
Iryna Gurevych
27
4
0
05 Jun 2024
ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
Zirui Wu
Yansong Feng
LMTD
ReLM
LRM
46
7
0
04 Mar 2024
A Survey of Table Reasoning with Large Language Models
Xuanliang Zhang
Dingzirui Wang
Longxu Dou
Qingfu Zhu
Wanxiang Che
LMTD
LRM
20
5
0
13 Feb 2024
QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking
Liangming Pan
Xinyuan Lu
Min-Yen Kan
Preslav Nakov
LRM
20
17
0
11 Oct 2023
Effective Distillation of Table-based Reasoning Ability from LLMs
Bohao Yang
Chen Tang
Kangning Zhao
Chenghao Xiao
Chenghua Lin
LRM
14
22
0
22 Sep 2023
Investigating Zero- and Few-shot Generalization in Fact Verification
Liangming Pan
Yunxiang Zhang
Min-Yen Kan
11
5
0
18 Sep 2023
We're Afraid Language Models Aren't Modeling Ambiguity
Alisa Liu
Zhaofeng Wu
Julian Michael
Alane Suhr
Peter West
Alexander Koller
Swabha Swayamdipta
Noah A. Smith
Yejin Choi
63
87
0
27 Apr 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
202
791
0
13 Sep 2019