Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
2508.21422
Cited By
Automatic Reviewers Fail to Detect Faulty Reasoning in Research Papers: A New Counterfactual Evaluation Framework
29 August 2025
Nils Dycke
Iryna Gurevych
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"Automatic Reviewers Fail to Detect Faulty Reasoning in Research Papers: A New Counterfactual Evaluation Framework"
1 / 1 papers shown
FLAWS: A Benchmark for Error Identification and Localization in Scientific Papers
Sarina Xi
Vishisht Rao
Justin Payan
Nihar B. Shah
72
2
0
26 Nov 2025
1