Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.03117
Cited By
ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure
4 October 2024
Ippei Fujisawa
Sensho Nobe
Hiroki Seto
Rina Onda
Yoshiaki Uchida
Hiroki Ikoma
Pei-Chun Chien
Ryota Kanai
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure"
2 / 2 papers shown
Title
LookAlike: Consistent Distractor Generation in Math MCQs
Nisarg Parikh
Nigel Fernandez
Alexander Scarlatos
Simon Woodhead
Andrew S. Lan
27
0
0
03 May 2025
L0-Reasoning Bench: Evaluating Procedural Correctness in Language Models via Simple Program Execution
Simeng Sun
Cheng-Ping Hsieh
Faisal Ladhak
Erik Arakelyan
Santiago Akle Serano
Boris Ginsburg
ReLM
ELM
LRM
39
0
0
28 Mar 2025
1