Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.26076
Cited By
IMProofBench: Benchmarking AI on Research-Level Mathematical Proof Generation
30 September 2025
Johannes Schmitt
Gergely Bérczi
Jasper Dekoninck
Jeremy Feusi
Tim Gehrunger
Raphael Appenzeller
Jim Bryan
Niklas Canova
Timo de Wolff
Filippo Gaia
Michel van Garrel
Baran Hashemi
David Holmes
Aitor Iribar Lopez
Victor Jaeck
Martina Jørgensen
Steven Kelk
Stefan Kuhlmann
Adam Kurpisz
Chiara Meroni
Ingmar Metzler
Martin Möller
Samuel Muñoz-Echániz
Robert Nowak
Georg Oberdieck
Daniel Platt
Dylan Possamaï
Gabriel Ribeiro
Raúl Sánchez Galán
Zheming Sun
Josef Teichmann
Richard P. Thomas
Charles Vial
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1353★)
Papers citing
"IMProofBench: Benchmarking AI on Research-Level Mathematical Proof Generation"
2 / 2 papers shown
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
Claudia Herambourg
Dawid Siuda
Julia Kopczyńska
Joao R. L. Santos
Wojciech Sas
Joanna Śmietańska-Nowak
ELM
ALM
LRM
399
0
0
04 Nov 2025
Putnam-like dataset summary: LLMs as mathematical competition contestants
Bartosz Bieganowski
Daniel Strzelecki
Robert Skiba
Mateusz Topolewski
AIMat
318
0
0
29 Sep 2025
1