ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.26076
  4. Cited By
IMProofBench: Benchmarking AI on Research-Level Mathematical Proof Generation

IMProofBench: Benchmarking AI on Research-Level Mathematical Proof Generation

30 September 2025
Johannes Schmitt
Gergely Bérczi
Jasper Dekoninck
Jeremy Feusi
Tim Gehrunger
Raphael Appenzeller
Jim Bryan
Niklas Canova
Timo de Wolff
Filippo Gaia
Michel van Garrel
Baran Hashemi
David Holmes
Aitor Iribar Lopez
Victor Jaeck
Martina Jørgensen
Steven Kelk
Stefan Kuhlmann
Adam Kurpisz
Chiara Meroni
Ingmar Metzler
Martin Möller
Samuel Muñoz-Echániz
Robert Nowak
Georg Oberdieck
Daniel Platt
Dylan Possamaï
Gabriel Ribeiro
Raúl Sánchez Galán
Zheming Sun
Josef Teichmann
Richard P. Thomas
Charles Vial
    LRM
ArXiv (abs)PDFHTMLGithub (1353★)

Papers citing "IMProofBench: Benchmarking AI on Research-Level Mathematical Proof Generation"

2 / 2 papers shown
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
Claudia Herambourg
Dawid Siuda
Julia Kopczyńska
Joao R. L. Santos
Wojciech Sas
Joanna Śmietańska-Nowak
ELMALMLRM
399
0
0
04 Nov 2025
Putnam-like dataset summary: LLMs as mathematical competition contestants
Putnam-like dataset summary: LLMs as mathematical competition contestants
Bartosz Bieganowski
Daniel Strzelecki
Robert Skiba
Mateusz Topolewski
AIMat
318
0
0
29 Sep 2025
1