ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.03350
  4. Cited By
miniCTX: Neural Theorem Proving with (Long-)Contexts
v1v2v3 (latest)

miniCTX: Neural Theorem Proving with (Long-)Contexts

International Conference on Learning Representations (ICLR), 2024
5 August 2024
Jiewen Hu
Thomas Zhu
Sean Welleck
    AIMat
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (194★)

Papers citing "miniCTX: Neural Theorem Proving with (Long-)Contexts"

35 / 35 papers shown
miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward
miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward
Azim Ospanov
Farzan Farnia
Roozbeh Yousefzadeh
219
1
0
05 Nov 2025
FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels
FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels
Jiedong Jiang
Wanyi He
Yuefeng Wang
Guoxiong Gao
Yongle Hu
...
Nailing Guan
Peihao Wu
Chunbo Dai
Liang Xiao
Bin Dong
AIMatELMLRM
514
12
0
04 Nov 2025
RLMEval: Evaluating Research-Level Neural Theorem Proving
RLMEval: Evaluating Research-Level Neural Theorem Proving
Auguste Poiroux
Antoine Bosselut
Viktor Kuncak
AIMatOffRL
349
2
0
29 Oct 2025
FormalML: A Benchmark for Evaluating Formal Subgoal Completion in Machine Learning Theory
FormalML: A Benchmark for Evaluating Formal Subgoal Completion in Machine Learning Theory
Xiao-Wen Yang
Zihao Zhang
Jianuo Cao
Zhi Zhou
Zenan Li
Lan-Zhe Guo
Yuan Yao
Taolue Chen
Yu-Feng Li
Xiaoxing Ma
ALMLRM
179
2
0
26 Sep 2025
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
L. Chen
J. Gu
Daigang Xu
Wenhao Huang
Z. L. Jiang
...
Ge Zhang
Tianyun Zhao
Jianqiu Zhao
Yichi Zhou
Thomas Hanwen Zhu
AIMatLRM
311
64
0
31 Jul 2025
Premise Selection for a Lean Hammer
Premise Selection for a Lean Hammer
Thomas Zhu
Joshua Clune
Jeremy Avigad
Albert Qiaochu Jiang
Sean Welleck
234
6
0
09 Jun 2025
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening
Andre He
Daniel Fried
Sean Welleck
452
45
0
03 Jun 2025
CLEVER: A Curated Benchmark for Formally Verified Code Generation
CLEVER: A Curated Benchmark for Formally Verified Code Generation
Amitayush Thakur
Jasper Lee
George Tsoukalas
Meghana Sistla
Matthew Zhao
Stefan Zetzsche
Greg Durrett
Yisong Yue
Swarat Chaudhuri
ALM
572
23
0
20 May 2025
APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning
APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning
Azim Ospanov
Farzan Farnia
Roozbeh Yousefzadeh
LRM
622
15
0
09 May 2025
APE-Bench: Evaluating Automated Proof Engineering for Formal Math Libraries
APE-Bench: Evaluating Automated Proof Engineering for Formal Math Libraries
Huajian Xin
Luming Li
Xiaoran Jin
Jacques Fleuriot
Wenda Li
AIMat
371
3
0
27 Apr 2025
Programming with Pixels: Can Computer-Use Agents do Software Engineering?
Programming with Pixels: Can Computer-Use Agents do Software Engineering?
Pranjal Aggarwal
Sean Welleck
480
1
0
24 Feb 2025
Formal Mathematical Reasoning: A New Frontier in AI
Formal Mathematical Reasoning: A New Frontier in AI
Kaiyu Yang
Gabriel Poesia
Jingxuan He
Wenda Li
Kristin Lauter
Swarat Chaudhuri
Dawn Song
LRMAI4CE
626
78
0
20 Dec 2024
ImProver: Agent-Based Automated Proof Optimization
ImProver: Agent-Based Automated Proof OptimizationInternational Conference on Learning Representations (ICLR), 2024
Riyaz Ahuja
Jeremy Avigad
Prasad Tetali
Sean Welleck
LLMAG
379
3
0
07 Oct 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for
  Reinforcement Learning and Monte-Carlo Tree Search
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Huajian Xin
Zhaochun Ren
Junxiao Song
Zhihong Shao
Wanjia Zhao
...
Dejian Yang
Zhibin Gou
Z. F. Wu
Fuli Luo
Chong Ruan
AIMatLRM
370
154
0
15 Aug 2024
Reliable Evaluation and Benchmarks for Statement Autoformalization
Reliable Evaluation and Benchmarks for Statement Autoformalization
Auguste Poiroux
Gail Weiss
Viktor Kuncak
Antoine Bosselut
521
10
0
11 Jun 2024
Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean
Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean
Peiyang Song
Kaiyu Yang
A. Anandkumar
455
10
0
18 Apr 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable
  Reasoning
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Huaiyuan Ying
Shuo Zhang
Linyang Li
Zhejian Zhou
Yunfan Shao
...
Hang Yan
Xipeng Qiu
Jiayu Wang
Kai-xiang Chen
Dahua Lin
ReLMLRM
276
125
0
09 Feb 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The
  Rise of Code Intelligence
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Daya Guo
Qihao Zhu
Dejian Yang
Zhenda Xie
Kai Dong
...
Yu-Huan Wu
Yiming Li
Fuli Luo
Yingfei Xiong
W. Liang
ELM
526
1,565
0
25 Jan 2024
Graph2Tac: Online Representation Learning of Formal Math Concepts
Graph2Tac: Online Representation Learning of Formal Math Concepts
Lasse Blaauwbroek
Miroslav Olšák
Jason Rute
F. I. S. Massolo
Jelle Piepenbrock
Vasily Pestun
324
18
0
05 Jan 2024
LLMSTEP: LLM proofstep suggestions in Lean
LLMSTEP: LLM proofstep suggestions in Lean
Sean Welleck
Rahul Saha
282
39
0
27 Oct 2023
Llemma: An Open Language Model For Mathematics
Llemma: An Open Language Model For MathematicsInternational Conference on Learning Representations (ICLR), 2023
Zhangir Azerbayev
Hailey Schoelkopf
Keiran Paster
Marco Dos Santos
Alexander Shmakov
Albert Q. Jiang
Jia Deng
Stella Biderman
Sean Welleck
CLL
487
424
0
16 Oct 2023
FIMO: A Challenge Formal Dataset for Automated Theorem Proving
FIMO: A Challenge Formal Dataset for Automated Theorem Proving
Chengwu Liu
Jianhao Shen
Huajian Xin
Zhengying Liu
Ye Yuan
...
Chuanyang Zheng
Yichun Yin
Lin Li
Ming Zhang
Qun Liu
AIMatAI4CE
315
52
0
08 Sep 2023
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
LeanDojo: Theorem Proving with Retrieval-Augmented Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
491
404
0
27 Jun 2023
Baldur: Whole-Proof Generation and Repair with Large Language Models
Baldur: Whole-Proof Generation and Repair with Large Language Models
E. First
M. Rabe
Talia Ringer
Yuriy Brun
368
155
0
08 Mar 2023
Magnushammer: A Transformer-Based Approach to Premise Selection
Magnushammer: A Transformer-Based Approach to Premise SelectionInternational Conference on Learning Representations (ICLR), 2023
Maciej Mikuła
Szymon Tworkowski
Szymon Antoniak
Bartosz Piotrowski
Albert Qiaochu Jiang
Jinyi Zhou
Christian Szegedy
Lukasz Kuciñski
Piotr Milo's
Yuhuai Wu
358
62
0
08 Mar 2023
ProofNet: Autoformalizing and Formally Proving Undergraduate-Level
  Mathematics
ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics
Zhangir Azerbayev
Bartosz Piotrowski
Hailey Schoelkopf
Edward W. Ayers
Dragomir R. Radev
J. Avigad
AIMat
242
144
0
24 Feb 2023
A Survey of Deep Learning for Mathematical Reasoning
A Survey of Deep Learning for Mathematical ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLMLRM
402
194
0
20 Dec 2022
Towards a Mathematics Formalisation Assistant using Large Language
  Models
Towards a Mathematics Formalisation Assistant using Large Language Models
Ayush Agrawal
Siddhartha Gadgil
Navin Goyal
Ashvni Narayanan
Anand Tadipatri
254
22
0
14 Nov 2022
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal
  Proofs
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal ProofsInternational Conference on Learning Representations (ICLR), 2022
Albert Q. Jiang
Sean Welleck
Jin Peng Zhou
Wenda Li
Hamish Ivison
M. Jamnik
Timothée Lacroix
Yuhuai Wu
Guillaume Lample
AIMat
383
273
0
21 Oct 2022
HyperTree Proof Search for Neural Theorem Proving
HyperTree Proof Search for Neural Theorem ProvingNeural Information Processing Systems (NeurIPS), 2022
Guillaume Lample
Marie-Anne Lachaux
Thibaut Lavril
Xavier Martinet
Amaury Hayat
Gabriel Ebner
Aurelien Rodriguez
Timothée Lacroix
AIMat
453
207
0
23 May 2022
Thor: Wielding Hammers to Integrate Language Models and Automated
  Theorem Provers
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem ProversNeural Information Processing Systems (NeurIPS), 2022
Albert Q. Jiang
Wenda Li
Szymon Tworkowski
K. Czechowski
Tomasz Odrzygó'zd'z
Piotr Milo's
Yuhuai Wu
M. Jamnik
AIMatLRM
277
129
0
22 May 2022
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematicsInternational Conference on Learning Representations (ICLR), 2021
Kunhao Zheng
Jesse Michael Han
Stanislas Polu
AIMat
668
327
0
31 Aug 2021
Proof Artifact Co-training for Theorem Proving with Language Models
Proof Artifact Co-training for Theorem Proving with Language ModelsInternational Conference on Learning Representations (ICLR), 2021
Jesse Michael Han
Jason M. Rute
Yuhuai Wu
Edward W. Ayers
Stanislas Polu
AIMat
565
148
0
11 Feb 2021
Learning to Prove Theorems via Interacting with Proof Assistants
Learning to Prove Theorems via Interacting with Proof AssistantsInternational Conference on Machine Learning (ICML), 2019
Kaiyu Yang
Gaowen Liu
AIMatLRM
461
173
0
21 May 2019
DeepMath - Deep Sequence Models for Premise Selection
DeepMath - Deep Sequence Models for Premise Selection
Alexander A. Alemi
François Chollet
N. Eén
G. Irving
Christian Szegedy
Josef Urban
LRMAIMat
340
254
0
14 Jun 2016
1
Page 1 of 1