miniCTX: Neural Theorem Proving with (Long-)Contexts

International Conference on Learning Representations (ICLR), 2024

5 August 2024

Papers citing "miniCTX: Neural Theorem Proving with (Long-)Contexts"

miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path Forward

Azim Ospanov

Farzan Farnia

Roozbeh Yousefzadeh

219

05 Nov 2025

FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels

...

514

04 Nov 2025

RLMEval: Evaluating Research-Level Neural Theorem Proving

349

29 Oct 2025

FormalML: A Benchmark for Evaluating Formal Subgoal Completion in Machine Learning Theory

179

26 Sep 2025

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

...

311

31 Jul 2025

Premise Selection for a Lean Hammer

234

09 Jun 2025

Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening

Andre He

Daniel Fried

Sean Welleck

452

03 Jun 2025

CLEVER: A Curated Benchmark for Formally Verified Code Generation

572

20 May 2025

APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning

622

09 May 2025

APE-Bench: Evaluating Automated Proof Engineering for Formal Math Libraries

371

27 Apr 2025

Programming with Pixels: Can Computer-Use Agents do Software Engineering?

Pranjal Aggarwal

Sean Welleck

480

24 Feb 2025

Formal Mathematical Reasoning: A New Frontier in AI

626

20 Dec 2024

ImProver: Agent-Based Automated Proof Optimization

International Conference on Learning Representations (ICLR), 2024

379

07 Oct 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Junxiao Song

...

Fuli Luo

370

154

15 Aug 2024

Reliable Evaluation and Benchmarks for Statement Autoformalization

521

11 Jun 2024

Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean

Peiyang Song

Kaiyu Yang

A. Anandkumar

455

18 Apr 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Zhejian Zhou

...

Xipeng Qiu

Dahua Lin

276

125

09 Feb 2024

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

...

526

1,565

25 Jan 2024

Graph2Tac: Online Representation Learning of Formal Math Concepts

324

05 Jan 2024

LLMSTEP: LLM proofstep suggestions in Lean

Sean Welleck

Rahul Saha

282

27 Oct 2023

Llemma: An Open Language Model For Mathematics

International Conference on Learning Representations (ICLR), 2023

Albert Q. Jiang

487

424

16 Oct 2023

FIMO: A Challenge Formal Dataset for Automated Theorem Proving

Zhengying Liu

...

Qun Liu

315

08 Sep 2023

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Neural Information Processing Systems (NeurIPS), 2023

491

404

27 Jun 2023

Baldur: Whole-Proof Generation and Repair with Large Language Models

368

155

08 Mar 2023

Magnushammer: A Transformer-Based Approach to Premise Selection

International Conference on Learning Representations (ICLR), 2023

Albert Qiaochu Jiang

358

08 Mar 2023

ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics

242

144

24 Feb 2023

A Survey of Deep Learning for Mathematical Reasoning

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Wenhao Yu

402

194

20 Dec 2022

Towards a Mathematics Formalisation Assistant using Large Language Models

254

14 Nov 2022

Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs

International Conference on Learning Representations (ICLR), 2022

Albert Q. Jiang

383

273

21 Oct 2022

HyperTree Proof Search for Neural Theorem Proving

Neural Information Processing Systems (NeurIPS), 2022

453

207

23 May 2022

Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers

Neural Information Processing Systems (NeurIPS), 2022

Albert Q. Jiang

277

129

22 May 2022

MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics

International Conference on Learning Representations (ICLR), 2021

668

327

31 Aug 2021

Proof Artifact Co-training for Theorem Proving with Language Models

International Conference on Learning Representations (ICLR), 2021

565

148

11 Feb 2021

Learning to Prove Theorems via Interacting with Proof Assistants

International Conference on Machine Learning (ICML), 2019

Kaiyu Yang

Gaowen Liu

461

173

21 May 2019

DeepMath - Deep Sequence Models for Premise Selection

340

254

14 Jun 2016