v1v2 (latest)

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Neural Information Processing Systems (NeurIPS), 2023

27 June 2023

ArXiv (abs)PDF HTML HuggingFace (17 upvotes)

Papers citing "LeanDojo: Theorem Proving with Retrieval-Augmented Language Models"

50 / 192 papers shown

ImProver: Agent-Based Automated Proof OptimizationInternational Conference on Learning Representations (ICLR), 2024

301

07 Oct 2024

Consistent Autoformalization for Constructing Mathematical LibrariesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

214

05 Oct 2024

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and GeneralizationNeural Information Processing Systems (NeurIPS), 2024

...

354

27 Sep 2024

In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Jiaxuan You

194

23 Sep 2024

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely

Siyun Zhao

Yuqing Yang

Zilong Wang

Zhiyuan He

Luna Qiu

Lili Qiu

SyDa RALM 3DV

338

23 Sep 2024

AutoVerus: Automated Proof Generation for Rust Code

Chenyuan Yang

Xuheng Li

Md Rakib Hossain Misu

...

456

19 Sep 2024

Great Memory, Shallow Reasoning: Limits of

k

NN-LMsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

246

21 Aug 2024

KAN 2.0: Kolmogorov-Arnold Networks Meet Science

Ziming Liu

Pingchuan Ma

Yixuan Wang

Wojciech Matusik

Max Tegmark

363

163

19 Aug 2024

QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement LearningInternational Conference on Software Engineering (ICSE), 2024

Yuriy Brun

572

17 Aug 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Junxiao Song

...

Fuli Luo

291

138

15 Aug 2024

miniCTX: Neural Theorem Proving with (Long-)ContextsInternational Conference on Learning Representations (ICLR), 2024

474

05 Aug 2024

Mission Impossible: A Statistical Perspective on Jailbreaking LLMsNeural Information Processing Systems (NeurIPS), 2024

Jingtong Su

Mingyu Lee

SangKeun Lee

217

02 Aug 2024

LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover

Zijian Wu

Jiayu Wang

Dahua Lin

Kai-xiang Chen

307

24 Jul 2024

Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

376

17 Jul 2024

PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

George Tsoukalas

Jimmy Xin

Swarat Chaudhuri

271

15 Jul 2024

Lean-STaR: Learning to Interleave Thinking and Proving

752

14 Jul 2024

Solving General Natural-Language-Description Optimization Problems with Large Language Models

Wei Wang

168

09 Jul 2024

Towards Automated Functional Equation Proving: A Benchmark Dataset and A Domain-Specific In-Context Agent

Mahdi Buali

Robert Hoehndorf

254

05 Jul 2024

Learning Formal Mathematics From Intrinsic Motivation

317

30 Jun 2024

Towards Large Language Model Aided Program Refinement

YuFan Cai

Zhe Hou

Xiaokun Luan

David Miguel Sanan Baena

Yun Lin

Jun Sun

Jin Song Dong

179

26 Jun 2024

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

Sean Welleck

Ilia Kulikov

Zaid Harchaoui

400

113

24 Jun 2024

Specify What? Enhancing Neural Specification Synthesis by Symbolic Methods

George Granberry

Wolfgang Ahrendt

Moa Johansson

286

21 Jun 2024

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

Zhengying Liu

167

20 Jun 2024

Proving Olympiad Algebraic Inequalities without Human Demonstrations

246

20 Jun 2024

miniCodeProps: a Minimal Benchmark for Proving Code Properties

Evan Lohn

Sean Welleck

188

16 Jun 2024

Reliable Evaluation and Benchmarks for Statement Autoformalization

434

11 Jun 2024

Can I understand what I create? Self-Knowledge Evaluation of Large Language Models

197

10 Jun 2024

SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner

Xunguang Wang

Shuai Wang

Yingjiu Li

Yang Liu

Ning Liu

Juergen Rahmel

AAML

485

08 Jun 2024

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

518

06 Jun 2024

RATT: A Thought Structure for Coherent and Correct LLM Reasoning

462

04 Jun 2024

Process-Driven Autoformalization in Lean 4

396

04 Jun 2024

Autoformalizing Euclidean Geometry

232

27 May 2024

Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning

221

27 May 2024

Models That Prove Their Own Correctness

469

24 May 2024

Proving Theorems RecursivelyNeural Information Processing Systems (NeurIPS), 2024

Zhengying Liu

...

Xiaodan Liang

221

23 May 2024

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Zhihong Shao

Bo Liu

Xiaodan Liang

298

159

23 May 2024

HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models

562

13 May 2024

ATG: Benchmarking Automated Theorem Generation for Generative Language Models

Zhengying Liu

Xiaodan Liang

284

05 May 2024

Generating Probabilistic Scenario Programs from Natural Language

Karim Elmaaroufi

Devan Shankar

Ana Cismaru

Marcell Vazquez-Chanlatte

Alberto L. Sangiovanni-Vincentelli

Matei A. Zaharia

Sanjit A. Seshia

262

03 May 2024

Towards Neural Synthesis for SMT-Assisted Proof-Oriented ProgrammingInternational Conference on Software Engineering (ICSE), 2024

220

03 May 2024

Towards Green AI: Current status and future research

262

01 May 2024

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

585

29 Apr 2024

Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean

Peiyang Song

Kaiyu Yang

A. Anandkumar

389

18 Apr 2024

A Survey on Deep Learning for Theorem Proving

294

15 Apr 2024

Learn from Failure: Fine-Tuning LLMs with Trial-and-Error Data for Intuitionistic Propositional Logic ProvingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

368

10 Apr 2024

LeanReasoner: Boosting Complex Logical Reasoning with Lean

243

20 Mar 2024

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

274

17 Mar 2024

Learning Guided Automated Reasoning: A Brief Survey

277

06 Mar 2024

Reliable, Adaptable, and Attributable Language Models with Retrieval

Akari Asai

Zexuan Zhong

Danqi Chen

Pang Wei Koh

Luke Zettlemoyer

Hanna Hajishirzi

Anuj Kumar

KELM RALM

340

05 Mar 2024

SynCode: LLM Generation with Grammar Augmentation

296

03 Mar 2024