v1v2 (latest)

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

International Conference on Learning Representations (ICLR), 2024

2 October 2024

ArXiv (abs)PDF HTML HuggingFace (4 upvotes)

Papers citing "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"

33 / 83 papers shown

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

...

380

14 Apr 2025

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining

860

10 Apr 2025

^2

: Self-Distilled Sparse Drafters

Mike Lasby

Nish Sinnadurai

Valavan Manohararajah

Sean Lie

Yani Andrew Ioannou

Vithursan Thangarasa

789

10 Apr 2025

SEA-LION: Southeast Asian Languages in One Network

...

430

08 Apr 2025

MegaMath: Pushing the Limits of Open Math Corpora

304

03 Apr 2025

Scaling Laws of Synthetic Data for Language Models

...

382

25 Mar 2025

TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning

445

21 Mar 2025

MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models

436

19 Mar 2025

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

436

04 Mar 2025

Large-Scale Data Selection for Instruction Tuning

369

03 Mar 2025

Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners

308

27 Feb 2025

Self-rewarding correction for mathematical reasoning

428

26 Feb 2025

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning

222

26 Feb 2025

M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance

...

588

26 Feb 2025

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

^2

R: Teaching LLMs to Self-verify and Self-correct via Reinforcement LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

480

18 Feb 2025

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

...

520

18 Feb 2025

Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving

...

458

17 Feb 2025

MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task

300

17 Feb 2025

Small Models Struggle to Learn from Strong ReasonersAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Bhaskar Ramasubramanian

Radha Poovendran

LRM

462

17 Feb 2025

Optimizing Temperature for Language Models with Multi-Sample Inference

Weihua Du

Yiming Yang

Sean Welleck

497

07 Feb 2025

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

1.2K

04 Feb 2025

Process Reinforcement through Implicit Rewards

...

514

223

03 Feb 2025

UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models

809

01 Feb 2025

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

357

249

08 Jan 2025

InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion

421

06 Jan 2025

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

459

05 Jan 2025

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

943

571

03 Jan 2025

Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning

337

23 Dec 2024

LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoning

373

17 Dec 2024

Entropy-Regularized Process Reward Model

218

15 Dec 2024

Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval

287

25 Nov 2024

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs

...

323

30 Sep 2024

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

338

29 Jul 2024