v1v2 (latest)

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

International Conference on Learning Representations (ICLR), 2024

2 October 2024

ArXiv (abs)PDF HTML HuggingFace (4 upvotes)

Papers citing "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"

50 / 83 papers shown

Not-a-Bandit: Provably No-Regret Drafter Selection in Speculative Decoding for LLMs

119

22 Oct 2025

ECG-LLM-- training and evaluation of domain-specific large language models for electrocardiography

Lara Ahrens

Wilhelm Haverkamp

Nils Strodthoff

118

21 Oct 2025

Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation

151

21 Oct 2025

FineVision: Open Data Is All You Need

Aritra Roy Gosthipaty

Andrés Marafioti

VLM

192

20 Oct 2025

QueST: Incentivizing LLMs to Generate Difficult Problems

255

20 Oct 2025

To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models

152

16 Oct 2025

HoneyBee: Data Recipes for Vision-Language Reasoners

Hritik Bansal

Devandra Singh Sachan

146

14 Oct 2025

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Guijin Son

Donghun Yang

Hitesh Laxmichand Patel

...

146

05 Oct 2025

Principled and Tractable RL for Reasoning with Diffusion Language Models

Anthony Zhan

DiffM AI4CE

05 Oct 2025

GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time

04 Oct 2025

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

...

229

01 Oct 2025

Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMs

108

28 Sep 2025

Learning More with Less: A Dynamic Dual-Level Down-Sampling Framework for Efficient Policy Optimization

117

26 Sep 2025

Exploring Solution Divergence and Its Effect on Large Language Model Problem Solving

121

26 Sep 2025

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning

223

25 Sep 2025

Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns

209

25 Sep 2025

CogAtom: From Cognitive Atoms to Olympiad-level Mathematical Reasoning in Large Language Models

230

22 Sep 2025

SAIL-VL2 Technical Report

...

285

17 Sep 2025

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

175

26 Aug 2025

Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

26 Aug 2025

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

...

291

20 Aug 2025

Data Mixing Optimization for Supervised Fine-Tuning of Large Language Models

Yuan Li

Zhengzhong Liu

Eric P. Xing

128

16 Aug 2025

Apriel-Nemotron-15B-Thinker

Shruthan Radhakrishna

...

Sathwik Tejaswi Madhusudhan

184

13 Aug 2025

MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy

115

07 Aug 2025

WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework

...

172

02 Aug 2025

SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers

Chaitanya Manem

Pratik Prabhanjan Brahma

342

28 Jul 2025

Diversity-Enhanced Reasoning for Subjective Questions

470

27 Jul 2025

PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training

Sarat Chandra Bobbili

Ujwal Dinesha

Dheeraj Narasimha

S. Shakkottai

147

26 Jul 2025

GenSelect: A Generative Approach to Best-of-N

135

23 Jul 2025

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

315

22 Jul 2025

EvoLM: In Search of Lost Language Model Training Dynamics

304

19 Jun 2025

Test-Time-Scaling for Zero-Shot Diagnosis with Visual-Language Reasoning

161

11 Jun 2025

TaskCraft: Automated Generation of Agentic Tasks

...

302

11 Jun 2025

Reinforce LLM Reasoning through Multi-Agent Reflection

Yurun Yuan

Tengyang Xie

LRM

305

10 Jun 2025

A Survey on Large Language Models for Mathematical Reasoning

...

269

10 Jun 2025

Improving Large Language Models with Concept-Aware Fine-Tuning

273

09 Jun 2025

SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms

306

06 Jun 2025

Establishing Trustworthy LLM Evaluation via Shortcut Neuron AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

223

04 Jun 2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

...

302

29 May 2025

Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective

202

28 May 2025

LASER: Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy

Paramita Mirza

Lucas Weber

Fabian Küch

272

28 May 2025

ReCopilot: Reverse Engineering Copilot in Binary Analysis

209

22 May 2025

Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning

469

22 May 2025

Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

311

21 May 2025

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

345

19 May 2025

Multi-Token Prediction Needs Registers

Anastasios Gerontopoulos

Spyros Gidaris

N. Komodakis

375

15 May 2025

FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation

Chaitali Bhattacharyya

297

01 May 2025

Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets

551

28 Apr 2025

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

286

23 Apr 2025

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

340

14 Apr 2025

All Papers

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Papers citing "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"