OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

International Conference on Learning Representations (ICLR), 2025
2 October 2024
Shubham Toshniwal
Wei Du
Ivan Moshkov
Branislav Kisacanin
Alexan Ayrapetyan
Igor Gitman
    LRM
ArXiv (abs), PDF, HTML, HuggingFace (4 upvotes)

Papers citing "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"

50 / 83 papers shown
Not-a-Bandit: Provably No-Regret Drafter Selection in Speculative Decoding for LLMs
Hongyi Liu
Jiaji Huang
Zhen Jia
Youngsuk Park
Yu Wang
OffRL
138
2
0
22 Oct 2025
ECG-LLM -- training and evaluation of domain-specific large language models for electrocardiography
Lara Ahrens
Wilhelm Haverkamp
Nils Strodthoff
128
0
0
21 Oct 2025
Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation
Giovanni De Muri
Mark Vero
Robin Staab
Martin Vechev
165
0
0
21 Oct 2025
FineVision: Open Data Is All You Need
Luis Wiedmann
Orr Zohar
Amir Mahla
Xiaohan Wang
Rui Li
Thibaud Frere
Leandro von Werra
Aritra Roy Gosthipaty
Andrés Marafioti
VLM
195
13
0
20 Oct 2025
QueST: Incentivizing LLMs to Generate Difficult Problems
Hanxu Hu
Xingxing Zhang
Jannis Vamvas
Rico Sennrich
Furu Wei
AIMat, SyDa, MQ, LRM
255
0
0
20 Oct 2025
To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models
Eran Malach
Omid Saremi
Sinead Williamson
Arwen Bradley
Aryo Lotfi
Emmanuel Abbe
J. Susskind
Etai Littwin
164
0
0
16 Oct 2025
HoneyBee: Data Recipes for Vision-Language Reasoners
Hritik Bansal
Devandra Singh Sachan
Kai-Wei Chang
Aditya Grover
Gargi Ghosh
Wen-tau Yih
Ramakanth Pasunuru
VLM, LRM
161
3
0
14 Oct 2025
Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought
Guijin Son
Donghun Yang
Hitesh Laxmichand Patel
Amit Agarwal
Hyunwoo Ko
...
Minhyuk Kim
Nikunj Drolia
Dasol Choi
Kyong-Ha Lee
Youngjae Yu
LRM
151
1
0
05 Oct 2025
Principled and Tractable RL for Reasoning with Diffusion Language Models
Anthony Zhan
DiffM, AI4CE
111
2
0
05 Oct 2025
GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time
Divij Handa
Mihir Parmar
Aswin Rrv
Md Nayem Uddin
Hamid Palangi
Chitta Baral
93
0
0
04 Oct 2025
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen
Xavier Hu
Y. Liu
Ziqi Wang
Zeyi Liao
...
Feng Wei
Yuxi Qian
Bo Zheng
Keting Yin
Shengyu Zhang
LLMAG
237
1
0
01 Oct 2025
Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMs
Shulin Huang
Yiran Ding
Junshu Pan
Yue Zhang
OffRL, LRM
130
1
0
28 Sep 2025
Learning More with Less: A Dynamic Dual-Level Down-Sampling Framework for Efficient Policy Optimization
Chao Wang
Tao Yang
Hongtao Tian
Yunsheng Shi
Qiyao Ma
Xiaotao Liu
Ting Yao
Wenbo Ding
OffRL
121
0
0
26 Sep 2025
Exploring Solution Divergence and Its Effect on Large Language Model Problem Solving
Hang Li
Kaiqi Yang
Yucheng Chu
Hui Liu
Shucheng Zhou
MoMe, LRM
121
1
0
26 Sep 2025
ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
Qizhi Pei
Zhuoshi Pan
Honglin Lin
Xin Gao
Yu Li
Zinan Tang
Conghui He
Rui Yan
Lijun Wu
AIMat, OffRL, LRM
225
2
0
25 Sep 2025
Expanding Reasoning Potential in Foundation Model by Learning Diverse Chains of Thought Patterns
Xuemiao Zhang
Can Ren
Chengying Tu
Rongxiang Weng
Shuo Wang
Hongfei Yan
Jingang Wang
Xunliang Cai
LRM, AI4CE
216
1
0
25 Sep 2025
CogAtom: From Cognitive Atoms to Olympiad-level Mathematical Reasoning in Large Language Models
Zhuofan Chen
Jiyuan He
Yichi Zhang
Xing Hu
Haoxing Wen
Jun Bai
Wenge Rong
LRM
239
0
0
22 Sep 2025
SAIL-VL2 Technical Report
Weijie Yin
Yongjie Ye
Fangxun Shu
Yue Liao
Zijian Kang
...
Han Wang
Wenzhuo Liu
Xiao Liang
Shuicheng Yan
Chao Feng
LRM, VLM
297
4
0
17 Sep 2025
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Taishi Nakamura
Satoki Ishikawa
Masaki Kawamura
Takumi Okamoto
Daisuke Nohara
Jun Suzuki
Rio Yokota
MoE, LRM
175
0
0
26 Aug 2025
Can Structured Templates Facilitate LLMs in Tackling Harder Tasks?: An Exploration of Scaling Laws by Difficulty
Zhichao Yang
Zhaoxin Fan
Gen Li
Yuanze Hu
Xinyu Wang
Ye Qiu
Xin Wang
Yifan Sun
Wenjun Wu
LRM
85
0
0
26 Aug 2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Nvidia
Aarti Basant
Abhijit Khairnar
Abhijit Paithankar
Abhinav Khattar
...
Keith Wyss
Keshav Santhanam
Kezhi Kong
Krzysztof Pawelec
Kumar Anik
LRM
298
0
0
20 Aug 2025
Data Mixing Optimization for Supervised Fine-Tuning of Large Language Models
Yuan Li
Zhengzhong Liu
Eric P. Xing
139
1
0
16 Aug 2025
Apriel-Nemotron-15B-Thinker
Shruthan Radhakrishna
S. Parikh
Gopal Sarda
Anil Turkkan
Quaizar Vohra
...
Sathwik Tejaswi Madhusudhan
Torsten Scholak
Sébastien Paquet
Sagar Davasam
Srinivas Sunkara
LLMAG, MoE, LRM
205
2
0
13 Aug 2025
MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy
Shaoxiong Zhan
Yanlin Lai
Ziyu Lu
Dahua Lin
Ziqing Yang
Fei Tang
LRM
124
10
0
07 Aug 2025
WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework
Yue Chen
Minghua He
Fangkai Yang
Pu Zhao
Lu Wang
...
Yuefeng Zhan
Hao Sun
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
181
2
0
02 Aug 2025
SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers
Chaitanya Manem
Pratik Prabhanjan Brahma
Prakamya Mishra
Zicheng Liu
Emad Barsoum
AIMat, LRM
344
4
0
28 Jul 2025
Diversity-Enhanced Reasoning for Subjective Questions
Yumeng Wang
Zhiyuan Fan
Jiayu Liu
J. Huang
Yi R. Fung
LRM
493
6
0
27 Jul 2025
PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training
Sarat Chandra Bobbili
Ujwal Dinesha
Dheeraj Narasimha
S. Shakkottai
165
2
0
26 Jul 2025
GenSelect: A Generative Approach to Best-of-N
Shubham Toshniwal
Ivan Sorokin
Aleksander Ficek
Ivan Moshkov
Igor Gitman
LRM
142
7
0
23 Jul 2025
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Run-Ze Fan
Zengzhi Wang
Pengfei Liu
LRM
326
15
0
22 Jul 2025
EvoLM: In Search of Lost Language Model Training Dynamics
Zhenting Qi
Fan Nie
Alexandre Alahi
James Zou
Himabindu Lakkaraju
Yilun Du
Eric P. Xing
Sham Kakade
Hanlin Zhang
316
2
0
19 Jun 2025
Test-Time-Scaling for Zero-Shot Diagnosis with Visual-Language Reasoning
Ji Young Byun
Young-Jin Park
Navid Azizan
Rama Chellappa
LM&MA, LRM
161
1
0
11 Jun 2025
TaskCraft: Automated Generation of Agentic Tasks
Dingfeng Shi
Jingyi Cao
Qianben Chen
W. Sun
W. Li
...
Jiaheng Liu
Changwang Zhang
Jun Wang
Yuchen Eleanor Jiang
Wangchunshu Zhou
305
20
0
11 Jun 2025
Reinforce LLM Reasoning through Multi-Agent Reflection
Yurun Yuan
Tengyang Xie
LRM
317
16
0
10 Jun 2025
A Survey on Large Language Models for Mathematical Reasoning
Peng-Yuan Wang
Tian-Shuo Liu
Chenyang Wang
Yi-Di Wang
Shu Yan
...
Xu-Hui Liu
Xin-Wei Chen
Jia-Cheng Xu
Ziniu Li
Yang Yu
LRM
279
21
0
10 Jun 2025
Improving Large Language Models with Concept-Aware Fine-Tuning
Michael K. Chen
Xikun Zhang
Jiaxing Huang
Dacheng Tao
283
1
0
09 Jun 2025
SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms
Alex Havrilla
Edward Hughes
Mikayel Samvelyan
Jacob Abernethy
SyDa, LRM
324
5
0
06 Jun 2025
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kejian Zhu
Shangqing Tu
Zhuoran Jin
Lei Hou
Juanzi Li
Jun Zhao
KELM
226
0
0
04 Jun 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
Ziyin Zhang
Jiahao Xu
Zhiwei He
Tian Liang
Qiuzhi Liu
...
Zhuosheng Zhang
Rui Wang
Zhaopeng Tu
Haitao Mi
Dong Yu
OffRL, LRM
310
10
0
29 May 2025
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective
Qingchuan Ma
Yuhang Wu
Xiawu Zheng
Rongrong Ji
202
1
0
28 May 2025
LASER: Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy
Paramita Mirza
Lucas Weber
Fabian Küch
287
0
0
28 May 2025
ReCopilot: Reverse Engineering Copilot in Binary Analysis
Guoqiang Chen
Huiqi Sun
Daguang Liu
Zhiqi Wang
Qiang Wang
Bin Yin
Lu Liu
Lingyun Ying
217
6
0
22 May 2025
Watch your steps: Dormant Adversarial Behaviors that Activate upon LLM Finetuning
Thibaud Gloaguen
Mark Vero
Robin Staab
Martin Vechev
AAML
483
0
0
22 May 2025
Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
Jingcong Liang
Siyuan Wang
Miren Tian
Yitong Li
Duyu Tang
Zhongyu Wei
MoE
321
0
0
21 May 2025
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
Xiaoyuan Liu
Tian Liang
Zhiwei He
Jiahao Xu
Wenxuan Wang
Pinjia He
Zhaopeng Tu
Haitao Mi
Dong Yu
OffRL, ReLM, LRM
347
15
0
19 May 2025
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos
Spyros Gidaris
N. Komodakis
377
3
0
15 May 2025
FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation
Chaitali Bhattacharyya
Hyunsei Lee
Junyoung Lee
Shinhyoung Jang
Il hong Suh
Yeseong Kim
309
1
0
01 May 2025
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Adam Younsi
Abdalgader Abubaker
Hakim Hacid
Salem Lahlou
LRM
553
6
0
28 Apr 2025
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
Ivan Moshkov
Darragh Hanley
Ivan Sorokin
Shubham Toshniwal
Christof Henkel
Benedikt Schifferer
Wei Du
Igor Gitman
ReLM, LRM
302
78
0
23 Apr 2025
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Junxiong Wang
Wen-Ding Li
Daniele Paliotta
Daniel Ritter
Alexander M. Rush
Tri Dao
LRM
352
12
0
14 Apr 2025