ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.14318
  4. Cited By
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct
  Solutions
v1v2 (latest)

Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions

International Conference on Learning Representations (ICLR), 2022
28 May 2022
Ansong Ni
J. Inala
Chenglong Wang
Oleksandr Polozov
Christopher Meek
Dragomir R. Radev
Jianfeng Gao
    ReLMAIMatLRM
ArXiv (abs)PDFHTMLGithub (27★)

Papers citing "Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions"

40 / 40 papers shown
Title
In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback
In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback
Mingye Zhu
Yi Liu
Zheren Fu
Quan Wang
Yongdong Zhang
LLMAGLRM
203
0
0
13 Nov 2025
ReviewScore: Misinformed Peer Review Detection with Large Language Models
ReviewScore: Misinformed Peer Review Detection with Large Language Models
Hyun Ryu
Doohyuk Jang
Hyemin S. Lee
Joonhyun Jeong
Gyeongman Kim
...
Kwanhyung Lee
Chanjae Park
Heecheol Yun
Gregor Betz
Eunho Yang
131
0
0
25 Sep 2025
GPO: Learning from Critical Steps to Improve LLM Reasoning
GPO: Learning from Critical Steps to Improve LLM Reasoning
Jiahao Yu
Zelei Cheng
Xian Wu
Xinyu Xing
LRM
175
2
0
19 Sep 2025
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Chongli Qin
Jost Tobias Springenberg
OffRL
203
11
0
17 Jul 2025
Can Large Reasoning Models Self-Train?
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLMOffRLLRM
369
19
0
27 May 2025
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Huayu Chen
Kaiwen Zheng
Qinsheng Zhang
Ganqu Cui
Yin Cui
Haotian Ye
Tsung-Yi Lin
Ming-Yu Liu
Jun Zhu
Haoxiang Wang
OffRLLRM
497
14
0
23 May 2025
STaR-SQL: Self-Taught Reasoner for Text-to-SQL
STaR-SQL: Self-Taught Reasoner for Text-to-SQLAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Mingqian He
Yongliang Shen
Weinan Zhang
Qiuying Peng
Jun Wang
Weiming Lu
ReLMLRM
182
9
0
20 Feb 2025
Evolutionary Pre-Prompt Optimization for Mathematical Reasoning
Evolutionary Pre-Prompt Optimization for Mathematical Reasoning
Mathurin Videau
Alessandro Leite
Marc Schoenauer
O. Teytaud
ReLMLRM
216
2
0
05 Dec 2024
Keep Guessing? When Considering Inference Scaling, Mind the Baselines
Keep Guessing? When Considering Inference Scaling, Mind the BaselinesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
G. Yona
Or Honovich
Omer Levy
Roee Aharoni
UQLMLRM
383
0
0
20 Oct 2024
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Hao Ma
Tianyi Hu
Zhiqiang Pu
Boyin Liu
Xiaolin Ai
Yanyan Liang
Min Chen
390
22
0
08 Oct 2024
Reasoning Paths Optimization: Learning to Reason and Explore From
  Diverse Paths
Reasoning Paths Optimization: Learning to Reason and Explore From Diverse PathsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yew Ken Chia
Guizhen Chen
Weiwen Xu
Luu Anh Tuan
Soujanya Poria
Lidong Bing
LRM
212
3
0
07 Oct 2024
Interpreting and Improving Large Language Models in Arithmetic
  Calculation
Interpreting and Improving Large Language Models in Arithmetic CalculationInternational Conference on Machine Learning (ICML), 2024
Wei Zhang
Chaoqun Wan
Yonggang Zhang
Yiu-ming Cheung
Xinmei Tian
Xu Shen
Jieping Ye
LRM
313
36
0
03 Sep 2024
Weak-to-Strong Reasoning
Weak-to-Strong Reasoning
Yuqing Yang
Yan Ma
Pengfei Liu
LRM
306
28
0
18 Jul 2024
Advancing Process Verification for Large Language Models via Tree-Based
  Preference Learning
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
Mingqian He
Yongliang Shen
Wenqi Zhang
Zeqi Tan
Weiming Lu
LRM
190
13
0
29 Jun 2024
PORT: Preference Optimization on Reasoning Traces
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou
Abdalgader Abubaker
Hakim Hacid
LRM
311
7
0
23 Jun 2024
Interactive Evolution: A Neural-Symbolic Self-Training Framework For
  Large Language Models
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
Fangzhi Xu
Qiushi Sun
Kanzhi Cheng
Jing Liu
Yu Qiao
Zhiyong Wu
LLMAG
166
8
0
17 Jun 2024
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning
  in LLMs
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
Xuan Zhang
Chao Du
Tianyu Pang
Qian Liu
Wei Gao
Min Lin
LRMAI4CE
260
117
0
13 Jun 2024
AICoderEval: Improving AI Domain Code Generation of Large Language
  Models
AICoderEval: Improving AI Domain Code Generation of Large Language Models
Yinghui Xia
Yuyan Chen
Tianyu Shi
Jun Wang
Jinsong Yang
138
5
0
07 Jun 2024
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in
  Language Models
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai
Malvina Nissim
LRM
398
30
0
04 Jun 2024
NExT: Teaching Large Language Models to Reason about Code Execution
NExT: Teaching Large Language Models to Reason about Code Execution
Ansong Ni
Miltiadis Allamanis
Arman Cohan
Yinlin Deng
Kensen Shi
Charles Sutton
Pengcheng Yin
ReLMLRM
251
60
0
23 Apr 2024
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of
  Language Models with Fine-grained Rewards
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
Hyeonbin Hwang
Doyoung Kim
Seungone Kim
Seonghyeon Ye
Minjoon Seo
LRMReLM
321
7
0
16 Apr 2024
Eliciting Better Multilingual Structured Reasoning from LLMs through
  Code
Eliciting Better Multilingual Structured Reasoning from LLMs through Code
Bryan Li
Tamer Alkhouli
Daniele Bonadiman
Nikolaos Pappas
Saab Mansour
LRM
331
15
0
05 Mar 2024
Debug like a Human: A Large Language Model Debugger via Verifying
  Runtime Execution Step-by-step
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step
Li Zhong
Zilong Wang
Jingbo Shang
400
121
0
25 Feb 2024
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning
Zui Chen
Yezeng Chen
Jiaqi Han
Zhijie Huang
Ji Qi
Yi Zhou
LRM
182
7
0
23 Feb 2024
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
Pragya Srivastava
Manuj Malik
Vivek Gupta
T. Ganu
Dan Roth
242
37
0
17 Feb 2024
V-STaR: Training Verifiers for Self-Taught Reasoners
V-STaR: Training Verifiers for Self-Taught Reasoners
Arian Hosseini
Xingdi Yuan
Nikolay Malkin
Rameswar Panda
Alessandro Sordoni
Rishabh Agarwal
ReLMLRM
297
191
0
09 Feb 2024
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt
Blazej Manczak
Auke Wiggers
Corrado Rainone
David W. Zhang
Michaël Defferrard
Taco S. Cohen
ReLMLRM
196
26
0
07 Feb 2024
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on
  Model-induced Process Supervision
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang
Yunxuan Li
Yuexin Wu
Liangchen Luo
Le Hou
Hongkun Yu
Jingbo Shang
LRM
200
42
0
05 Feb 2024
TinyGSM: achieving >80% on GSM8k with small language models
TinyGSM: achieving >80% on GSM8k with small language models
Bingbin Liu
Sébastien Bubeck
Ronen Eldan
Janardhan Kulkarni
Yuanzhi Li
Anh Nguyen
Rachel A. Ward
Yi Zhang
ALM
231
56
0
14 Dec 2023
Beyond Human Data: Scaling Self-Training for Problem-Solving with
  Language Models
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Avi Singh
John D. Co-Reyes
Rishabh Agarwal
Ankesh Anand
Piyush Patil
...
Yamini Bansal
Ethan Dyer
Behnam Neyshabur
Jascha Narain Sohl-Dickstein
Noah Fiedel
ALMLRMReLMSyDa
574
247
0
11 Dec 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
221
1
0
19 Oct 2023
Exploration with Principles for Diverse AI Supervision
Exploration with Principles for Diverse AI Supervision
Hao Liu
Matei A. Zaharia
Pieter Abbeel
274
2
0
13 Oct 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on
  Math Reasoning
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Chengpeng Li
Zheng Yuan
Hongyi Yuan
Guanting Dong
Keming Lu
Jiancan Wu
Chuanqi Tan
Xiang Wang
Chang Zhou
LRM
290
42
0
09 Oct 2023
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning
  in Large Language Models
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Song Jiang
Zahra Shakeri
Aaron Chan
Maziar Sanjabi
Hamed Firooz
...
Bugra Akyildiz
Luke Huan
Jinchao Li
Qifan Wang
Asli Celikyilmaz
LRMReLM
265
10
0
07 Oct 2023
Large Language Model Cascades with Mixture of Thoughts Representations
  for Cost-efficient Reasoning
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient ReasoningInternational Conference on Learning Representations (ICLR), 2023
Murong Yue
Jie Zhao
Min Zhang
Liang Du
Ziyu Yao
LRM
341
116
0
04 Oct 2023
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large
  Language Models
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language ModelsTransactions of the Association for Computational Linguistics (TACL), 2023
Ansong Ni
Pengcheng Yin
Yilun Zhao
Chen Wei
Yanjun Wang
...
Mingyuan Zhang
Chen Change Loy
Yingbo Zhou
Dragomir R. Radev
Arman Cohan
ELM
243
29
0
29 Sep 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-InstructInternational Conference on Learning Representations (ICLR), 2023
Haipeng Luo
Qingfeng Sun
Can Xu
Lu Wang
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRMOSLM
788
622
0
18 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large
  Language Models
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRMALM
307
281
0
03 Aug 2023
GRACE: Discriminator-Guided Chain-of-Thought Reasoning
GRACE: Discriminator-Guided Chain-of-Thought ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Ho Hin Lee
Lu Wang
LRM
343
52
0
24 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language ModelsInternational Conference on Language Resources and Evaluation (LREC), 2023
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Amélie Reymond
LRM
316
8
0
21 May 2023
1