2308.09583
Cited By
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
3 January 2025
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
Chongyang Tao
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
OSLM
LRM
Papers citing
"WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct"
50 / 338 papers shown
Title
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Xinyu Guan
L. Zhang
Yifei Liu
Ning Shang
Youran Sun
Yi Zhu
Fan Yang
Mao Yang
LRM
SyDa
ReLM
50
74
0
08 Jan 2025
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics
Ruilin Luo
Zhuofan Zheng
Yifan Wang
Yiyao Yu
Xinzhe Ni
Zicheng Lin
Jin Zeng
Yujiu Yang
LRM
42
12
0
08 Jan 2025
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Yuchun Fan
Yongyu Mu
Yilin Wang
Lei Huang
Junhao Ruan
B. Li
Tong Xiao
Shujian Huang
Xiaocheng Feng
Jingbo Zhu
LRM
38
3
0
08 Jan 2025
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Mingyang Song
Zhaochen Su
Xiaoye Qu
Jiawei Zhou
Yu-Xi Cheng
LRM
36
29
0
06 Jan 2025
Mathematical Language Models: A Survey
W. Liu
Hanglei Hu
Jie Zhou
Yuyang Ding
Junsong Li
...
Mengliang He
Qin Chen
Bo Jiang
Aimin Zhou
Liang He
LRM
62
12
0
03 Jan 2025
Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network
Ritik Mehta
Olha Jurecková
Mark Stamp
48
0
0
25 Dec 2024
Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English
Avinash Anand
Kritarth Prasad
Chhavi Kirtani
Ashwin R Nair
Manvendra Kumar Nema
Raj Jaiswal
R. Shah
LRM
30
2
0
24 Dec 2024
System-2 Mathematical Reasoning via Enriched Instruction Tuning
Huanqia Cai
Yijun Yang
Zhifeng Li
LRM
60
0
0
22 Dec 2024
MetaRuleGPT: Recursive Numerical Reasoning of Language Models Trained with Simple Rules
Kejie Chen
Lin Wang
Qinghai Zhang
Renjun Xu
ReLM
LRM
73
0
0
18 Dec 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoning
Hongbin Zhang
K. Chen
Xuefeng Bai
Yang Xiang
Min Zhang
61
0
0
17 Dec 2024
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs
Chengwei Wei
Bin Wang
Jung-jae Kim
Guimei Liu
Nancy F. Chen
LRM
66
0
0
16 Dec 2024
Smaller Language Models Are Better Instruction Evolvers
Tingfeng Hui
Lulu Zhao
Guanting Dong
Yaqi Zhang
Hua Zhou
Sen Su
ALM
79
1
0
15 Dec 2024
Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation
Qinhong Lin
Linna Zhou
Zhongliang Yang
Yuang Cai
HILM
70
0
0
10 Dec 2024
Neuro-Symbolic Data Generation for Math Reasoning
Zenan Li
Zhi-Hua Zhou
Yuan Yao
Yu Li
Chun Cao
Fan Yang
Xian Zhang
Xiaoxing Ma
OffRL
LRM
57
0
0
06 Dec 2024
Does Few-Shot Learning Help LLM Performance in Code Synthesis?
Derek Xu
Tong Xie
Botao Xia
Haoyu Li
Yunsheng Bai
Yizhou Sun
Wei Wang
71
0
0
03 Dec 2024
Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents
Raj Jaiswal
Dhruv Jain
Harsh Parimal Popat
Avinash Anand
Abhishek Dharmadhikari
Atharva Marathe
R. Shah
LRM
AI4CE
81
2
0
01 Dec 2024
Towards Adaptive Mechanism Activation in Language Agent
Ziyang Huang
Jun Zhao
Kang-Jun Liu
LLMAG
AI4CE
63
0
0
01 Dec 2024
Mars-PO: Multi-Agent Reasoning System Preference Optimization
Xiaoxuan Lou
Chaojie Wang
Bo An
LLMAG
LRM
59
0
0
28 Nov 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMe
FedML
66
1
0
27 Nov 2024
FREE-Merging: Fourier Transform for Efficient Model Merging
Shenghe Zheng
Hongzhi Wang
MoMe
59
0
0
25 Nov 2024
PSPO*: An Effective Process-supervised Policy Optimization for Reasoning Alignment
Jiawei Li
Xinyue Liang
Yizhe Yang
Chong Feng
Yang Gao
LRM
56
0
0
18 Nov 2024
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology
Junior Cedric Tonga
Benjamin Clément
Pierre-Yves Oudeyer
LRM
23
2
0
05 Nov 2024
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch
Yuyang Ding
Xinyu Shi
Xiaobo Liang
Juntao Li
Qiaoming Zhu
Min Zhang
ELM
AIMat
SyDa
LRM
16
1
0
24 Oct 2024
Understanding Layer Significance in LLM Alignment
Guangyuan Shi
Zexin Lu
Xiaoyu Dong
Wenlong Zhang
Xuanyu Zhang
Yujie Feng
Xiao-Ming Wu
28
2
0
23 Oct 2024
Markov Chain of Thought for Efficient Mathematical Reasoning
Wen Yang
Kai Fan
Minpeng Liao
LRM
31
4
0
23 Oct 2024
Learning Mathematical Rules with Large Language Models
Antoine Gorceix
Bastien Le Chenadec
Ahmad Rammal
N. Vadori
Manuela Veloso
16
0
0
22 Oct 2024
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Yuli Qiu
Jiashu Yao
Heyan Huang
Yuhang Guo
LRM
24
0
0
22 Oct 2024
Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Qintong Li
Jiahui Gao
Sheng Wang
Renjie Pi
Xueliang Zhao
Chuan Wu
Xin Jiang
Z. Li
Lingpeng Kong
SyDa
13
0
0
22 Oct 2024
ToW: Thoughts of Words Improve Reasoning in Large Language Models
Zhikun Xu
Ming Shen
Jacob Dineen
Zhaonan Li
Xiao Ye
Shijie Lu
Aswin Rrv
Chitta Baral
Ben Zhou
LRM
35
1
0
21 Oct 2024
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Qiaoyu Tang
Le Yu
Bowen Yu
Hongyu Lin
K. Lu
Y. Lu
Xianpei Han
Le Sun
MoMe
19
1
0
17 Oct 2024
Unconstrained Model Merging for Enhanced LLM Reasoning
Yiming Zhang
Baoyi He
Shengyu Zhang
Yuhao Fu
Qi Zhou
...
Guanghan Ning
Linyi Li
Chunlin Ji
Fei Wu
Hongxia Yang
MoMe
27
0
0
17 Oct 2024
LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch
Caigao Jiang
Xiang Shu
Hong Qian
Xingyu Lu
Jun-ping Zhou
Aimin Zhou
Yang Yu
30
1
0
17 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Z. Liu
Shiwei Li
...
Chenkai Zhang
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
30
13
0
16 Oct 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
Mingyang Chen
Haoze Sun
Tianpeng Li
Fan Yang
Hao Liang
Keer Lu
Bin Cui
Wentao Zhang
Zenan Zhou
Weipeng Chen
LRM
36
5
0
16 Oct 2024
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Chenglin Li
Qianglong Chen
Zhi Li
Feng Tao
Yicheng Li
Hao Chen
Fei Yu
Yin Zhang
SyDa
21
0
0
14 Oct 2024
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation
Guanting Dong
Xiaoshuai Song
Y. X. Zhu
Runqi Qiao
Zhicheng Dou
Ji-Rong Wen
3DV
32
4
0
12 Oct 2024
LLM×MapReduce: Simplified Long-Sequence Processing using Large Language Models
Zihan Zhou
C. Li
Xinyi Chen
Shuo Wang
Yu Chao
...
Zhixing Tan
Xu Han
Xiaodong Shi
Zhiyuan Liu
Maosong Sun
14
0
0
12 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
27
3
0
12 Oct 2024
DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization
Yanfeng Jiang
Zelan Yang
B. Chen
Shen Li
Yong Li
Tao Li
MQ
19
0
0
11 Oct 2024
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
L. Yang
Zhaochen Yu
T. Zhang
Minkai Xu
Joseph E. Gonzalez
Bin Cui
Shuicheng Yan
ELM
ReLM
LRM
29
0
0
11 Oct 2024
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation
Thomas Gauthier-Caron
Shamane Siriwardhana
Elliot Stein
Malikeh Ehghaghi
Charles Goddard
Mark McQuade
Jacob Solawetz
Maxime Labonne
MoMe
23
0
0
10 Oct 2024
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
Yougang Lyu
Lingyong Yan
Zihan Wang
Dawei Yin
Pengjie Ren
Maarten de Rijke
Z. Z. Ren
53
6
0
10 Oct 2024
Self-Boosting Large Language Models with Synthetic Preference Data
Qingxiu Dong
Li Dong
Xingxing Zhang
Zhifang Sui
Furu Wei
SyDa
26
1
0
09 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
29
7
0
09 Oct 2024
Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Kaishuai Xu
Tiezheng Yu
Wenjun Hou
Yi Cheng
Chak Tou Leong
Liangyou Li
Xin Jiang
Lifeng Shang
Qun Liu
Wenjie Li
LRM
45
0
0
09 Oct 2024
TOWER: Tree Organized Weighting for Evaluating Complex Instructions
Noah Ziems
Zhihan Zhang
Meng-Long Jiang
ALM
16
0
0
08 Oct 2024
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta
Shreyas Verma
Ujjwala Anantheswaran
Kevin Scaria
Mihir Parmar
Swaroop Mishra
Chitta Baral
ReLM
LRM
21
2
0
06 Oct 2024
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
Zhenwen Liang
Ye Liu
Tong Niu
Xiangliang Zhang
Yingbo Zhou
Semih Yavuz
LRM
17
8
0
05 Oct 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue
Wenlin Yao
Haitao Mi
Dian Yu
Ziyu Yao
Dong Yu
LRM
25
4
0
04 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Mohit Bansal
Tsendsuren Munkhdalai
MoMe
34
12
0
04 Oct 2024