ResearchTrend.AI
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

3 January 2025
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
Chongyang Tao
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
    OSLM
    LRM

Papers citing "WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct"

50 / 338 papers shown
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
19
5
0
25 Jun 2024
Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms
Mirabel Reid
Santosh Vempala
ELM
27
0
0
20 Jun 2024
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch
Hasan Hammoud
Umberto Michieli
Fabio Pizzati
Philip H. S. Torr
Adel Bibi
Bernard Ghanem
Mete Ozay
MoMe
24
14
0
20 Jun 2024
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
Chaojie Wang
Yanchen Deng
Zhiyi Lyu
Liang Zeng
Jujie He
Shuicheng Yan
Bo An
LRM
ReLM
21
41
0
20 Jun 2024
Timo: Towards Better Temporal Reasoning for Language Models
Zhaochen Su
Jun Zhang
Tong Zhu
Xiaoye Qu
Juntao Li
Min Zhang
Yu Cheng
LRM
34
15
0
20 Jun 2024
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
Bofei Gao
Zefan Cai
Runxin Xu
Peiyi Wang
Ce Zheng
...
Chang Zhou
Wen Xiao
Junjie Hu
Tianyu Liu
Baobao Chang
LRM
22
6
0
20 Jun 2024
Low-Redundant Optimization for Large Language Model Alignment
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Jingyuan Wang
Ji-Rong Wen
24
0
0
18 Jun 2024
Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models
Jie Chen
Yupeng Zhang
Bingning Wang
Wayne Xin Zhao
Ji-Rong Wen
Weipeng Chen
SyDa
19
4
0
18 Jun 2024
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
Yuxuan Tong
Xiwen Zhang
Rui Wang
R. Wu
Junxian He
AIMat
LRM
25
12
0
18 Jun 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
Zhihan Zhang
Zhenwen Liang
Wenhao Yu
Dian Yu
Mengzhao Jia
Dong Yu
Meng-Long Jiang
AIMat
RALM
LRM
ReLM
13
2
0
17 Jun 2024
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
Pala Tej Deep
Rishabh Bhardwaj
Soujanya Poria
MoMe
19
13
0
17 Jun 2024
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation
Shihao Cai
Keqin Bao
Hangyu Guo
Jizhi Zhang
Jun Song
Bo Zheng
28
6
0
17 Jun 2024
A Survey on Human Preference Learning for Large Language Models
Ruili Jiang
Kehai Chen
Xuefeng Bai
Zhixuan He
Juntao Li
Muyun Yang
Tiejun Zhao
Liqiang Nie
Min Zhang
31
1
0
17 Jun 2024
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
Shengkang Wang
Hongzhan Lin
Ziyang Luo
Zhen Ye
Guang Chen
Jing Ma
44
3
0
17 Jun 2024
Toward Optimal LLM Alignments Using Two-Player Games
Rui Zheng
Hongyi Guo
Zhihan Liu
Xiaoying Zhang
Yuanshun Yao
...
Tao Gui
Qi Zhang
Xuanjing Huang
Hang Li
Yang Liu
53
5
0
16 Jun 2024
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
Joykirat Singh
A. Nambi
Vibhav Vineet
LRM
19
1
0
16 Jun 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
Yu Zhang
Xiusi Chen
Bowen Jin
Sheng Wang
Shuiwang Ji
Wei Wang
Jiawei Han
33
17
0
16 Jun 2024
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
Lin Long
Rui Wang
Ruixuan Xiao
Junbo Zhao
Xiao Ding
Gang Chen
Haobo Wang
SyDa
45
88
0
14 Jun 2024
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su
Juntao Li
Jun Zhang
Tong Zhu
Xiaoye Qu
Pan Zhou
Yan Bowen
Yu Cheng
Min Zhang
LRM
37
11
0
13 Jun 2024
ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models
Jing Liu
Ruihao Gong
Mingyang Zhang
Yefei He
Jianfei Cai
Bohan Zhuang
MoE
37
0
0
13 Jun 2024
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Bowen Ping
Shuo Wang
Hanqing Wang
Xu Han
Yuzhuang Xu
Yukun Yan
Yun Chen
Baobao Chang
Zhiyuan Liu
Maosong Sun
MQ
37
4
0
13 Jun 2024
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
Di Zhang
Xiaoshui Huang
Dongzhan Zhou
Yuqiang Li
Wanli Ouyang
LRM
25
51
0
11 Jun 2024
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
David Wadden
Kejian Shi
Jacob Morrison
Aakanksha Naik
Shruti Singh
...
Luca Soldaini
Shannon Zejiang Shen
Doug Downey
Hannaneh Hajishirzi
Arman Cohan
29
11
0
10 Jun 2024
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai
Malvina Nissim
LRM
23
4
0
04 Jun 2024
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Haolong Li
Yu Ma
Yinqi Zhang
Chen Ye
Jie Chen
ReLM
LRM
24
1
0
04 Jun 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
Shuyang Jiang
Yusheng Liao
Ya-Qin Zhang
Yu Wang
Yanfeng Wang
19
2
0
30 May 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Zhenwen Liang
Dian Yu
Wenhao Yu
Wenlin Yao
Zhihan Zhang
Xiangliang Zhang
Dong Yu
LRM
27
1
0
29 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
16
3
0
26 May 2024
GECKO: Generative Language Model for English, Code and Korean
Sungwoo Oh
Donggyu Kim
VLM
19
0
0
24 May 2024
SimPO: Simple Preference Optimization with a Reference-Free Reward
Yu Meng
Mengzhou Xia
Danqi Chen
29
335
0
23 May 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
Kun Zhou
Beichen Zhang
Jiapeng Wang
Zhipeng Chen
Wayne Xin Zhao
Jing Sha
Zhichao Sheng
Shijin Wang
Ji-Rong Wen
SyDa
LRM
25
29
0
23 May 2024
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
Yefei He
Luoming Zhang
Weijia Wu
Jing Liu
Hong Zhou
Bohan Zhuang
MQ
22
12
0
23 May 2024
Can LLMs Solve longer Math Word Problems Better?
Xin Xu
Tong Xiao
Zitong Chao
Zhenya Huang
Can Yang
Yang Wang
54
10
0
23 May 2024
DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction
Hao Chen
Biaojie Zeng
Xin Lin
Liang He
Aimin Zhou
LRM
25
0
0
20 May 2024
LoRA Learns Less and Forgets Less
D. Biderman
Jose Javier Gonzalez Ortiz
Jacob P. Portes
Mansheej Paul
Philip Greengard
...
Sam Havens
Vitaliy Chiley
Jonathan Frankle
Cody Blakeney
John P. Cunningham
CLL
18
109
0
15 May 2024
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context
Yunxin Li
Baotian Hu
Haoyuan Shi
Wei Wang
Longyue Wang
Min-Ling Zhang
LRM
22
4
0
08 May 2024
MAmmoTH2: Scaling Instructions from the Web
Xiang Yue
Tuney Zheng
Ge Zhang
Wenhu Chen
ALM
LRM
27
77
0
06 May 2024
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning
Jun Zhao
Jingqi Tong
Yurong Mou
Ming Zhang
Qi Zhang
Xuanjing Huang
LRM
31
1
0
05 May 2024
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Wenhao Zhu
Shujian Huang
Fei Yuan
Cheng Chen
Jiajun Chen
Alexandra Birch
LRM
31
5
0
02 May 2024
The Real, the Better: Aligning Large Language Models with Online Human Behaviors
Guanying Jiang
Lingyong Yan
Haibo Shi
Dawei Yin
20
1
0
01 May 2024
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models
Leonardo Ranaldi
André Freitas
LRM
ReLM
16
3
0
01 May 2024
Iterative Reasoning Preference Optimization
Richard Yuanzhe Pang
Weizhe Yuan
Kyunghyun Cho
He He
Sainbayar Sukhbaatar
Jason Weston
LRM
23
89
0
30 Apr 2024
Mixture-of-Instructions: Aligning Large Language Models via Mixture Prompting
Bowen Xu
Shaoyu Wu
Kai Liu
Lulu Hu
26
1
0
29 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL
KELM
LRM
21
47
0
25 Apr 2024
From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models
Qi He
Jie Zeng
Qianxi He
Jiaqing Liang
Yanghua Xiao
16
6
0
24 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRM
AIMat
42
3
0
23 Apr 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian
Baolin Peng
Linfeng Song
Lifeng Jin
Dian Yu
Haitao Mi
Dong Yu
LRM
ReLM
22
56
0
18 Apr 2024
Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
Hyeonbin Hwang
Doyoung Kim
Seungone Kim
Seonghyeon Ye
Minjoon Seo
LRM
ReLM
18
7
0
16 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDa
EgoV
25
75
0
11 Apr 2024
Improving Language Model Reasoning with Self-motivated Learning
Yunlong Feng
Yang Xu
Libo Qin
Yasheng Wang
Wanxiang Che
LRM
ReLM
21
6
0
10 Apr 2024