ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.09583
  4. Cited By
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

3 January 2025
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
Chongyang Tao
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
    OSLM
    LRM
ArXivPDFHTML

Papers citing "WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct"

50 / 338 papers shown
Title
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
Xiangyu Peng
Congying Xia
Xinyi Yang
Caiming Xiong
Chien-Sheng Wu
Chen Xing
LRM
32
2
0
03 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
23
3
0
02 Oct 2024
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter
  Merging
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
Yiming Ju
Ziyi Ni
Xingrun Xing
Zhixiong Zeng
hanyu Zhao
Siqi Fan
Zheng Zhang
MoMe
16
0
0
01 Oct 2024
Federated Instruction Tuning of LLMs with Domain Coverage Augmentation
Federated Instruction Tuning of LLMs with Domain Coverage Augmentation
Zezhou Wang
Yaxin Du
Zhuzhong Qian
Yugang Jiang
Zhuzhong Qian
Siheng Chen
FedML
30
0
0
30 Sep 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling
  Large Language Models
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALM
MQ
33
5
0
30 Sep 2024
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies
  for LLMs
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
Yung-Chieh Chan
George Pu
Apaar Shanker
Parth Suresh
Penn Jenks
John Heyer
Sam Denton
SyDa
11
3
0
29 Sep 2024
HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
Yu Zhou
Xingyu Wu
Jibin Wu
Liang Feng
Kay Chen Tan
MoMe
52
0
0
27 Sep 2024
IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation
Fan Lin
Shuyi Xie
Yong Dai
Wenlin Yao
Tianjiao Lang
Zishan Xu
Zhichao Hu
Xiao Xiao
Yuhong Liu
Yu Zhang
25
0
0
27 Sep 2024
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and
  Adaptive Disambiguate based Efficient Tree Search
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
Linzhuang Sun
Hao Liang
Jingxuan Wei
Bihui Yu
Conghui He
Zenan Zhou
Wentao Zhang
11
2
0
26 Sep 2024
CadVLM: Bridging Language and Vision in the Generation of Parametric CAD
  Sketches
CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches
Sifan Wu
Amir Khasahmadi
Mor Katz
P. Jayaraman
Yewen Pu
K. Willis
Bang Liu
3DV
16
2
0
26 Sep 2024
Enhancing elusive clues in knowledge learning by contrasting attention of language models
Enhancing elusive clues in knowledge learning by contrasting attention of language models
Jian Gao
Xiao Zhang
Ji Wu
Miao Li
33
0
0
26 Sep 2024
Beyond Accuracy Optimization: Computer Vision Losses for Large Language
  Model Fine-Tuning
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
Daniele Rege Cambrin
Giuseppe Gallipoli
Irene Benedetto
Luca Cagliero
Paolo Garza
18
0
0
20 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist
  Models
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Nuo Chen
Ning Wu
Jianhui Chang
Jia Li
13
0
0
20 Sep 2024
LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning
LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning
Jin Jiang
Yuchen Yan
Yang Liu
Yonggang Jin
Shuai Peng
M. Zhang
Xunliang Cai
Yixin Cao
Liangcai Gao
Zhi Tang
LRM
22
3
0
19 Sep 2024
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training
  Data
HexaCoder: Secure Code Generation via Oracle-Guided Synthetic Training Data
Hossein Hajipour
Lea Schönherr
Thorsten Holz
Mario Fritz
AAML
SyDa
16
0
0
10 Sep 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large
  Language Model
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Zhen Yang
Jinhao Chen
Zhengxiao Du
Wenmeng Yu
Weihan Wang
Wenyi Hong
Zhihuan Jiang
Bin Xu
Yuxiao Dong
Jie Tang
VLM
LRM
27
8
0
10 Sep 2024
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous
  Low-Rank Adaptations
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations
Ziyao Wang
Zheyu Shen
Yexiao He
Guoheng Sun
Hongyi Wang
Lingjuan Lyu
Ang Li
13
7
0
09 Sep 2024
Diagram Formalization Enhanced Multi-Modal Geometry Problem Solver
Diagram Formalization Enhanced Multi-Modal Geometry Problem Solver
Zeren Zhang
Jo-Ku Cheng
Jingyang Deng
Lu Tian
Jinwen Ma
Ziran Qin
Xiaokai Zhang
Na Zhu
Tuo Leng
21
0
0
06 Sep 2024
LongRecipe: Recipe for Efficient Long Context Generalization in Large
  Language Models
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
Zhiyuan Hu
Yuliang Liu
Jinman Zhao
Suyuchen Wang
Yan Wang
...
Qing Gu
Anh Tuan Luu
See-Kiong Ng
Zhiwei Jiang
Bryan Hooi
33
5
0
31 Aug 2024
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language
  Models
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Shuai Peng
Di Fu
Liangcai Gao
Xiuqin Zhong
Hongguang Fu
Zhi Tang
MLLM
VLM
LRM
11
1
0
30 Aug 2024
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large
  Language Models
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Dian Yu
Baolin Peng
Ye Tian
Linfeng Song
Haitao Mi
Dong Yu
ALM
LRM
28
0
0
28 Aug 2024
LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction
LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction
Songwei Li
Jie Feng
Jiawei Chi
Xinyuan Hu
Xiaomeng Zhao
Fengli Xu
16
3
0
23 Aug 2024
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free
  Curricular Meaningful Learning
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Kai Xiong
Xiao Ding
Li Du
Jiahao Ying
Ting Liu
Bing Qin
Yixin Cao
21
1
0
21 Aug 2024
Benchmarking Large Language Models for Math Reasoning Tasks
Benchmarking Large Language Models for Math Reasoning Tasks
Kathrin Seßler
Yao Rong
Emek Gözlüklü
Enkelejda Kasneci
LRM
20
0
0
20 Aug 2024
Towards Efficient Large Language Models for Scientific Text: A Review
Towards Efficient Large Language Models for Scientific Text: A Review
H. To
Ming Liu
Guangyan Huang
25
0
0
20 Aug 2024
Revisiting the Graph Reasoning Ability of Large Language Models: Case Studies in Translation, Connectivity and Shortest Path
Revisiting the Graph Reasoning Ability of Large Language Models: Case Studies in Translation, Connectivity and Shortest Path
Xinnan Dai
Qihao Wen
Yifei Shen
Hongzhi Wen
Dongsheng Li
Jiliang Tang
Caihua Shan
LRM
29
3
0
18 Aug 2024
Can Large Language Models Understand Symbolic Graphics Programs?
Can Large Language Models Understand Symbolic Graphics Programs?
Zeju Qiu
Weiyang Liu
Haiwen Feng
Zhen Liu
Tim Z. Xiao
Katherine M. Collins
J. Tenenbaum
Adrian Weller
Michael J. Black
Bernhard Schölkopf
28
4
0
15 Aug 2024
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language
  Models via Weight Disentanglement
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Le Yu
Bowen Yu
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
27
5
0
06 Aug 2024
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
Mengkang Hu
Yixiao Wang
Can Xu
Lingfeng Sun
Chensheng Peng
T. Hannagan
Nicola Poerio
Saravan Rajmohan
LM&Ro
LLMAG
49
14
0
01 Aug 2024
Effective Large Language Model Debugging with Best-first Tree Search
Effective Large Language Model Debugging with Best-first Tree Search
Jialin Song
Jonathan Raiman
Bryan Catanzaro
LRM
25
0
0
26 Jul 2024
Self-Training with Direct Preference Optimization Improves
  Chain-of-Thought Reasoning
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM
AI4CE
32
7
1
25 Jul 2024
Weak-to-Strong Reasoning
Weak-to-Strong Reasoning
Yuqing Yang
Yan Ma
Pengfei Liu
LRM
23
13
0
18 Jul 2024
COMET: "Cone of experience" enhanced large multimodal model for
  mathematical problem generation
COMET: "Cone of experience" enhanced large multimodal model for mathematical problem generation
Sannyuya Liu
Jintian Feng
Zongkai Yang
Yawei Luo
Qian Wan
Xiaoxuan Shen
Jianwen Sun
33
3
0
16 Jul 2024
Key-Point-Driven Mathematical Reasoning Distillation of Large Language
  Model
Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Xunyu Zhu
Jian Li
Can Ma
Weiping Wang
LRM
23
0
0
14 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
H. Song
SyDa
18
4
0
12 Jul 2024
Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models
Token-Supervised Value Models for Enhancing Mathematical Problem-Solving Capabilities of Large Language Models
Jung Hyun Lee
June Yong Yang
Byeongho Heo
Dongyoon Han
Kang Min Yoo
Eunho Yang
Kang Min Yoo
LRM
22
0
0
12 Jul 2024
Self-training Language Models for Arithmetic Reasoning
Self-training Language Models for Arithmetic Reasoning
Marek Kadlcík
Michal Štefánik
KELM
ReLM
OffRL
LRM
27
0
0
11 Jul 2024
ElecBench: a Power Dispatch Evaluation Benchmark for Large Language
  Models
ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models
Xiyuan Zhou
Huan Zhao
Yuheng Cheng
Yuji Cao
Gaoqi Liang
Guolong Liu
Wenxuan Liu
Yan Xu
Junhua Zhao
ELM
18
3
0
07 Jul 2024
Solving for X and Beyond: Can Large Language Models Solve Complex Math
  Problems with More-Than-Two Unknowns?
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
Kuei-Chun Kao
Ruochen Wang
Cho-Jui Hsieh
ELM
LRM
22
2
0
06 Jul 2024
Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction
  Tuning for Large Language Model
Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model
Xia Hou
Qifeng Li
Jian Yang
Tongliang Li
Linzheng Chai
...
Hangyuan Ji
Zhoujun Li
Jixuan Nie
Jingbo Dun
Wenfeng Song
20
2
0
03 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods,
  Evaluation, and Application
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
31
3
0
02 Jul 2024
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large
  Language Models
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Yiyuan Li
Shichao Sun
Pengfei Liu
LRM
33
0
0
01 Jul 2024
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical
  Reasoning
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning
Zimu Lu
Aojun Zhou
Ke Wang
Houxing Ren
Weikang Shi
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
16
21
0
30 Jun 2024
Advancing Process Verification for Large Language Models via Tree-Based
  Preference Learning
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning
Mingqian He
Yongliang Shen
Wenqi Zhang
Zeqi Tan
Weiming Lu
LRM
22
5
0
29 Jun 2024
LiteSearch: Efficacious Tree Search for LLM
LiteSearch: Efficacious Tree Search for LLM
Ante Wang
Linfeng Song
Ye Tian
Baolin Peng
Dian Yu
Haitao Mi
Jinsong Su
Dong Yu
31
14
0
29 Jun 2024
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
Sujan Dutta
Sayantan Mahinder
R. Anantha
Bortik Bandyopadhyay
ALM
16
2
0
28 Jun 2024
Understand What LLM Needs: Dual Preference Alignment for
  Retrieval-Augmented Generation
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
Guanting Dong
Yutao Zhu
Chenghao Zhang
Zechen Wang
Zhicheng Dou
Ji-Rong Wen
RALM
27
3
0
26 Jun 2024
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
  LLMs
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Xin Lai
Zhuotao Tian
Yukang Chen
Senqiao Yang
Xiangru Peng
Jiaya Jia
LRM
36
27
0
26 Jun 2024
CharED: Character-wise Ensemble Decoding for Large Language Models
CharED: Character-wise Ensemble Decoding for Large Language Models
Kevin Gu
Eva Tuecke
Dmitriy Katz
R. Horesh
David Alvarez-Melis
Mikhail Yurochkin
23
2
0
25 Jun 2024
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large
  Language Models
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Wenhao Shi
Zhiqiang Hu
Yi Bin
Junhua Liu
Yang Yang
See-Kiong Ng
Lidong Bing
Roy Ka-Wei Lee
SyDa
MLLM
LRM
19
14
0
25 Jun 2024
Previous
1234567
Next