ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.13816
  4. Cited By
Execution-based Code Generation using Deep Reinforcement Learning

Execution-based Code Generation using Deep Reinforcement Learning

31 January 2023
Parshin Shojaee
Aneesh Jain
Sindhu Tipirneni
Chandan K. Reddy
ArXivPDFHTML

Papers citing "Execution-based Code Generation using Deep Reinforcement Learning"

49 / 49 papers shown
Title
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback
Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback
N. Wang
Bingkun Yao
Jie Zhou
Yuchen Hu
Xi Wang
Nan Guan
Zhe Jiang
OffRL
22
0
0
22 Apr 2025
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Improving RL Exploration for LLM Reasoning through Retrospective Replay
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
22
0
0
19 Apr 2025
Iterative Self-Training for Code Generation via Reinforced Re-Ranking
Iterative Self-Training for Code Generation via Reinforced Re-Ranking
Nikita Sorokin
I. Sedykh
Valentin Malykh
27
0
0
13 Apr 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
Indraneil Paul
Haoyi Yang
Goran Glavas
Kristian Kersting
Iryna Gurevych
AAML
SyDa
34
0
0
27 Mar 2025
LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages
LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages
Patrick Diehl
Nojoud Nader
Maxim Moraru
Steven R. Brandt
36
1
0
24 Mar 2025
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models
M. Wong
C. Tan
ALM
80
3
0
19 Mar 2025
The KoLMogorov Test: Compression by Code Generation
The KoLMogorov Test: Compression by Code Generation
Ori Yoran
Kunhao Zheng
Fabian Gloeckle
Jonas Gehring
Gabriel Synnaeve
Taco Cohen
62
1
0
18 Mar 2025
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning
Yuan Jiang
Yujian Zhang
Liang Lu
Christoph Treude
Xiaohong Su
Shan Huang
Tiantian Wang
ALM
56
0
0
12 Mar 2025
Multi-Turn Code Generation Through Single-Step Rewards
Multi-Turn Code Generation Through Single-Step Rewards
A. Jain
Gonzalo Gonzalez-Pumariega
Wayne Chen
Alexander M. Rush
Wenting Zhao
Sanjiban Choudhury
LRM
47
1
0
27 Feb 2025
LeDex: Training LLMs to Better Self-Debug and Explain Code
LeDex: Training LLMs to Better Self-Debug and Explain Code
Nan Jiang
Xiaopeng Li
Shiqi Wang
Qiang Zhou
Soneya Binta Hossain
Baishakhi Ray
Varun Kumar
Xiaofei Ma
Anoop Deoras
LRM
86
10
0
17 Feb 2025
GenX: Mastering Code and Test Generation with Execution Feedback
GenX: Mastering Code and Test Generation with Execution Feedback
Nan Wang
Yafei Liu
Chen Chen
H. Lu
79
1
0
18 Dec 2024
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code
  to Improve Code LMs
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs
Zhihan Liu
Shenao Zhang
Yongfei Liu
Boyi Liu
Yingxiang Yang
Zhaoran Wang
111
2
0
20 Nov 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Ben He
...
Le Sun
Jie Lou
Bowen Yu
Y. Lu
Hongyu Lin
ALM
79
2
0
18 Nov 2024
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system
Zeyuan Li
Yangfan He
Lewei He
Jianhui Wang
Tianyu Shi
Bin Lei
Yuchen Li
Qiuwu Chen
ALM
51
5
0
28 Oct 2024
Process Supervision-Guided Policy Optimization for Code Generation
Process Supervision-Guided Policy Optimization for Code Generation
Ning Dai
Zheng Wu
Renjie Zheng
Ziyun Wei
Wenlei Shi
Xing Jin
Guanlin Liu
Chen Dun
Liang Huang
Lin Yan
51
7
0
23 Oct 2024
Exploring RL-based LLM Training for Formal Language Tasks with
  Programmed Rewards
Exploring RL-based LLM Training for Formal Language Tasks with Programmed Rewards
Alexander Padula
Dennis J. N. J. Soemers
OffRL
21
0
0
22 Oct 2024
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Hao Ma
Tianyi Hu
Zhiqiang Pu
Boyin Liu
Xiaolin Ai
Yanyan Liang
Min Chen
37
3
0
08 Oct 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai
Haoran Sun
Huang Fang
Shuohuan Wang
Yu Sun
Hua-Hong Wu
50
1
0
03 Oct 2024
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement
  Learning
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Jonas Gehring
Kunhao Zheng
Jade Copet
Vegard Mella
Taco Cohen
Gabriel Synnaeve
LLMAG
27
20
0
02 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
23
4
0
02 Oct 2024
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with
  Inverse-Instruct
InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct
Yutong Wu
Di Huang
Wenxuan Shi
Wei Wang
Lingzhe Gao
...
Qi Guo
Yewen Pu
Dawei Yin
Xing Hu
Yunji Chen
SyDa
18
1
0
08 Jul 2024
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement
  Learning in Training Code Large Language Models
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
Jie Chen
Xintian Han
Yu Ma
Xun Zhou
Liang Xiang
ALM
LRM
26
2
0
14 Jun 2024
RLSF: Reinforcement Learning via Symbolic Feedback
RLSF: Reinforcement Learning via Symbolic Feedback
Piyush Jha
Prithwish Jana
Arnav Arora
Vijay Ganesh
LRM
36
3
0
26 May 2024
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Parshin Shojaee
Kazem Meidani
Shashank Gupta
A. Farimani
Chandan K. Reddy
37
13
0
29 Apr 2024
CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement
  Learning for LLM-based Mutation
CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation
Jueon Eom
Seyeon Jeong
Taekyoung Kwon
13
7
0
19 Feb 2024
AST-T5: Structure-Aware Pretraining for Code Generation and
  Understanding
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Linyuan Gong
Mostafa Elhoushi
Alvin Cheung
26
11
0
05 Jan 2024
LLM-Assisted Code Cleaning For Training Accurate Code Generators
LLM-Assisted Code Cleaning For Training Accurate Code Generators
Naman Jain
Tianjun Zhang
Wei-Lin Chiang
Joseph E. Gonzalez
Koushik Sen
Ion Stoica
28
27
0
25 Nov 2023
Fine-Tuning Language Models Using Formal Methods Feedback
Fine-Tuning Language Models Using Formal Methods Feedback
Yunhao Yang
N. Bhatt
Tyler Ingebrand
William Ward
Steven Carr
Zhangyang Wang
Ufuk Topcu
8
8
0
27 Oct 2023
Fine-tune Language Models to Approximate Unbiased In-context Learning
Fine-tune Language Models to Approximate Unbiased In-context Learning
Timothy Chu
Zhao-quan Song
Chiwun Yang
22
15
0
05 Oct 2023
$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program
  Synthesis
B\mathcal{B}B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Zishun Yu
Yunzhe Tao
Liyu Chen
Tao Sun
Hongxia Yang
21
7
0
04 Oct 2023
Tool-Augmented Reward Modeling
Tool-Augmented Reward Modeling
Lei Li
Yekun Chai
Shuohuan Wang
Yu Sun
Hao Tian
Ningyu Zhang
Hua-Hong Wu
OffRL
33
13
0
02 Oct 2023
Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract
  Code Using Vulnerability-constrained Decoding
Efficient Avoidance of Vulnerabilities in Auto-completed Smart Contract Code Using Vulnerability-constrained Decoding
André Storhaug
Jingyue Li
Tianyuan Hu
AAML
15
14
0
18 Sep 2023
Deep Reinforcement Learning from Hierarchical Preference Design
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin
Yixiao Li
Pengcheng He
Tuo Zhao
12
0
0
06 Sep 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
39
10
0
28 Aug 2023
Reinforcement Learning for Generative AI: State of the Art,
  Opportunities and Open Research Challenges
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
24
19
0
31 Jul 2023
PanGu-Coder2: Boosting Large Language Models for Code with Ranking
  Feedback
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback
Bo Shen
Jiaxin Zhang
Taihong Chen
Daoguang Zan
Bing Geng
...
Ailun Yu
Jichuan Ji
Jingyang Zhao
Yuenan Guo
Qianxiang Wang
ALM
ELM
25
73
0
27 Jul 2023
Explaining Competitive-Level Programming Solutions using LLMs
Explaining Competitive-Level Programming Solutions using LLMs
Jierui Li
Szymon Tworkowski
Yingying Wu
Raymond J. Mooney
LRM
29
16
0
11 Jul 2023
RLTF: Reinforcement Learning from Unit Test Feedback
RLTF: Reinforcement Learning from Unit Test Feedback
Jiate Liu
Yiqin Zhu
Kaiwen Xiao
Qiang Fu
Xiao Han
Wei Yang
Deheng Ye
OffRL
39
53
0
10 Jul 2023
CoTran: An LLM-based Code Translator using Reinforcement Learning with
  Feedback from Compiler and Symbolic Execution
CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution
Prithwish Jana
Piyush Jha
Haoyang Ju
Gautham Kishore
Aryan Mahajan
Vijay Ganesh
19
12
0
11 Jun 2023
Neural Machine Translation for Code Generation
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
25
4
0
22 May 2023
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary
  Prospects, and Challenges
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary Prospects, and Challenges
Ziyuan Zhou
Guanjun Liu
Ying-Si Tang
28
14
0
17 May 2023
Transformer-based Planning for Symbolic Regression
Transformer-based Planning for Symbolic Regression
Parshin Shojaee
Kazem Meidani
A. Farimani
Chandan K. Reddy
36
33
0
13 Mar 2023
CodeRL: Mastering Code Generation through Pretrained Models and Deep
  Reinforcement Learning
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
121
232
0
05 Jul 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models
  of Source Code
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa
VLM
36
48
0
24 May 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for
  Code Understanding and Generation
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
Yue Wang
Weishi Wang
Shafiq R. Joty
S. Hoi
204
1,451
0
02 Sep 2021
Measuring Coding Challenge Competence With APPS
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
194
614
0
20 May 2021
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding
  and Generation
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Shuai Lu
Daya Guo
Shuo Ren
Junjie Huang
Alexey Svyatkovskiy
...
Nan Duan
Neel Sundaresan
Shao Kun Deng
Shengyu Fu
Shujie Liu
ELM
190
853
0
09 Feb 2021
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
214
7,687
0
17 Aug 2015
1