ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.16450
  4. Cited By
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search

26 May 2024
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
ArXivPDFHTML

Papers citing "Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search"

9 / 9 papers shown
Title
Reclaiming the Source of Programmatic Policies: Programmatic versus
  Latent Spaces
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
Tales H. Carvalho
Kenneth Tjhia
Levi H. S. Lelis
26
6
0
16 Oct 2024
COOL: Efficient and Reliable Chain-Oriented Objective Logic with Neural Networks Feedback Control for Program Synthesis
COOL: Efficient and Reliable Chain-Oriented Objective Logic with Neural Networks Feedback Control for Program Synthesis
Jipeng Han
18
0
0
02 Oct 2024
Vid2Robot: End-to-end Video-conditioned Policy Learning with
  Cross-Attention Transformers
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Vidhi Jain
Maria Attarian
Nikhil J. Joshi
Ayzaan Wahid
Danny Driess
...
Stefan Welker
Christine Chan
Igor Gilitschenski
Yonatan Bisk
Debidatta Dwibedi
65
27
0
19 Mar 2024
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt
Blazej Manczak
Auke Wiggers
Corrado Rainone
David W. Zhang
Michaël Defferrard
Taco S. Cohen
ReLM
LRM
41
17
0
07 Feb 2024
Hierarchical Programmatic Reinforcement Learning via Learning to Compose
  Programs
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Guanhui. Liu
En-Pei Hu
Pu-Jen Cheng
Hung-yi Lee
Shao-Hua Sun
58
10
0
30 Jan 2023
CodeRL: Mastering Code Generation through Pretrained Models and Deep
  Reinforcement Learning
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
116
232
0
05 Jul 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
A Systematic Evaluation of Large Language Models of Code
A Systematic Evaluation of Large Language Models of Code
Frank F. Xu
Uri Alon
Graham Neubig
Vincent J. Hellendoorn
ELM
ALM
185
624
0
26 Feb 2022
Towards A Rigorous Science of Interpretable Machine Learning
Towards A Rigorous Science of Interpretable Machine Learning
Finale Doshi-Velez
Been Kim
XAI
FaML
219
2,098
0
28 Feb 2017
1