2405.16450
Cited By
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
26 May 2024
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
Papers citing
"Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search"
9 / 9 papers shown
Title
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
Tales H. Carvalho
Kenneth Tjhia
Levi H. S. Lelis
26
6
0
16 Oct 2024
COOL: Efficient and Reliable Chain-Oriented Objective Logic with Neural Networks Feedback Control for Program Synthesis
Jipeng Han
20
0
0
02 Oct 2024
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Vidhi Jain
Maria Attarian
Nikhil J. Joshi
Ayzaan Wahid
Danny Driess
...
Stefan Welker
Christine Chan
Igor Gilitschenski
Yonatan Bisk
Debidatta Dwibedi
65
27
0
19 Mar 2024
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt
Blazej Manczak
Auke Wiggers
Corrado Rainone
David W. Zhang
Michaël Defferrard
Taco S. Cohen
ReLM
LRM
41
17
0
07 Feb 2024
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Guan-Ting Liu
En-Pei Hu
Pu-Jen Cheng
Hung-yi Lee
Shao-Hua Sun
60
10
0
30 Jan 2023
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
116
232
0
05 Jul 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
A Systematic Evaluation of Large Language Models of Code
Frank F. Xu
Uri Alon
Graham Neubig
Vincent J. Hellendoorn
ELM
ALM
188
624
0
26 Feb 2022
Towards A Rigorous Science of Interpretable Machine Learning
Finale Doshi-Velez
Been Kim
XAI
FaML
219
2,098
0
28 Feb 2017