Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.11489
Cited By
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
20 September 2023
Tianbao Xie
Siheng Zhao
Chen Henry Wu
Yitao Liu
Qian Luo
Victor Zhong
Yanchao Yang
Tao Yu
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
33 / 33 papers shown
Title
TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations
Shuaiyi Huang
Mara Levy
Anubhav Gupta
Daniel Ekpo
Ruijie Zheng
Abhinav Shrivastava
23
0
0
09 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian-jun Sun
Gang Wang
AI4CE
52
0
0
08 May 2025
An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination
Dixiao Wei
Peng Yi
Jinlong Lei
Yiguang Hong
Yuchuan Du
69
0
0
28 Apr 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Y. Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
75
3
0
26 Apr 2025
Learning Explainable Dense Reward Shapes via Bayesian Optimization
Ryan Koo
Ian Yang
Vipul Raheja
Mingyi Hong
Kwang-Sung Jun
Dongyeop Kang
26
0
0
22 Apr 2025
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
69
0
0
21 Apr 2025
GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill
Jieming Cui
Tengyu Liu
Ziyu Meng
Jiale Yu
Ran Song
Wei Zhang
Yixin Zhu
Siyuan Huang
VLM
46
1
0
05 Apr 2025
Grounding Multimodal LLMs to Embodied Agents that Ask for Help with Reinforcement Learning
Ram Ramrakhya
Matthew Chang
Xavier Puig
Ruta Desai
Z. Kira
Roozbeh Mottaghi
LLMAG
LM&Ro
64
0
0
01 Apr 2025
LERO: LLM-driven Evolutionary framework with Hybrid Rewards and Enhanced Observation for Multi-Agent Reinforcement Learning
Yuan Wei
Xiaohan Shan
Jianmin Li
31
0
0
25 Mar 2025
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning
Chan Kim
Seung-Woo Seo
Seong-Woo Kim
OODD
110
0
0
21 Mar 2025
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space
Zhiliang Chen
Xinyuan Niu
Chuan-Sheng Foo
Bryan Kian Hsiang Low
53
1
0
14 Mar 2025
LuciBot: Automated Robot Policy Learning from Generated Videos
Xiaowen Qiu
Yian Wang
Jiting Cai
Zhehuan Chen
Chunru Lin
Tsun-Hsuan Wang
Chuang Gan
LM&Ro
VGen
67
0
0
12 Mar 2025
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza
Nicklas Hansen
Stone Tao
Tongzhou Mu
H. Su
OffRL
58
0
0
03 Mar 2025
Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions
Guanwen Xie
Jingzehua Xu
Yimian Ding
Zhi Zhang
Shuai Zhang
Y. Li
37
0
0
01 Mar 2025
Evolving Symbolic 3D Visual Grounder with Weakly Supervised Reflection
Boyu Mi
Hanqing Wang
Tai Wang
Yilun Chen
Jiangmiao Pang
67
0
0
21 Feb 2025
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma
Niclas Boehmer
Lingkai Kong
Milind Tambe
69
2
0
17 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
32
12
0
31 Dec 2024
Evaluating Vision-Language Models as Evaluators in Path Planning
Mohamed Aghzal
Xiang Yue
E. Plaku
Ziyu Yao
LRM
72
1
0
27 Nov 2024
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning
Shengjie Sun
Runze Liu
Jiafei Lyu
J. Yang
L. Zhang
Xiu Li
LRM
19
7
0
18 Oct 2024
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos
Harsh Mahesheka
Zhixian Xie
Z. Wang
Wanxin Jin
29
0
0
11 Oct 2024
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge
Xiefeng Wu
OffRL
29
1
0
02 Oct 2024
Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Miyu Goko
Motonari Kambara
Daichi Saito
Seitaro Otsuki
Komei Sugiura
24
2
0
01 Oct 2024
Tulip Agent -- Enabling LLM-Based Agents to Solve Tasks Using Large Tool Libraries
Felix Ocker
Daniel Tanneberg
Julian Eggert
Michael Gienger
LLMAG
LM&Ro
VLM
42
5
0
31 Jul 2024
Text-Aware Diffusion for Policy Learning
Calvin Luo
Mandy He
Zilai Zeng
Chen Sun
23
4
0
02 Jul 2024
Language-Guided Object-Centric Diffusion Policy for Generalizable and Collision-Aware Robotic Manipulation
Hang Li
Qian Feng
Zhi Zheng
Jianxiang Feng
Zhaopeng Chen
Alois Knoll
21
1
0
29 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Jannis Blüml
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro
LRM
34
1
0
24 Jun 2024
HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
Quentin Delfosse
Jannis Blüml
Bjarne Gregori
Kristian Kersting
31
7
0
06 Jun 2024
REvolve: Reward Evolution with Large Language Models using Human Feedback
Rishi Hazra
Alkis Sygkounas
A. Persson
Amy Loutfi
Pedro Zuidberg Dos Martires
25
3
0
03 Jun 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
27
48
0
30 Mar 2024
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning
Yiwen Chen
Yuyao Ye
Ziyi Chen
Chuheng Zhang
Marcelo H. Ang
25
0
0
23 Mar 2024
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
D. Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
112
619
0
22 Sep 2022
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning
Hung Le
Yue Wang
Akhilesh Deepak Gotmare
Silvio Savarese
S. Hoi
SyDa
ALM
124
236
0
05 Jul 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,402
0
28 Jan 2022
1