Using Natural Language for Reward Shaping in Reinforcement Learning


5 March 2019
Prasoon Goyal
S. Niekum
Raymond J. Mooney
    LM&Ro

Papers citing "Using Natural Language for Reward Shaping in Reinforcement Learning"

50 / 109 papers shown
Emotion Recognition Using Convolutional Neural Networks
Shaoyuan Xu
Yang Cheng
Qian Lin
J. Allebach
34
0
0
03 Apr 2025
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
In-Chang Baek
Sung-Hyun Kim
Seo-Young Lee
Dong-Hyeun Kim
Kyung-Joong Kim
56
0
0
16 Mar 2025
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
Pusen Dong
Tianchen Zhu
Yue Qiu
Haoyi Zhou
Jianxin Li
78
1
0
24 Feb 2025
Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment
Kartik Nagpal
Dayi Dong
Jean-Baptiste Bouvier
Negar Mehr
LLMAG
49
1
0
24 Feb 2025
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Negin Hashemi Dijujin
Seyed Roozbeh Razavi Rohani
Mohammad Samiei
M. Baghshah
53
0
0
28 Jan 2025
Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards
Shresth Verma
Niclas Boehmer
Lingkai Kong
Milind Tambe
69
2
0
17 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
44
12
0
31 Dec 2024
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives
Armin Saghafian
Amirmohammad Izadi
Negin Hashemi Dijujin
M. Baghshah
64
0
0
29 Nov 2024
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
31
1
0
08 Nov 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Jiajun Xi
Yinong He
Jianing Yang
Yinpei Dai
Joyce Chai
LM&Ro
24
2
0
31 Oct 2024
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
29
0
0
16 Oct 2024
Trajectory Improvement and Reward Learning from Comparative Language Feedback
Zhaojing Yang
Miru Jun
J. Tien
Stuart J. Russell
Anca Dragan
Erdem Bıyık
32
5
0
08 Oct 2024
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
Mathias Jackermeier
Alessandro Abate
OffRL
40
1
0
06 Oct 2024
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu
Qiayuan Liao
Zhongyu Li
K. Sreenath
Negar Mehr
LM&Ro
156
3
0
27 Sep 2024
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
Eduardo Pignatelli
Johan Ferret
Tim Rocktäschel
Edward Grefenstette
Davide Paglieri
Samuel Coward
Laura Toni
38
2
0
19 Sep 2024
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
34
3
0
23 Jul 2024
How Much Progress Did I Make? An Unexplored Human Feedback Signal for Teaching Robots
Hang Yu
Qidi Fang
Shijie Fang
Reuben M. Aronson
E. Short
25
0
0
08 Jul 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
35
0
0
08 Jul 2024
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Fuxiang Zhang
Junyou Li
Yi-Chen Li
Zongzhang Zhang
Yang Yu
Deheng Ye
OffRL
KELM
47
1
0
04 Jul 2024
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
Gabriel H. Sarch
Lawrence Jang
Michael J. Tarr
William W. Cohen
Kenneth Marino
Katerina Fragkiadaki
LLMAG
47
0
0
20 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
42
1
0
09 Jun 2024
A Generalized Apprenticeship Learning Framework for Modeling Heterogeneous Student Pedagogical Strategies
Md Mirajul Islam
Xi Yang
J. Hostetter
Adittya Soukarjya Saha
Min Chi
23
1
0
04 Jun 2024
Reward Machines for Deep RL in Noisy and Uncertain Environments
Andrew C. Li
Zizhao Chen
Toryn Q. Klassen
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
48
6
0
31 May 2024
A Design Trajectory Map of Human-AI Collaborative Reinforcement Learning Systems: Survey and Taxonomy
Zhaoxing Li
22
2
0
16 May 2024
Natural Language Can Help Bridge the Sim2Real Gap
Albert Yu
Adeline Foote
Raymond J. Mooney
Roberto Martín-Martín
LM&Ro
43
11
0
16 May 2024
NARRATE: Versatile Language Architecture for Optimal Control in Robotics
Seif Ismail
Antonio Arbues
Ryan Cotterell
René Zurbrügg
Carmen Amo Alonso
LM&Ro
24
4
0
16 Mar 2024
Learning with Language-Guided State Abstractions
Andi Peng
Ilia Sucholutsky
Belinda Z. Li
T. Sumers
Thomas L. Griffiths
Jacob Andreas
Julie A. Shah
LM&Ro
46
13
0
28 Feb 2024
Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning
Jinyeob Kim
Sumin Kang
Sungwoo Yang
Beomjoon Kim
Jargalbaatar Yura
Donghan Kim
118
1
0
22 Feb 2024
Potential-Based Reward Shaping For Intrinsic Motivation
Grant C. Forbes
Nitish Gupta
Leonardo Villalobos-Arias
Colin M. Potts
Arnav Jhala
David L. Roberts
6
4
0
12 Feb 2024
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
23
2
0
10 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
18
7
0
02 Feb 2024
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Xingzhou Lou
Junge Zhang
Ziyan Wang
Kaiqi Huang
Yali Du
38
3
0
15 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
19
4
0
14 Jan 2024
RePLan: Robotic Replanning with Perception and Language Models
Marta Skreta
Zihan Zhou
Jia Lin Yuan
Kourosh Darvish
Alán Aspuru-Guzik
Animesh Garg
LM&Ro
LRM
40
26
0
08 Jan 2024
GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
Chaitanya Kharyal
S. Gottipati
Tanmay Kumar Sinha
Srijita Das
Matthew E. Taylor
LLMAG
13
1
0
03 Jan 2024
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Kun Shao
Haitham Bou-Ammar
Jun Wang
30
18
0
22 Dec 2023
FoMo Rewards: Can we cast foundation models as reward functions?
Ekdeep Singh Lubana
Johann Brehmer
P. D. Haan
Taco S. Cohen
OffRL
LRM
48
2
0
06 Dec 2023
Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning
Yaoquan Wei
Shunyu Liu
Jie Song
Tongya Zheng
Kaixuan Chen
Yong Wang
Mingli Song
25
0
0
28 Nov 2023
LLM Augmented Hierarchical Agents
Bharat Prakash
Tim Oates
T. Mohsenin
16
4
0
09 Nov 2023
Make a Donut: Hierarchical EMD-Space Planning for Zero-Shot Deformable Manipulation with Tools
Yang You
Bokui Shen
Congyue Deng
Haoran Geng
Songlin Wei
He-Nan Wang
Leonidas J. Guibas
26
1
0
05 Nov 2023
Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models
Kun-Mo Chu
Xufeng Zhao
C. Weber
Mengdi Li
Stefan Wermter
LLMAG
LM&Ro
36
14
0
04 Nov 2023
$f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$-Divergences
Siddhant Agarwal
Ishan Durugkar
Peter Stone
Amy Zhang
31
4
0
10 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
114
159
0
04 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
45
29
0
03 Oct 2023
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding
Devleena Das
Sonia Chernova
Been Kim
LRM
LLMAG
28
22
0
21 Sep 2023
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language
Zhourui Guo
Meng Yao
Yang Yu
Qiyue Yin
OnRL
28
1
0
21 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
20
9
0
19 Sep 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
47
470
0
27 Jul 2023
Improving Long-Horizon Imitation Through Instruction Prediction
Joey Hejna
Pieter Abbeel
Lerrel Pinto
14
7
0
21 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
21
9
0
15 Jun 2023