ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.06459
  4. Cited By
On Learning Intrinsic Rewards for Policy Gradient Methods

On Learning Intrinsic Rewards for Policy Gradient Methods

17 April 2018
Zeyu Zheng
Junhyuk Oh
Satinder Singh
ArXivPDFHTML

Papers citing "On Learning Intrinsic Rewards for Policy Gradient Methods"

50 / 50 papers shown
Title
Fairness in Reinforcement Learning with Bisimulation Metrics
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
D. Meger
85
0
0
03 Jan 2025
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications
Sinan Ibrahim
Mostafa Mostafa
Ali Jnadi
Hadi Salloum
Pavel Osinenko
OffRL
52
12
0
31 Dec 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
34
6
0
06 Aug 2024
Automatic Environment Shaping is the Next Frontier in RL
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
40
3
0
23 Jul 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function
  Design for Safe Reinforcement Learning via Large Language Model
E2CFD\mathrm{E^{2}CFD}E2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
40
0
0
08 Jul 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
46
2
0
30 May 2024
Informativeness of Reward Functions in Reinforcement Learning
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
29
2
0
10 Feb 2024
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy
  Learning
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
26
0
0
06 Dec 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
37
4
0
06 Nov 2023
Learning How to Propagate Messages in Graph Neural Networks
Learning How to Propagate Messages in Graph Neural Networks
Teng Xiao
Zhengyu Chen
Donglin Wang
Suhang Wang
GNN
34
76
0
01 Oct 2023
PIMbot: Policy and Incentive Manipulation for Multi-Robot Reinforcement
  Learning in Social Dilemmas
PIMbot: Policy and Incentive Manipulation for Multi-Robot Reinforcement Learning in Social Dilemmas
Shahab Nikkhoo
Zexin Li
Aritra Samanta
Yufei Li
Cong Liu
35
6
0
29 Jul 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement
  Learning
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
26
8
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
Evaluating the Perceived Safety of Urban City via Maximum Entropy Deep
  Inverse Reinforcement Learning
Evaluating the Perceived Safety of Urban City via Maximum Entropy Deep Inverse Reinforcement Learning
Yaxuan Wang
Zhixin Zeng
Qijun Zhao
13
0
0
19 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer
  Value Function
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
24
1
0
19 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
36
24
0
14 Nov 2022
Reward Shaping Using Convolutional Neural Network
Reward Shaping Using Convolutional Neural Network
Hani Sami
Hadi Otrok
Jamal Bentahar
Azzam Mourad
Ernesto Damiani
26
3
0
30 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman Serdar Kozat
18
4
0
01 Oct 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
27
8
0
22 Sep 2022
Meta-Gradients in Non-Stationary Environments
Meta-Gradients in Non-Stationary Environments
Jelena Luketina
Sebastian Flennerhag
Yannick Schroecker
David Abel
Tom Zahavy
Satinder Singh
31
10
0
13 Sep 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
25
28
0
04 Apr 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement
  Learning
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning
Jiachen Yang
Ethan Wang
Rakshit S. Trivedi
T. Zhao
H. Zha
30
21
0
20 Dec 2021
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent
  Learning
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
39
18
0
05 Dec 2021
Wasserstein Distance Maximizing Intrinsic Control
Wasserstein Distance Maximizing Intrinsic Control
Ishan Durugkar
Steven Hansen
Stephen Spencer
Volodymyr Mnih
18
6
0
28 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Bootstrapped Meta-Learning
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
38
58
0
09 Sep 2021
Toward a `Standard Model' of Machine Learning
Toward a `Standard Model' of Machine Learning
Zhiting Hu
Eric P. Xing
34
12
0
17 Aug 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
32
11
0
08 Jun 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent
  Reinforcement Learning
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning
Jianzhun Shao
Hongchang Zhang
Yuhang Jiang
Shuncheng He
Xiangyang Ji
29
5
0
24 Feb 2021
Discovery of Options via Meta-Learned Subgoals
Discovery of Options via Meta-Learned Subgoals
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
26
33
0
12 Feb 2021
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
11
173
0
05 Nov 2020
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Hassam Sheikh
Shauharda Khadka
Santiago Miret
Somdeb Majumdar
OffRL
21
7
0
08 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
16
7
0
04 Oct 2020
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero
Roberta Raileanu
Heinrich Küttler
J. Tenenbaum
Tim Rocktaschel
Edward Grefenstette
38
125
0
22 Jun 2020
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
55
1,928
0
11 Apr 2020
Online Meta-Learning for Multi-Source and Semi-Supervised Domain
  Adaptation
Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation
Da Li
Timothy M. Hospedales
22
102
0
09 Apr 2020
Learning to Ask Medical Questions using Reinforcement Learning
Learning to Ask Medical Questions using Reinforcement Learning
Uri Shaham
Tom Zahavy
C. Caraballo
S. Mahajan
D. Massey
H. Krumholz
OOD
24
1
0
31 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
30
46
0
11 Mar 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
How Should an Agent Practice?
How Should an Agent Practice?
Janarthanan Rajendran
Richard L. Lewis
Vivek Veeriah
Honglak Lee
Satinder Singh
26
9
0
15 Dec 2019
Learning Data Manipulation for Augmentation and Weighting
Learning Data Manipulation for Augmentation and Weighting
Zhiting Hu
Bowen Tan
Ruslan Salakhutdinov
Tom Michael Mitchell
Eric P. Xing
24
116
0
28 Oct 2019
Discovery of Useful Questions as Auxiliary Tasks
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
14
86
0
10 Sep 2019
Learning to Generalize from Sparse and Underspecified Rewards
Learning to Generalize from Sparse and Underspecified Rewards
Rishabh Agarwal
Chen Liang
Dale Schuurmans
Mohammad Norouzi
OffRL
43
97
0
19 Feb 2019
Provably Efficient Maximum Entropy Exploration
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
25
292
0
06 Dec 2018
Meta-Gradient Reinforcement Learning
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
20
324
0
24 May 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
362
11,700
0
09 Mar 2017
1