Meta-Gradient Reinforcement Learning

24 May 2018
Zhongwen Xu
Hado van Hasselt
David Silver
arXiv:1805.09801

Papers citing "Meta-Gradient Reinforcement Learning"

50 / 203 papers shown
Title
TorchOpt: An Efficient Library for Differentiable Optimization
Jie Ren
Xidong Feng
Bo Liu
Xuehai Pan
Yao Fu
Luo Mai
Yaodong Yang
37
11
0
13 Nov 2022
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
21
0
0
25 Oct 2022
Simple Emergent Action Representations from Multi-Task Policy Training
Pu Hua
Yubei Chen
Huazhe Xu
MLAU
22
6
0
18 Oct 2022
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
42
75
0
11 Oct 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
30
8
0
22 Sep 2022
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Mohammad Kachuee
Sungjin Lee
73
4
0
17 Sep 2022
Meta-Gradients in Non-Stationary Environments
Jelena Luketina
Sebastian Flennerhag
Yannick Schroecker
David Abel
Tom Zahavy
Satinder Singh
31
10
0
13 Sep 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
18
2
0
25 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
21
8
0
19 Jul 2022
On Provably Robust Meta-Bayesian Optimization
Zhongxiang Dai
Yizhou Chen
Haibin Yu
K. H. Low
Patrick Jaillet
AAML
28
10
0
14 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
29
18
0
07 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
28
33
0
07 Jun 2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Han Wang
Archit Sakhadeo
Adam White
James Bell
Vincent Liu
Xutong Zhao
Puer Liu
Tadashi Kozuno
Alona Fyshe
Martha White
OffRL
OnRL
22
7
0
18 May 2022
Learning to Transfer Role Assignment Across Team Sizes
D. Nguyen
Phuoc Nguyen
Svetha Venkatesh
T. Tran
13
7
0
17 Apr 2022
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
31
0
0
08 Apr 2022
Learning to Accelerate by the Methods of Step-size Planning
Hengshuai Yao
21
0
0
01 Apr 2022
Continuous-Time Meta-Learning with Forward Mode Differentiation
T. Deleu
David Kanaa
Leo Feng
Giancarlo Kerg
Yoshua Bengio
Guillaume Lajoie
Pierre-Luc Bacon
25
19
0
02 Mar 2022
A History of Meta-gradient: Gradient Methods for Meta-learning
R. Sutton
8
11
0
20 Feb 2022
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
29
2
0
20 Feb 2022
Meta Knowledge Distillation
Jihao Liu
Boxiao Liu
Hongsheng Li
Yu Liu
18
25
0
16 Feb 2022
Meta-Reinforcement Learning with Self-Modifying Networks
Mathieu Chalvidal
Thomas Serre
Rufin VanRullen
KELM
24
5
0
04 Feb 2022
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
23
2
0
17 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning
Ziyang Tang
Yihao Feng
Qiang Liu
OffRL
9
1
0
01 Jan 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
14
12
0
31 Dec 2021
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning
Jiachen Yang
Ethan Wang
Rakshit S. Trivedi
T. Zhao
H. Zha
30
21
0
20 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
25
26
0
16 Dec 2021
Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning
Juan Cruz Barsce
J. Palombarini
Ernesto C. Martínez
OffRL
33
1
0
15 Dec 2021
Episodic Policy Gradient Training
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDL
OffRL
25
6
0
03 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
23
18
0
02 Dec 2021
On the Practical Consistency of Meta-Reinforcement Learning Algorithms
Zheng Xiong
L. Zintgraf
Jacob Beck
Risto Vuorio
Shimon Whiteson
CLL
AIFin
31
9
0
01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
24
35
0
26 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
30
9
0
24 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
19
6
0
02 Nov 2021
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Clément Bonnet
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
14
6
0
30 Oct 2021
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
Yang Shu
Zhangjie Cao
Jing Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
32
11
0
14 Oct 2021
Learning to Learn End-to-End Goal-Oriented Dialog From Related Dialog Tasks
Janarthanan Rajendran
Jonathan K. Kummerfeld
Satinder Singh
35
0
0
10 Oct 2021
Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning
Sungyong Baik
Janghoon Choi
Heewon Kim
Dohee Cho
Jaesik Min
Kyoung Mu Lee
18
97
0
08 Oct 2021
Introducing Symmetries to Black Box Meta Reinforcement Learning
Louis Kirsch
Sebastian Flennerhag
Hado van Hasselt
A. Friesen
Junhyuk Oh
Yutian Chen
22
30
0
22 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
34
7
0
18 Sep 2021
Few-shot Quality-Diversity Optimization
Achkan Salehi
Alexandre Coninx
Stéphane Doncieux
19
14
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
28
6
0
13 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
38
59
0
09 Sep 2021
ETA Prediction with Graph Neural Networks in Google Maps
Austin Derrow-Pinion
Jennifer She
David Wong
O. Lange
Todd Hester
...
Ang Li
Zhongwen Xu
Alvaro Sanchez-Gonzalez
Yujia Li
Petar Veličković
AI4TS
16
224
0
25 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
32
66
0
08 Jul 2021
Meta-Reinforcement Learning for Heuristic Planning
Ricardo Luna Gutierrez
Matteo Leonetti
OffRL
AIFin
21
4
0
06 Jul 2021
Examining average and discounted reward optimality criteria in reinforcement learning
Vektor Dewanto
M. Gallagher
OffRL
9
16
0
03 Jul 2021
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL
Jack Parker-Holder
Vu Nguyen
Shaan Desai
Stephen J. Roberts
43
16
0
30 Jun 2021