1805.09801
Cited By
Meta-Gradient Reinforcement Learning
24 May 2018
Zhongwen Xu
H. V. Hasselt
David Silver
Papers citing
"Meta-Gradient Reinforcement Learning"
50 / 203 papers shown
TorchOpt: An Efficient Library for Differentiable Optimization
Jie Ren
Xidong Feng
Bo Liu
Xuehai Pan
Yao Fu
Luo Mai
Yaodong Yang
37
11
0
13 Nov 2022
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
21
0
0
25 Oct 2022
Simple Emergent Action Representations from Multi-Task Policy Training
Pu Hua
Yubei Chen
Huazhe Xu
MLAU
22
6
0
18 Oct 2022
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
42
75
0
11 Oct 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
30
8
0
22 Sep 2022
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Mohammad Kachuee
Sungjin Lee
73
4
0
17 Sep 2022
Meta-Gradients in Non-Stationary Environments
Jelena Luketina
Sebastian Flennerhag
Yannick Schroecker
David Abel
Tom Zahavy
Satinder Singh
31
10
0
13 Sep 2022
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations
Achkan Salehi
Steffen Rühl
Stéphane Doncieux
AI4CE
18
2
0
25 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
21
8
0
19 Jul 2022
On Provably Robust Meta-Bayesian Optimization
Zhongxiang Dai
Yizhou Chen
Haibin Yu
K. H. Low
Patrick Jaillet
AAML
28
10
0
14 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
29
18
0
07 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
28
33
0
07 Jun 2022
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Han Wang
Archit Sakhadeo
Adam White
James Bell
Vincent Liu
Xutong Zhao
Puer Liu
Tadashi Kozuno
Alona Fyshe
Martha White
OffRL
OnRL
22
7
0
18 May 2022
Learning to Transfer Role Assignment Across Team Sizes
D. Nguyen
Phuoc Nguyen
Svetha Venkatesh
T. Tran
13
7
0
17 Apr 2022
Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability
Juan Jose Garau-Luis
Yingjie Miao
John D. Co-Reyes
Aaron T Parisi
Jie Tan
Esteban Real
Aleksandra Faust
31
0
0
08 Apr 2022
Learning to Accelerate by the Methods of Step-size Planning
Hengshuai Yao
21
0
0
01 Apr 2022
Continuous-Time Meta-Learning with Forward Mode Differentiation
T. Deleu
David Kanaa
Leo Feng
Giancarlo Kerg
Yoshua Bengio
Guillaume Lajoie
Pierre-Luc Bacon
25
19
0
02 Mar 2022
A History of Meta-gradient: Gradient Methods for Meta-learning
R. Sutton
8
11
0
20 Feb 2022
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
29
2
0
20 Feb 2022
Meta Knowledge Distillation
Jihao Liu
Boxiao Liu
Hongsheng Li
Yu Liu
18
25
0
16 Feb 2022
Meta-Reinforcement Learning with Self-Modifying Networks
Mathieu Chalvidal
Thomas Serre
Rufin VanRullen
KELM
24
5
0
04 Feb 2022
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
23
2
0
17 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning
Ziyang Tang
Yihao Feng
Qiang Liu
OffRL
9
1
0
01 Jan 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
14
12
0
31 Dec 2021
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning
Jiachen Yang
Ethan Wang
Rakshit S. Trivedi
T. Zhao
H. Zha
30
21
0
20 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
25
26
0
16 Dec 2021
Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning
Juan Cruz Barsce
J. Palombarini
Ernesto C. Martínez
OffRL
33
1
0
15 Dec 2021
Episodic Policy Gradient Training
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDL
OffRL
25
6
0
03 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
23
18
0
02 Dec 2021
On the Practical Consistency of Meta-Reinforcement Learning Algorithms
Zheng Xiong
L. Zintgraf
Jacob Beck
Risto Vuorio
Shimon Whiteson
CLL
AIFin
31
9
0
01 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
24
35
0
26 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
30
9
0
24 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
19
6
0
02 Nov 2021
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Clément Bonnet
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
14
6
0
30 Oct 2021
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
Yang Shu
Zhangjie Cao
Jing Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
32
11
0
14 Oct 2021
Learning to Learn End-to-End Goal-Oriented Dialog From Related Dialog Tasks
Janarthanan Rajendran
Jonathan K. Kummerfeld
Satinder Singh
35
0
0
10 Oct 2021
Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning
Sungyong Baik
Janghoon Choi
Heewon Kim
Dohee Cho
Jaesik Min
Kyoung Mu Lee
18
97
0
08 Oct 2021
Introducing Symmetries to Black Box Meta Reinforcement Learning
Louis Kirsch
Sebastian Flennerhag
Hado van Hasselt
A. Friesen
Junhyuk Oh
Yutian Chen
22
30
0
22 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
34
7
0
18 Sep 2021
Few-shot Quality-Diversity Optimization
Achkan Salehi
Alexandre Coninx
Stéphane Doncieux
19
14
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
28
6
0
13 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
38
59
0
09 Sep 2021
ETA Prediction with Graph Neural Networks in Google Maps
Austin Derrow-Pinion
Jennifer She
David Wong
O. Lange
Todd Hester
...
Ang Li
Zhongwen Xu
Alvaro Sanchez-Gonzalez
Yujia Li
Petar Veličković
AI4TS
16
224
0
25 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
32
66
0
08 Jul 2021
Meta-Reinforcement Learning for Heuristic Planning
Ricardo Luna Gutierrez
Matteo Leonetti
OffRL
AIFin
21
4
0
06 Jul 2021
Examining average and discounted reward optimality criteria in reinforcement learning
Vektor Dewanto
M. Gallagher
OffRL
9
16
0
03 Jul 2021
Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL
Jack Parker-Holder
Vu Nguyen
Shaan Desai
Stephen J. Roberts
43
16
0
30 Jun 2021