1805.09801
Cited By
Meta-Gradient Reinforcement Learning
24 May 2018
Zhongwen Xu
H. V. Hasselt
David Silver
Papers citing "Meta-Gradient Reinforcement Learning"
50 / 203 papers shown
Title
A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation
M Ganesh Kumar
Cheston Tan
C. Libedinsky
S. Yen
A. Tan
7
5
0
25 Jun 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Rémi Munos
Michal Valko
OffRL
18
9
0
24 Jun 2021
Preferential Temporal Difference Learning
N. Anand
Doina Precup
OOD
11
8
0
11 Jun 2021
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
32
5
0
11 Jun 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
32
11
0
08 Jun 2021
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Ziyu Wan
Bo Liu
Stephen Marcus McAleer
Ying Wen
Jun Wang
Yaodong Yang
26
1
0
04 Jun 2021
Discovering Diverse Nearly Optimal Policies with Successor Features
Tom Zahavy
Brendan O'Donoghue
André Barreto
Volodymyr Mnih
Sebastian Flennerhag
Satinder Singh
20
20
0
01 Jun 2021
A nearly Blackwell-optimal policy gradient method
Vektor Dewanto
M. Gallagher
OffRL
16
0
0
28 May 2021
Fast and Slow Learning of Recurrent Independent Mechanisms
Kanika Madan
Rosemary Nan Ke
Anirudh Goyal
Bernhard Schölkopf
Yoshua Bengio
OCL
32
40
0
18 May 2021
Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Tom Schaul
Georg Ostrovski
Iurii Kemaev
Diana Borsa
11
19
0
11 May 2021
What is Going on Inside Recurrent Meta Reinforcement Learning Agents?
Safa Alver
Doina Precup
AIFin
16
4
0
29 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
16
66
0
13 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
15
4
0
09 Apr 2021
Meta-Adversarial Inverse Reinforcement Learning for Decision-making Tasks
Pin Wang
Hanhan Li
Ching-yao Chan
11
9
0
23 Mar 2021
Regularized Softmax Deep Multi-Agent Q-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
42
31
0
22 Mar 2021
Reinforcement Learning with Algorithms from Probabilistic Structure Estimation
J. Epperlein
R. Overko
Sergiy Zhuk
Christopher K. King
Djallel Bouneffouf
Andrew Cullen
Robert Shorten
OffRL
8
6
0
15 Mar 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning
Jianzhun Shao
Hongchang Zhang
Yuhang Jiang
Shuncheng He
Xiangyang Ji
29
5
0
24 Feb 2021
Identifying Physical Law of Hamiltonian Systems via Meta-Learning
Seungjun Lee
Haesang Yang
W. Seong
30
13
0
23 Feb 2021
Multi-Objective Meta Learning
Feiyang Ye
Baijiong Lin
Zhixiong Yue
Pengxin Guo
Qiao Xiao
Yu Zhang
41
47
0
14 Feb 2021
Discovery of Options via Meta-Learned Subgoals
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
26
33
0
12 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang
Michael King
Nicolas Porcel
Z. Kurth-Nelson
Tina Zhu
...
Neil C. Rabinowitz
Loic Matthey
Demis Hassabis
Alexander Lerchner
M. Botvinick
OffRL
23
30
0
04 Feb 2021
Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond
Risheng Liu
Jiaxin Gao
Jin Zhang
Deyu Meng
Zhouchen Lin
AI4CE
59
222
0
27 Jan 2021
Training Learned Optimizers with Randomly Initialized Learned Optimizers
Luke Metz
C. Freeman
Niru Maheswaranathan
Jascha Narain Sohl-Dickstein
43
12
0
14 Jan 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
49
310
0
25 Dec 2020
Personalized Adaptive Meta Learning for Cold-start User Preference Prediction
Runsheng Yu
Yu Gong
Xu He
Bo An
Yu Zhu
Qingwen Liu
Wenwu Ou
25
57
0
22 Dec 2020
Distributed Multi-agent Meta Learning for Trajectory Design in Wireless Drone Networks
Ye Hu
Mingzhe Chen
Walid Saad
H. Vincent Poor
Shuguang Cui
29
104
0
06 Dec 2020
Latent Skill Planning for Exploration and Transfer
Kevin Xie
Homanga Bharadhwaj
Danijar Hafner
Animesh Garg
Florian Shkurti
39
20
0
27 Nov 2020
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
26
25
0
27 Nov 2020
Meta-learning in natural and artificial intelligence
Jane X. Wang
30
110
0
26 Nov 2020
Information-theoretic Task Selection for Meta-Reinforcement Learning
Ricardo Luna Gutierrez
Matteo Leonetti
24
19
0
02 Nov 2020
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
19
25
0
26 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
D. A. Calian
D. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
31
25
0
13 Oct 2020
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
Lanqing Li
Rui Yang
Dijun Luo
OffRL
27
10
0
02 Oct 2020
A Primal-Dual Subgradient Approach for Fair Meta Learning
Chenxu Zhao
Feng Chen
Zhuoyi Wang
Latifur Khan
FaML
16
0
0
26 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
19
53
0
16 Sep 2020
"It's Unwieldy and It Takes a Lot of Time." Challenges and Opportunities for Creating Agents in Commercial Games
Mikhail Jacob
Sam Devlin
Katja Hofmann
LLMAG
AI4CE
14
16
0
01 Sep 2020
learn2learn: A Library for Meta-Learning Research
Sébastien M. R. Arnold
Praateek Mahajan
Debajyoti Datta
Ian Bunner
Konstantinos Saitas Zarkias
18
95
0
27 Aug 2020
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
21
126
0
17 Jul 2020
Contextualizing Enhances Gradient Based Meta Learning
Evan Vogelbaum
Rumen Dangovski
L. Jing
Marin Soljacic
34
3
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
16
77
0
16 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
22
71
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
4
20
0
03 Jul 2020
Policy-GNN: Aggregation Optimization for Graph Neural Networks
Kwei-Herng Lai
Daochen Zha
Kaixiong Zhou
Xia Hu
21
90
0
26 Jun 2020
META-Learning Eligibility Traces for More Sample Efficient Temporal Difference Learning
Mingde Zhao
8
0
0
16 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
9
13
0
13 Jun 2020
Learning to Incentivize Other Learning Agents
Jiachen Yang
Ang Li
Mehrdad Farajtabar
P. Sunehag
Edward Hughes
H. Zha
18
67
0
10 Jun 2020
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Xia Hu
OffRL
13
10
0
07 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks
Ye Hu
Mingzhe Chen
Walid Saad
H. Vincent Poor
Shuguang Cui
11
25
0
25 May 2020