ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.09801
  4. Cited By
Meta-Gradient Reinforcement Learning

Meta-Gradient Reinforcement Learning

24 May 2018
Zhongwen Xu
H. V. Hasselt
David Silver
ArXivPDFHTML

Papers citing "Meta-Gradient Reinforcement Learning"

50 / 203 papers shown
Title
A nonlinear hidden layer enables actor-critic agents to learn multiple
  paired association navigation
A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation
M Ganesh Kumar
Cheston Tan
C. Libedinsky
S. Yen
A. Tan
7
5
0
25 Jun 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via
  Off-Policy Evaluation
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Rémi Munos
Michal Valko
OffRL
18
9
0
24 Jun 2021
Preferential Temporal Difference Learning
Preferential Temporal Difference Learning
N. Anand
Doina Precup
OOD
11
8
0
11 Jun 2021
Taylor Expansion of Discount Factors
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
32
5
0
11 Jun 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
32
11
0
08 Jun 2021
Neural Auto-Curricula
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Bo Liu
Bo Liu
Stephen Marcus McAleer
Ying Wen
Jun Wang
Yaodong Yang
26
1
0
04 Jun 2021
Discovering Diverse Nearly Optimal Policies with Successor Features
Discovering Diverse Nearly Optimal Policies with Successor Features
Tom Zahavy
Brendan O'Donoghue
André Barreto
Volodymyr Mnih
Sebastian Flennerhag
Satinder Singh
20
20
0
01 Jun 2021
A nearly Blackwell-optimal policy gradient method
A nearly Blackwell-optimal policy gradient method
Vektor Dewanto
M. Gallagher
OffRL
16
0
0
28 May 2021
Fast and Slow Learning of Recurrent Independent Mechanisms
Fast and Slow Learning of Recurrent Independent Mechanisms
Kanika Madan
Rosemary Nan Ke
Anirudh Goyal
Bernhard Schölkopf
Yoshua Bengio
OCL
32
40
0
18 May 2021
Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Return-based Scaling: Yet Another Normalisation Trick for Deep RL
Tom Schaul
Georg Ostrovski
Iurii Kemaev
Diana Borsa
11
19
0
11 May 2021
What is Going on Inside Recurrent Meta Reinforcement Learning Agents?
What is Going on Inside Recurrent Meta Reinforcement Learning Agents?
Safa Alver
Doina Precup
AIFin
16
4
0
29 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
16
66
0
13 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement
  Learning
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
15
4
0
09 Apr 2021
Meta-Adversarial Inverse Reinforcement Learning for Decision-making
  Tasks
Meta-Adversarial Inverse Reinforcement Learning for Decision-making Tasks
Pin Wang
Hanhan Li
Ching-yao Chan
11
9
0
23 Mar 2021
Regularized Softmax Deep Multi-Agent $Q$-Learning
Regularized Softmax Deep Multi-Agent QQQ-Learning
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
42
31
0
22 Mar 2021
Reinforcement Learning with Algorithms from Probabilistic Structure
  Estimation
Reinforcement Learning with Algorithms from Probabilistic Structure Estimation
J. Epperlein
R. Overko
Sergiy Zhuk
Christopher K. King
Djallel Bouneffouf
Andrew Cullen
Robert Shorten
OffRL
8
6
0
15 Mar 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent
  Reinforcement Learning
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning
Jianzhun Shao
Hongchang Zhang
Yuhang Jiang
Shuncheng He
Xiangyang Ji
29
5
0
24 Feb 2021
Identifying Physical Law of Hamiltonian Systems via Meta-Learning
Identifying Physical Law of Hamiltonian Systems via Meta-Learning
Seungjun Lee
Haesang Yang
W. Seong
30
13
0
23 Feb 2021
Multi-Objective Meta Learning
Multi-Objective Meta Learning
Feiyang Ye
Baijiong Lin
Zhixiong Yue
Pengxin Guo
Qiao Xiao
Yu Zhang
41
47
0
14 Feb 2021
Discovery of Options via Meta-Learned Subgoals
Discovery of Options via Meta-Learned Subgoals
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
26
33
0
12 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement
  learning agents
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang
Michael King
Nicolas Porcel
Z. Kurth-Nelson
Tina Zhu
...
Neil C. Rabinowitz
Loic Matthey
Demis Hassabis
Alexander Lerchner
M. Botvinick
OffRL
23
30
0
04 Feb 2021
Investigating Bi-Level Optimization for Learning and Vision from a
  Unified Perspective: A Survey and Beyond
Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond
Risheng Liu
Jiaxin Gao
Jin Zhang
Deyu Meng
Zhouchen Lin
AI4CE
59
222
0
27 Jan 2021
Training Learned Optimizers with Randomly Initialized Learned Optimizers
Training Learned Optimizers with Randomly Initialized Learned Optimizers
Luke Metz
C. Freeman
Niru Maheswaranathan
Jascha Narain Sohl-Dickstein
43
12
0
14 Jan 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
49
310
0
25 Dec 2020
Personalized Adaptive Meta Learning for Cold-start User Preference
  Prediction
Personalized Adaptive Meta Learning for Cold-start User Preference Prediction
Runsheng Yu
Yu Gong
Xu He
Bo An
Yu Zhu
Qingwen Liu
Wenwu Ou
25
57
0
22 Dec 2020
Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
  Drone Networks
Distributed Multi-agent Meta Learning for Trajectory Design in Wireless Drone Networks
Ye Hu
Mingzhe Chen
Walid Saad
H. Vincent Poor
Shuguang Cui
29
104
0
06 Dec 2020
Latent Skill Planning for Exploration and Transfer
Latent Skill Planning for Exploration and Transfer
Kevin Xie
Homanga Bharadhwaj
Danijar Hafner
Animesh Garg
Florian Shkurti
39
20
0
27 Nov 2020
Adaptable Automation with Modular Deep Reinforcement Learning and Policy
  Transfer
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
26
25
0
27 Nov 2020
Meta-learning in natural and artificial intelligence
Meta-learning in natural and artificial intelligence
Jane X. Wang
30
110
0
26 Nov 2020
Information-theoretic Task Selection for Meta-Reinforcement Learning
Information-theoretic Task Selection for Meta-Reinforcement Learning
Ricardo Luna Gutierrez
Matteo Leonetti
24
19
0
02 Nov 2020
Forethought and Hindsight in Credit Assignment
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
19
25
0
26 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
Balancing Constraints and Rewards with Meta-Gradient D4PG
D. A. Calian
D. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
31
25
0
13 Oct 2020
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance
  Metric Learning and Behavior Regularization
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
Lanqing Li
Rui Yang
Dijun Luo
OffRL
27
10
0
02 Oct 2020
A Primal-Dual Subgradient Approachfor Fair Meta Learning
A Primal-Dual Subgradient Approachfor Fair Meta Learning
Chenxu Zhao
Feng Chen
Zhuoyi Wang
Latifur Khan
FaML
16
0
0
26 Sep 2020
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning
Daochen Zha
Kwei-Herng Lai
Mingyang Wan
X. Hu
19
53
0
16 Sep 2020
"It's Unwieldy and It Takes a Lot of Time." Challenges and Opportunities
  for Creating Agents in Commercial Games
"It's Unwieldy and It Takes a Lot of Time." Challenges and Opportunities for Creating Agents in Commercial Games
Mikhail Jacob
Sam Devlin
Katja Hofmann
LLMAG
AI4CE
14
16
0
01 Sep 2020
learn2learn: A Library for Meta-Learning Research
learn2learn: A Library for Meta-Learning Research
Sébastien M. R. Arnold
Praateek Mahajan
Debajyoti Datta
Ian Bunner
Konstantinos Saitas Zarkias
18
95
0
27 Aug 2020
Discovering Reinforcement Learning Algorithms
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
21
126
0
17 Jul 2020
Contextualizing Enhances Gradient Based Meta Learning
Contextualizing Enhances Gradient Based Meta Learning
Evan Vogelbaum
Rumen Dangovski
L. Jing
Marin Soljacic
34
3
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
16
77
0
16 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
22
71
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via
  Metagradient
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
4
20
0
03 Jul 2020
Policy-GNN: Aggregation Optimization for Graph Neural Networks
Policy-GNN: Aggregation Optimization for Graph Neural Networks
Kwei-Herng Lai
Daochen Zha
Kaixiong Zhou
Xia Hu
21
90
0
26 Jun 2020
META-Learning Eligibility Traces for More Sample Efficient Temporal
  Difference Learning
META-Learning Eligibility Traces for More Sample Efficient Temporal Difference Learning
Mingde Zhao
8
0
0
16 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary
  Strategies
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
9
13
0
13 Jun 2020
Learning to Incentivize Other Learning Agents
Learning to Incentivize Other Learning Agents
Jiachen Yang
Ang Li
Mehrdad Farajtabar
P. Sunehag
Edward Hughes
H. Zha
18
67
0
10 Jun 2020
Dual Policy Distillation
Dual Policy Distillation
Kwei-Herng Lai
Daochen Zha
Yuening Li
Xia Hu
OffRL
13
10
0
07 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Meta-Reinforcement Learning for Trajectory Design in Wireless UAV
  Networks
Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks
Ye Hu
Mingzhe Chen
Walid Saad
H. Vincent Poor
Shuguang Cui
11
25
0
25 May 2020
Previous
12345
Next