ResearchTrend.AI
Meta-Gradient Reinforcement Learning with an Objective Discovered Online

16 July 2020
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver

Papers citing "Meta-Gradient Reinforcement Learning with an Objective Discovered Online"

20 / 20 papers shown
Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization
Maxence Faldor
Robert Tjarko Lange
Antoine Cully
81
0
0
04 Feb 2025
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
Henrique Donâncio
Antoine Barrier
Leah F. South
Florence Forbes
28
0
0
16 Oct 2024
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
26
6
0
03 Feb 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
Discovering Evolution Strategies via Meta-Black-Box Optimization
R. T. Lange
Tom Schaul
Yutian Chen
Tom Zahavy
Valentin Dalibard
Chris Xiaoxuan Lu
Satinder Singh
Sebastian Flennerhag
44
47
0
21 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
26
1
0
19 Nov 2022
Auxiliary task discovery through generate-and-test
Banafsheh Rafiee
Sina Ghiassian
Jun Jin
R. Sutton
Jun Luo
Adam White
21
0
0
25 Oct 2022
Meta-Gradients in Non-Stationary Environments
Jelena Luketina
Sebastian Flennerhag
Yannick Schroecker
David Abel
Tom Zahavy
Satinder Singh
31
10
0
13 Sep 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning
Jiachen Yang
Ethan Wang
Rakshit S. Trivedi
T. Zhao
H. Zha
30
21
0
20 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
25
26
0
16 Dec 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
42
16
0
07 Oct 2021
Introducing Symmetries to Black Box Meta Reinforcement Learning
Louis Kirsch
Sebastian Flennerhag
Hado van Hasselt
A. Friesen
Junhyuk Oh
Yutian Chen
22
30
0
22 Sep 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
32
66
0
08 Jul 2021
Training Learned Optimizers with Randomly Initialized Learned Optimizers
Luke Metz
C. Freeman
Niru Maheswaranathan
Jascha Narain Sohl-Dickstein
43
12
0
14 Jan 2021
Forethought and Hindsight in Credit Assignment
Veronica Chelu
Doina Precup
H. V. Hasselt
19
25
0
26 Oct 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
362
11,700
0
09 Mar 2017
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012