ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05334
  4. Cited By
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods

Online Meta-Critic Learning for Off-Policy Actor-Critic Methods

11 March 2020
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
    OffRL
ArXivPDFHTML

Papers citing "Online Meta-Critic Learning for Off-Policy Actor-Critic Methods"

24 / 24 papers shown
Title
Eau De QQQ-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
Tim Lukas Faust
Yogesh Tripathi
Jan Peters
Carlo DÉramo
37
0
0
03 Mar 2025
Black box meta-learning intrinsic rewards for sparse-reward environments
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
38
0
0
31 Jul 2024
Multi-agent Off-policy Actor-Critic Reinforcement Learning for Partially
  Observable Environments
Multi-agent Off-policy Actor-Critic Reinforcement Learning for Partially Observable Environments
Ainur Zhaikhan
Ali H. Sayed
OffRL
31
0
0
06 Jul 2024
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
122
0
19 Jan 2023
Building a Subspace of Policies for Scalable Continual Learning
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya
T. Doan
Lucas Page-Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
29
29
0
18 Nov 2022
MetaEMS: A Meta Reinforcement Learning-based Control Framework for
  Building Energy Management System
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System
Huiliang Zhang
Di Wu
Benoit Boulet
21
6
0
23 Oct 2022
Online Bilevel Optimization: Regret Analysis of Online Alternating
  Gradient Methods
Online Bilevel Optimization: Regret Analysis of Online Alternating Gradient Methods
Davoud Ataee Tarzanagh
Parvin Nazari
Bojian Hou
Li Shen
Laura Balzano
44
10
0
06 Jul 2022
Beyond backpropagation: bilevel optimization through implicit
  differentiation and equilibrium propagation
Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagation
Nicolas Zucchet
João Sacramento
28
15
0
06 May 2022
Model Based Meta Learning of Critics for Policy Gradients
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
14
0
0
05 Apr 2022
MetaCon: Unified Predictive Segments System with Trillion Concept
  Meta-Learning
MetaCon: Unified Predictive Segments System with Trillion Concept Meta-Learning
Keqian Li
Yifan Hu
Logan Palanisamy
Lisa Jones
A. Gupta
J. E. Grigsby
Ili Selinger
Matt Gillingham
Fei Tan
21
1
0
09 Mar 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement
  Learning
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
14
12
0
31 Dec 2021
Automatic tuning of hyper-parameters of reinforcement learning
  algorithms using Bayesian optimization with behavioral cloning
Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning
Juan Cruz Barsce
J. Palombarini
Ernesto C. Martínez
OffRL
14
1
0
15 Dec 2021
Sharing to learn and learning to share; Fitting together Meta-Learning,
  Multi-Task Learning, and Transfer Learning: A meta review
Sharing to learn and learning to share; Fitting together Meta-Learning, Multi-Task Learning, and Transfer Learning: A meta review
Richa Upadhyay
Ronald Phlypo
Rajkumar Saini
Marcus Liwicki
CLL
21
5
0
23 Nov 2021
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient
  Reinforcement Learning
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Clément Bonnet
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
12
6
0
30 Oct 2021
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement
  Learning with Prior Regularization
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization
Lu Wen
Songan Zhang
H. E. Tseng
Baljeet Singh
Dimitar Filev
H. Peng
OffRL
OnRL
19
1
0
19 Aug 2021
Neural Auto-Curricula
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Ziyu Wan
Bo Liu
Stephen Marcus McAleer
Ying Wen
Jun Wang
Yaodong Yang
18
1
0
04 Jun 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent
  Reinforcement Learning
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning
Jianzhun Shao
Hongchang Zhang
Yuhang Jiang
Shuncheng He
Xiangyang Ji
29
5
0
24 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
49
307
0
25 Dec 2020
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning
Ibrahim Ahmed
Marcos Quiñones-Grueiro
G. Biswas
18
2
0
10 Dec 2020
Effective Regularization Through Loss-Function Metalearning
Effective Regularization Through Loss-Function Metalearning
Santiago Gonzalez
Risto Miikkulainen
24
5
0
02 Oct 2020
Discovering Reinforcement Learning Algorithms
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
21
126
0
17 Jul 2020
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
43
1,928
0
11 Apr 2020
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Luca Franceschi
P. Frasconi
Saverio Salzo
Riccardo Grazzi
Massimiliano Pontil
110
716
0
13 Jun 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
329
11,684
0
09 Mar 2017
1