Some Considerations on Learning to Explore via Meta-Reinforcement Learning

3 March 2018
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
    LRM

Papers citing "Some Considerations on Learning to Explore via Meta-Reinforcement Learning"

50 / 75 papers shown
An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL
Xingtu Liu
OOD
173
0
0
27 Oct 2025
Adaptive Policy Backbone via Shared Network
Bumgeun Park
Donghwan Lee
OffRL, OnRL
184
0
0
26 Sep 2025
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
Chengmin Zhou
Ville Kyrki
Pasi Fränti
Laura Ruotsalainen
BDL, AI4CE
421
0
0
12 May 2025
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Yuxiao Qu
Matthew Y. R. Yang
Amrith Rajagopal Setlur
Lewis Tunstall
E. Beeching
Ruslan Salakhutdinov
Aviral Kumar
OffRL
410
84
0
10 Mar 2025
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Neural Information Processing Systems (NeurIPS), 2024
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
291
13
0
17 Nov 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Neural Information Processing Systems (NeurIPS), 2024
Siyuan Xu
Minghui Zhu
OffRL
220
5
0
13 Oct 2024
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
282
1
0
31 Jul 2024
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
252
1
0
18 Jun 2024
In-context Exploration-Exploitation for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2024
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRL, OnRL
214
12
0
11 Mar 2024
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
440
46
0
19 Dec 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL, LM&Ro
347
44
0
15 Oct 2023
First-Explore, then Exploit: Meta-Learning Intelligent Exploration
Neural Information Processing Systems (NeurIPS), 2023
Ben Norman
Jeff Clune
197
3
0
05 Jul 2023
RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
366
0
0
28 Jun 2023
Meta Generative Flow Networks with Personalization for Task-Specific Adaptation
Information Sciences (Inf. Sci.), 2023
Xinyuan Ji
Xu Zhang
Wei Xi
Haozhi Wang
Olga Gadyatskaya
Yinchuan Li
178
1
0
16 Jun 2023
Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Mingyang Wang
Zhenshan Bing
Xiangtong Yao
Shuai Wang
Hang Su
Chenguang Yang
Kai Huang
Alois C. Knoll
SSL, OOD
259
19
0
29 Apr 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
AAAI Conference on Artificial Intelligence (AAAI), 2023
Zhendong Chu
Hongning Wang
OffRL
183
9
0
15 Feb 2023
Human-Timescale Adaptation in an Open-Ended Task Space
International Conference on Machine Learning (ICML), 2023
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro, OffRL, AI4CE, LRM
326
147
0
18 Jan 2023
Reusable Options through Gradient-based Meta Learning
David Kuric
H. V. Hoof
258
0
0
22 Dec 2022
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Desik Rengarajan
Sapana Chaudhary
JaeWon Kim
D. Kalathil
S. Shakkottai
OffRL
191
2
0
26 Sep 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
206
7
0
22 Sep 2022
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
135
0
0
21 Sep 2022
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL, LRM
179
3
0
29 Jul 2022
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yijie Guo
Qiucheng Wu
Honglak Lee
OffRL
256
8
0
19 Jul 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
International Conference on Machine Learning (ICML), 2022
Haoqi Yuan
Zongqing Lu
SSL, OffRL
219
48
0
21 Jun 2022
Variational Meta Reinforcement Learning for Social Robotics
Anand Ballou
Xavier Alameda-Pineda
Chris Reinke
OffRL
289
15
0
07 Jun 2022
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL, OffRL
210
17
0
08 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
180
22
0
02 Dec 2021
On the Practical Consistency of Meta-Reinforcement Learning Algorithms
Zheng Xiong
L. Zintgraf
Jacob Beck
Risto Vuorio
Shimon Whiteson
CLL, AIFin
175
11
0
01 Dec 2021
Context Meta-Reinforcement Learning via Neuromodulation
Neural Networks (NN), 2021
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Nicholas A. Ketz
Praveen K. Pilly
Andrea Soltoggio
OffRL
497
14
0
30 Oct 2021
Behaviour-conditioned policies for cooperative reinforcement learning tasks
Antti Keurulainen
Isak Westerlund
Ariel Kwiatkowski
Samuel Kaski
Alexander Ilin
101
0
0
04 Oct 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
193
7
0
18 Sep 2021
Bootstrapped Meta-Learning
International Conference on Learning Representations (ICLR), 2021
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
170
60
0
09 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
296
99
0
01 Sep 2021
Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Rémi Munos
Michal Valko
OffRL
225
9
0
24 Jun 2021
Least-Restrictive Multi-Agent Collision Avoidance via Deep Meta Reinforcement Learning and Optimal Control
International Conference on Robot Intelligence Technology and Applications (RITA), 2021
Salar Asayesh
Mo Chen
M. Mehrandezh
Kamal Gupta
110
5
0
02 Jun 2021
Meta-Reinforcement Learning by Tracking Task Non-stationarity
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Riccardo Poiani
Andrea Tirinzoni
Marcello Restelli
OffRL
173
11
0
18 May 2021
Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning
Bernie Wang
Si-ting Xu
Kurt Keutzer
Yang Gao
Bichen Wu
SSL, OffRL
109
8
0
10 Mar 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang
Michael King
Nicolas Porcel
Z. Kurth-Nelson
Tina Zhu
...
Neil C. Rabinowitz
Loic Matthey
Demis Hassabis
Alexander Lerchner
M. Botvinick
OffRL
303
39
0
04 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Journal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL, OffRL
549
376
0
25 Dec 2020
Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
International Conference on Machine Learning (ICML), 2020
L. Zintgraf
Leo Feng
Cong Lu
Maximilian Igl
Kristian Hartikainen
Katja Hofmann
Shimon Whiteson
350
43
0
02 Oct 2020
OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation
Hongyu Ren
Yuke Zhu
J. Leskovec
Anima Anandkumar
Animesh Garg
LRM
107
4
0
17 Aug 2020
Offline Meta Learning of Exploration
Ron Dorfman
Idan Shenfeld
Aviv Tamar
OffRL
234
20
0
06 Aug 2020
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Annie Xie
James Harrison
Chelsea Finn
CLL, OffRL
217
69
0
18 Jun 2020
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration
Jin Zhang
Jianhao Wang
Hao Hu
Tong Chen
Yingfeng Chen
Changjie Fan
Chongjie Zhang
OffRL
196
30
0
15 Jun 2020
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver
Doina Precup
OffRL
193
8
0
12 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD, OffRL
269
40
0
12 Jun 2020
Meta-Model-Based Meta-Policy Optimization
Asian Conference on Machine Learning (ACML), 2020
Takuya Hiraoka
Takahisa Imagawa
Voot Tangkaratt
Takayuki Osa
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
407
9
0
04 Jun 2020
A Comprehensive Overview and Survey of Recent Advances in Meta-Learning
Huimin Peng
VLM, OffRL
445
40
0
17 Apr 2020
Meta-Learning in Neural Networks: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
753
2,389
0
11 Apr 2020
Meta-learning curiosity algorithms
International Conference on Learning Representations (ICLR), 2020
Ferran Alet
Martin Schneider
Tomas Lozano-Perez
L. Kaelbling
240
67
0
11 Mar 2020