Meta-Reinforcement Learning of Structured Exploration Strategies

20 February 2018
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
    OffRL

Papers citing "Meta-Reinforcement Learning of Structured Exploration Strategies"

50 / 206 papers shown
Concept Discovery for Fast Adapatation
SDM (SDM), 2023
Shengyu Feng
Hanghang Tong
OffRL
207
0
0
19 Jan 2023
Is Conditional Generative Modeling all you need for Decision-Making?
International Conference on Learning Representations (ICLR), 2022
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
691
527
0
28 Nov 2022
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning
Junjie Wang
Yao Mu
Dong Li
Qichao Zhang
Dongbin Zhao
Yuzheng Zhuang
Ping Luo
Bin Wang
Jianye Hao
OffRL
143
3
0
23 Nov 2022
Implicit Training of Energy Model for Structure Prediction
Shiv Shankar
Vihari Piratla
200
0
0
21 Nov 2022
Giving Feedback on Interactive Student Programs with Meta-Exploration
Neural Information Processing Systems (NeurIPS), 2022
Emmy Liu
Moritz Stephan
Allen Nie
Chris Piech
Emma Brunskill
Chelsea Finn
AI4Ed
255
7
0
16 Nov 2022
Build generally reusable agent-environment interaction models
Jun Jin
Hongming Zhang
Jun Luo
133
0
0
13 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
164
1
0
02 Nov 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
212
3
0
11 Oct 2022
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Yao Mu
Yuzheng Zhuang
Fei Ni
Sijin Yu
Jianyu Chen
Jianye Hao
Ping Luo
103
2
0
09 Oct 2022
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization
Neural Information Processing Systems (NeurIPS), 2022
Nathan Grinsztajn
Daniel Furelos-Blanco
Shikha Surana
Matthew Macfarlane
Thomas D. Barrett
262
50
0
07 Oct 2022
Distributionally Adaptive Meta Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Anurag Ajay
Abhishek Gupta
Dibya Ghosh
Sergey Levine
Pulkit Agrawal
OOD
194
17
0
06 Oct 2022
Meta Reinforcement Learning for Optimal Design of Legged Robots
IEEE Robotics and Automation Letters (RA-L), 2022
Álvaro Belmonte-Baeza
Joonho Lee
Giorgio Valsecchi
Marco Hutter
180
33
0
06 Oct 2022
Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2022
Zheng Wu
Yichen Xie
Wenzhao Lian
Changhao Wang
Yanjiang Guo
Jianyu Chen
S. Schaal
Masayoshi Tomizuka
OffRL
256
12
0
01 Oct 2022
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Desik Rengarajan
Sapana Chaudhary
JaeWon Kim
D. Kalathil
S. Shakkottai
OffRL
192
2
0
26 Sep 2022
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL
LRM
179
4
0
29 Jul 2022
Provable Generalization of Overparameterized Meta-learning Trained with SGD
Neural Information Processing Systems (NeurIPS), 2022
Yu Huang
Yingbin Liang
Longbo Huang
MLT
289
13
0
18 Jun 2022
Transformers are Meta-Reinforcement Learners
International Conference on Machine Learning (ICML), 2022
Luckeciano C. Melo
OffRL
216
62
0
14 Jun 2022
Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Sungryull Sohn
Hyunjae Woo
Jongwook Choi
Lyubing Qiang
Izzeddin Gur
Aleksandra Faust
Honglak Lee
BDL
OffRL
230
3
0
25 May 2022
Skill-based Meta-Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Taewook Nam
Shao-Hua Sun
Karl Pertsch
Sung Ju Hwang
Joseph J. Lim
OffRL
157
53
0
25 Apr 2022
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
L. Brinkmeyer
Rafael Rêgo Drumond
Johannes Burchert
Lars Schmidt-Thieme
AI4TS
205
12
0
07 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
95
0
0
05 Apr 2022
The Sandbox Environment for Generalizable Agent Research (SEGAR)
R. Devon Hjelm
Bogdan Mazoure
Florian Golemo
Felipe Vieira Frujeri
Mihai Jalobeanu
Andrey Kolobov
LLMAG
LRM
153
2
0
19 Mar 2022
What Matters For Meta-Learning Vision Regression Tasks?
Computer Vision and Pattern Recognition (CVPR), 2022
Ni Gao
Hanna Ziesche
Ngo Anh Vien
Michael Volpp
Gerhard Neumann
VLM
159
29
0
09 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
278
12
0
01 Mar 2022
Meta-Reinforcement Learning with Self-Modifying Networks
Neural Information Processing Systems (NeurIPS), 2022
Mathieu Chalvidal
Thomas Serre
Rufin VanRullen
KELM
218
7
0
04 Feb 2022
System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Hyunsung Lee
211
1
0
18 Jan 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
279
12
0
31 Dec 2021
Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti
Mattia Mancassola
Marcello Restelli
OffRL
155
27
0
16 Dec 2021
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
213
17
0
08 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
180
22
0
02 Dec 2021
Reinforcement Learning for Few-Shot Text Generation Adaptation
Pengsen Cheng
Jinqiao Dai
Jiamiao Liu
Jiayong Liu
Peng Jia
377
5
0
22 Nov 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
127
14
0
02 Nov 2021
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning
Matthew Macfarlane
Paul Caron
Thomas D. Barrett
Ian Davies
Alexandre Laterre
166
6
0
30 Oct 2021
Context Meta-Reinforcement Learning via Neuromodulation
Neural Networks (NN), 2021
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Nicholas A. Ketz
Praveen K. Pilly
Andrea Soltoggio
OffRL
498
14
0
30 Oct 2021
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Sumedh Anand Sontakke
Stephen Iota
Zizhao Hu
Arash Mehrjou
Laurent Itti
Bernhard Schölkopf
OODD
181
3
0
29 Oct 2021
Wasserstein Unsupervised Reinforcement Learning
Shuncheng He
Yuhang Jiang
Hongchang Zhang
Jianzhun Shao
Xiangyang Ji
OffRL
220
28
0
15 Oct 2021
REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents
A. Lazaridis
I. Vlahavas
OffRL
143
2
0
11 Oct 2021
Offline Meta-Reinforcement Learning for Industrial Insertion
IEEE International Conference on Robotics and Automation (ICRA), 2021
Tony Zhao
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
N. Heess
Jonathan Scholz
S. Schaal
Sergey Levine
OffRL
OnRL
288
91
0
08 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
272
86
0
28 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
193
7
0
18 Sep 2021
Knowledge is reward: Learning optimal exploration by predictive reward cashing
Luca Ambrogioni
91
1
0
17 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
298
99
0
01 Sep 2021
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization
Lu Wen
Songan Zhang
H. E. Tseng
Baljeet Singh
Dimitar Filev
H. Peng
OffRL
OnRL
179
2
0
19 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
ACM Transactions on Knowledge Discovery from Data (TKDD), 2021
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
237
8
0
10 Aug 2021
Meta-Reinforcement Learning in Broad and Non-Parametric Environments
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Zhenshan Bing
Lukas Knak
F. O. Morin
Kai-Qi Huang
Alois C. Knoll
OffRL
169
24
0
08 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
International Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
364
75
0
08 Jul 2021
Meta-Reinforcement Learning for Heuristic Planning
Ricardo Luna Gutierrez
Matteo Leonetti
OffRL
AIFin
143
4
0
06 Jul 2021
Meta-Adaptive Nonlinear Control: Theory and Algorithms
Neural Information Processing Systems (NeurIPS), 2021
Guanya Shi
Kamyar Azizzadenesheli
Michael O'Connell
Soon-Jo Chung
Yisong Yue
348
48
0
11 Jun 2021
Quickest change detection with unknown parameters: Constant complexity and near optimality
Firas Jarboui
Vianney Perchet
100
0
0
09 Jun 2021
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture
Neural Information Processing Systems (NeurIPS), 2021
Suyoung Lee
Sae-Young Chung
OffRL
AI4CE
205
20
0
28 May 2021