ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.07245
  4. Cited By
Meta-Reinforcement Learning of Structured Exploration Strategies

Meta-Reinforcement Learning of Structured Exploration Strategies

20 February 2018
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Meta-Reinforcement Learning of Structured Exploration Strategies"

50 / 79 papers shown
Title
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wei Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
93
0
0
27 Apr 2025
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
45
1
0
03 Oct 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
84
2
0
07 Jun 2024
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and
  Online LQR
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
41
1
0
09 Dec 2023
Transformers as Decision Makers: Provable In-Context Reinforcement
  Learning via Supervised Pretraining
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
37
45
0
12 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
29
1
0
06 Oct 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
AdaptNet: Policy Adaptation for Physics-Based Character Control
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
42
17
0
30 Sep 2023
Is Meta-Learning the Right Approach for the Cold-Start Problem in
  Recommender Systems?
Is Meta-Learning the Right Approach for the Cold-Start Problem in Recommender Systems?
Davide Buffelli
Ashish Gupta
Agnieszka Strzalka
Vassilis Plachouras
OffRL
LRM
36
1
0
16 Aug 2023
SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with
  Meta-Learning
SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with Meta-Learning
Zifeng Wang
Cao Xiao
Jimeng Sun
23
17
0
07 Apr 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
Meta-Reinforcement Learning via Exploratory Task Clustering
Zhendong Chu
Hongning Wang
OffRL
33
5
0
15 Feb 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRL
OOD
21
6
0
26 Jan 2023
A Survey of Meta-Reinforcement Learning
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
37
124
0
19 Jan 2023
Concept Discovery for Fast Adapatation
Concept Discovery for Fast Adapatation
Shengyu Feng
Yangqiu Song
OffRL
33
0
0
19 Jan 2023
Is Conditional Generative Modeling all you need for Decision-Making?
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
66
365
0
28 Nov 2022
Giving Feedback on Interactive Student Programs with Meta-Exploration
Giving Feedback on Interactive Student Programs with Meta-Exploration
E. Liu
Moritz Stephan
Allen Nie
Chris Piech
Emma Brunskill
Chelsea Finn
AI4Ed
32
8
0
16 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
27
0
0
02 Nov 2022
Distributionally Adaptive Meta Reinforcement Learning
Distributionally Adaptive Meta Reinforcement Learning
Anurag Ajay
Abhishek Gupta
Dibya Ghosh
Sergey Levine
Pulkit Agrawal
OOD
29
14
0
06 Oct 2022
Meta Reinforcement Learning for Optimal Design of Legged Robots
Meta Reinforcement Learning for Optimal Design of Legged Robots
Álvaro Belmonte-Baeza
Joonho Lee
Giorgio Valsecchi
Marco Hutter
50
17
0
06 Oct 2022
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse
  Reward Environments
Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
Desik Rengarajan
Sapana Chaudhary
JaeWon Kim
D. Kalathil
S. Shakkottai
OffRL
29
2
0
26 Sep 2022
Meta Reinforcement Learning with Successor Feature Based Context
Meta Reinforcement Learning with Successor Feature Based Context
Xu Han
Feng Wu
OffRL
LRM
40
3
0
29 Jul 2022
Provable Generalization of Overparameterized Meta-learning Trained with
  SGD
Provable Generalization of Overparameterized Meta-learning Trained with SGD
Yu Huang
Yingbin Liang
Longbo Huang
MLT
30
8
0
18 Jun 2022
Transformers are Meta-Reinforcement Learners
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
41
50
0
14 Jun 2022
Fast Inference and Transfer of Compositional Task Structures for
  Few-shot Task Generalization
Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization
Sungryull Sohn
Hyunjae Woo
Jongwook Choi
lyubing qiang
Izzeddin Gur
Aleksandra Faust
Honglak Lee
BDL
OffRL
35
3
0
25 May 2022
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
Few-Shot Forecasting of Time-Series with Heterogeneous Channels
L. Brinkmeyer
Rafael Rêgo Drumond
Johannes Burchert
Lars Schmidt-Thieme
AI4TS
28
7
0
07 Apr 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via
  Descriptive Policy
System-Agnostic Meta-Learning for MDP-based Dynamic Scheduling via Descriptive Policy
Hyunsung Lee
19
1
0
18 Jan 2022
CoMPS: Continual Meta Policy Search
CoMPS: Continual Meta Policy Search
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
CLL
OffRL
28
16
0
08 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
26
18
0
02 Dec 2021
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement
  Learning Approach
Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach
Changyin Sun
Lijun Sun
22
6
0
02 Nov 2021
Context Meta-Reinforcement Learning via Neuromodulation
Context Meta-Reinforcement Learning via Neuromodulation
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Nicholas A. Ketz
Praveen K. Pilly
Andrea Soltoggio
OffRL
45
12
0
30 Oct 2021
GalilAI: Out-of-Task Distribution Detection using Causal Active
  Experimentation for Safe Transfer RL
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
Sumedh Anand Sontakke
Stephen Iota
Zizhao Hu
Arash Mehrjou
Laurent Itti
Bernhard Schölkopf
OODD
14
2
0
29 Oct 2021
REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using
  Reinforcement Learning Agents
REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents
A. Lazaridis
I. Vlahavas
OffRL
31
2
0
11 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
62
55
0
28 Sep 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
34
7
0
18 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
32
80
0
01 Sep 2021
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement
  Learning with Prior Regularization
Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization
Lu Wen
Songan Zhang
H. E. Tseng
Baljeet Singh
Dimitar Filev
H. Peng
OffRL
OnRL
27
1
0
19 Aug 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and
  Transportation Systems: A Survey
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
22
3
0
10 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
32
66
0
08 Jul 2021
Meta-Adaptive Nonlinear Control: Theory and Algorithms
Meta-Adaptive Nonlinear Control: Theory and Algorithms
Guanya Shi
Kamyar Azizzadenesheli
Michael O'Connell
Soon-Jo Chung
Yisong Yue
29
41
0
11 Jun 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
47
271
0
16 Apr 2021
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic
  Platforms
Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms
Ali Ghadirzadeh
Xi Chen
Petra Poklukar
Chelsea Finn
Mårten Björkman
Danica Kragic
BDL
39
41
0
05 Mar 2021
Meta-Learning Dynamics Forecasting Using Task Inference
Meta-Learning Dynamics Forecasting Using Task Inference
Rui Wang
Robin Walters
Rose Yu
OOD
AI4TS
AI4CE
29
32
0
20 Feb 2021
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning
  for Decentralized Traffic Signal Control
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control
Liwen Zhu
Peixi Peng
Zongqing Lu
Xiangqian Wang
Yonghong Tian
18
20
0
04 Jan 2021
Adaptable Automation with Modular Deep Reinforcement Learning and Policy
  Transfer
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
26
25
0
27 Nov 2020
Distilling a Hierarchical Policy for Planning and Control via
  Representation and Reinforcement Learning
Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning
Jung-Su Ha
Young-Jin Park
Hyeok-Joo Chae
Soon-Seo Park
Han-Lim Choi
35
3
0
16 Nov 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured
  MaxEnt RL
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
MELD: Meta-Reinforcement Learning from Images via Latent State Models
MELD: Meta-Reinforcement Learning from Images via Latent State Models
Tony Zhao
Anusha Nagabandi
Kate Rakelly
Chelsea Finn
Sergey Levine
OffRL
32
36
0
26 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
18
7
0
04 Oct 2020
Towards Effective Context for Meta-Reinforcement Learning: an Approach
  based on Contrastive Learning
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Haotian Fu
Hongyao Tang
Jianye Hao
Cen Chen
Xidong Feng
Dong Li
Wulong Liu
OffRL
35
51
0
29 Sep 2020
Offline Meta-Reinforcement Learning with Advantage Weighting
Offline Meta-Reinforcement Learning with Advantage Weighting
E. Mitchell
Rafael Rafailov
Xue Bin Peng
Sergey Levine
Chelsea Finn
OffRL
38
104
0
13 Aug 2020
12
Next