Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.07245
Cited By
Meta-Reinforcement Learning of Structured Exploration Strategies
20 February 2018
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Meta-Reinforcement Learning of Structured Exploration Strategies"
50 / 206 papers shown
An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL
Xingtu Liu
OOD
170
0
0
27 Oct 2025
Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
Kaizhe Hu
Haochen Shi
Yao He
Weizhuo Wang
Changliu Liu
Shuran Song
OnRL
277
1
0
17 Aug 2025
Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning
Wenlong Liang
Rui Zhou
Yang Ma
Bing Zhang
Songlin Li
Yijia Liao
Ping Kuang
LM&Ro
3DV
AI4CE
169
8
0
14 Aug 2025
Self-Adapting Language Models
Adam Zweiger
Jyothish Pari
Han Guo
Ekin Akyürek
Yoon Kim
Pulkit Agrawal
KELM
LRM
604
16
0
12 Jun 2025
e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs
Amrith Rajagopal Setlur
Matthew Y. R. Yang
Charlie Snell
Jeremy Greer
Ian Wu
Virginia Smith
Max Simchowitz
Aviral Kumar
LRM
267
26
0
10 Jun 2025
Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning
IEEE Robotics and Automation Letters (RA-L), 2024
S. E. Ada
Emre Ugur
BDL
178
4
0
04 Jun 2025
Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition
Bo Yue
Shuqi Guo
Kaiyu Hu
Chujiao Wang
Benyou Wang
Kui Jia
Guiliang Liu
LRM
299
1
0
16 May 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
453
8
0
27 Apr 2025
Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery
Alireza Fathalizadeh
Roozbeh Razavi-Far
CLL
207
0
0
11 Apr 2025
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Yuxiao Qu
Matthew Y. R. Yang
Amrith Rajagopal Setlur
Lewis Tunstall
E. Beeching
Ruslan Salakhutdinov
Aviral Kumar
OffRL
410
84
0
10 Mar 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
409
2
0
26 Feb 2025
TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments
Chenyang Qi
Huiping Li
Panfeng Huang
OffRL
181
0
0
13 Jan 2025
Neuromodulated Meta-Learning
Wenwen Qiang
Huijie Guo
Jingyao Wang
Jiangmeng Li
Changwen Zheng
Hui Xiong
Gang Hua
364
1
0
11 Nov 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
225
3
0
03 Oct 2024
Lifelong Reinforcement Learning via Neuromodulation
Sebastian Lee
Samuel Liebana Garcia
Claudia Clopath
Will Dabney
160
3
0
15 Aug 2024
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
281
1
0
31 Jul 2024
Constrained Meta Agnostic Reinforcement Learning
Karam Daaboul
Florian Kuhm
Tim Joseph
J. Marius Zoellner
244
0
0
20 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert D. Nowak
624
5
0
07 Jun 2024
Perturbing the Gradient for Alleviating Meta Overfitting
Manas Gogoi
Sambhavi Tiwari
Shekhar Verma
216
1
0
20 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
188
2
0
01 May 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Neural Information Processing Systems (NeurIPS), 2024
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Fahad Razak
Vasilis Syrgkanis
433
2
0
10 Apr 2024
Deep Reinforcement Learning for Modelling Protein Complexes
International Conference on Learning Representations (ICLR), 2024
Ziqi Gao
Tao Feng
Jiaxuan You
Chenyi Zi
Yan Zhou
Chen Zhang
Jia Li
247
7
0
11 Mar 2024
Large Language Models as Agents in Two-Player Games
Yang Liu
Yang Liu
Hang Li
LLMAG
178
7
0
12 Feb 2024
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Youssef Attia El Hili
Haitham Bou-Ammar
Jun Wang
233
27
0
22 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Conference on Learning for Dynamics & Control (L4DC), 2023
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
240
1
0
09 Dec 2023
Large Language Models for Robotics: A Survey
Fanlong Zeng
Wensheng Gan
Zezheng Huai
Lichao Sun
Hechang Chen
Yongheng Wang
Ning Liu
Philip S. Yu
LM&Ro
369
201
0
13 Nov 2023
Dream to Adapt: Meta Reinforcement Learning by Latent Context Imagination and MDP Imagination
Lu Wen
Songan Zhang
E. Tseng
Huei Peng
VLM
OffRL
241
8
0
11 Nov 2023
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement Learning Adaptation
Maxwell J. Jacobson
Rohan Menon
John Zeng
Yexiang Xue
279
0
0
07 Nov 2023
Meta-Learning Strategies through Value Maximization in Neural Networks
Rodrigo Carrasco-Davis
Javier Masís
Andrew M. Saxe
222
3
0
30 Oct 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
International Conference on Learning Representations (ICLR), 2023
Licong Lin
Yu Bai
Song Mei
OffRL
329
66
0
12 Oct 2023
Making Scalable Meta Learning Practical
Neural Information Processing Systems (NeurIPS), 2023
Sang Keun Choe
Sanket Vaibhav Mehta
Hwijeen Ahn
Willie Neiswanger
Pengtao Xie
Emma Strubell
Eric Xing
309
20
0
09 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
354
1
0
06 Oct 2023
AdaptNet: Policy Adaptation for Physics-Based Character Control
ACM Transactions on Graphics (TOG), 2023
Pei Xu
Kaixiang Xie
Sheldon Andrews
P. Kry
Michael Neff
Morgan McGuire
Ioannis Karamouzas
Victor Zordan
TTA
442
28
0
30 Sep 2023
In search of dispersed memories: Generative diffusion models are associative memory networks
Entropy (Entropy), 2023
Luca Ambrogioni
DiffM
251
42
0
29 Sep 2023
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
174
0
0
22 Sep 2023
Toward Discretization-Consistent Closure Schemes for Large Eddy Simulation Using Reinforcement Learning
The Physics of Fluids (Phys. Fluids), 2023
Andrea Beck
Marius Kurz
AI4CE
235
24
0
12 Sep 2023
Is Meta-Learning the Right Approach for the Cold-Start Problem in Recommender Systems?
Davide Buffelli
Ashish Gupta
Agnieszka Strzalka
Vassilis Plachouras
OffRL
LRM
179
3
0
16 Aug 2023
Causal Reinforcement Learning: A Survey
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
CML
LRM
345
32
0
04 Jul 2023
RL
3
^3
3
: Boosting Meta Reinforcement Learning via RL inside RL
2
^2
2
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
366
0
0
28 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
312
119
0
26 Jun 2023
Acceleration in Policy Optimization
Veronica Chelu
Tom Zahavy
A. Guez
Doina Precup
Sebastian Flennerhag
325
0
0
18 Jun 2023
Meta Generative Flow Networks with Personalization for Task-Specific Adaptation
Information Sciences (Inf. Sci.), 2023
Xinyuan Ji
Xu Zhang
Wei Xi
Haozhi Wang
Olga Gadyatskaya
Yinchuan Li
175
1
0
16 Jun 2023
Fast Context Adaptation in Cost-Aware Continual Learning
IEEE Transactions on Machine Learning in Communications and Networking (IEEE TMLCN), 2023
Seyyidahmed Lahmer
Federico Mason
Federico Chiariotti
Andrea Zanella
175
3
0
06 Jun 2023
Learning Embeddings for Sequential Tasks Using Population of Agents
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Mridul Mahajan
Georgios Tzannetos
Goran Radanović
Adish Singla
FedML
259
1
0
05 Jun 2023
Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Mingyang Wang
Zhenshan Bing
Xiangtong Yao
Shuai Wang
Hang Su
Chenguang Yang
Kai Huang
Alois C. Knoll
SSL
OOD
259
19
0
29 Apr 2023
Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics
International Conference on Learning Representations (ICLR), 2023
Kuo-Hao Zeng
Luca Weihs
Roozbeh Mottaghi
Ali Farhadi
174
3
0
24 Apr 2023
SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with Meta-Learning
ACM International Conference on Bioinformatics, Computational Biology and Biomedicine (ACM-BCB), 2023
Zifeng Wang
Cao Xiao
Jimeng Sun
152
27
0
07 Apr 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Applied Informatics (AI), 2023
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
315
20
0
16 Feb 2023
Meta-Reinforcement Learning via Exploratory Task Clustering
AAAI Conference on Artificial Intelligence (AAAI), 2023
Zhendong Chu
Hongning Wang
OffRL
183
9
0
15 Feb 2023
Train Hard, Fight Easy: Robust Meta Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Ido Greenberg
Shie Mannor
Gal Chechik
E. Meirom
OffRL
OOD
265
10
0
26 Jan 2023
1
2
3
4
5
Next