ResearchTrend.AI
A Survey of Meta-Reinforcement Learning

19 January 2023
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
    OOD
    OffRL

Papers citing "A Survey of Meta-Reinforcement Learning"

50 / 86 papers shown
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
Chengmin Zhou
Ville Kyrki
P. Fränti
Laura Ruotsalainen
BDL
AI4CE
27
0
0
12 May 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
W. Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
85
0
0
27 Apr 2025
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
Ahsan Bilal
Muhammad Ahmed Mohsin
Muhammad Umer
Muhammad Awais Khan Bangash
Muhammad Ali Jamshed
LLMAG
LRM
AI4CE
49
0
0
20 Apr 2025
Free Random Projection for In-Context Reinforcement Learning
Tomohiro Hayase
B. Collins
Nakamasa Inoue
14
0
0
09 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
26
0
0
06 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
I. Kolmanovsky
Dimitar Filev
44
0
0
31 Mar 2025
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Yuxiao Qu
Matthew Y. R. Yang
Amrith Rajagopal Setlur
Lewis Tunstall
E. Beeching
Ruslan Salakhutdinov
Aviral Kumar
OffRL
54
11
0
10 Mar 2025
SFO: Piloting VLM Feedback for Offline RL
Jacob Beck
OffRL
31
0
0
02 Mar 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
42
0
0
26 Feb 2025
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
115
1
0
24 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
88
0
0
24 Feb 2025
Develop AI Agents for System Engineering in Factorio
Neel Kant
37
0
0
03 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
98
1
0
03 Feb 2025
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
23
2
0
17 Nov 2024
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
23
0
0
07 Nov 2024
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
34
0
0
06 Nov 2024
Role Play: Learning Adaptive Role-Specific Strategies in Multi-Agent Interactions
Weifan Long
Wen Wen
Peng Zhai
Lihua Zhang
21
0
0
02 Nov 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
32
6
0
25 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
18
1
0
13 Oct 2024
Metalic: Meta-Learning In-Context with Protein Language Models
Jacob Beck
Shikha Surana
Manus McAuliffe
Oliver Bent
Thomas D. Barrett
Juan Jose Garau Luis
Paul Duckworth
AI4CE
28
0
0
10 Oct 2024
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI
Ahmad Elawady
Gunjan Chhablani
Ram Ramrakhya
Karmesh Yadav
Dhruv Batra
Z. Kira
Andrew Szot
OffRL
21
0
0
03 Oct 2024
Meta Reinforcement Learning Approach for Adaptive Resource Optimization in O-RAN
Fatemeh Lotfi
Fatemeh Afghah
AI4CE
18
1
0
30 Sep 2024
Personalisation via Dynamic Policy Fusion
Ajsal Shereef Palattuparambil
T. G. Karimpanal
Santu Rana
19
0
0
30 Sep 2024
Context-Based Meta Reinforcement Learning for Robust and Adaptable Peg-in-Hole Assembly Tasks
Ahmed Shokry
Walid Gomaa
Tobias Zaenker
Murad Dawood
Shady A. Maged
Mohammed I. Awad
Maren Bennewitz
OffRL
30
0
0
24 Sep 2024
Emulating Brain-like Rapid Learning in Neuromorphic Edge Computing
Kenneth Stewart
Michael Neumeier
Sumit Bam Shrestha
Garrick Orchard
Emre Neftci
27
0
0
28 Aug 2024
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
25
0
0
31 Jul 2024
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning
Hosung Lee
Sejin Kim
Seungpil Lee
Sanha Hwang
Jihwan Lee
Byung-Jun Lee
Sundong Kim
LRM
35
8
0
30 Jul 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
25
0
0
26 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
40
3
0
09 Jul 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
27
1
0
30 Jun 2024
A Bayesian Solution To The Imitation Gap
Risto Vuorio
Mattie Fellows
Cong Lu
Clémence Grislain
Shimon Whiteson
25
1
0
29 Jun 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael H. Bowling
27
0
0
27 Jun 2024
Constrained Meta Agnostic Reinforcement Learning
Karam Daaboul
Florian Kuhm
Tim Joseph
J. Marius Zoellner
26
0
0
20 Jun 2024
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
23
1
0
18 Jun 2024
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences
Takuya Hiraoka
Guanquan Wang
Takashi Onishi
Yoshimasa Tsuruoka
29
0
0
23 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
32
2
0
18 May 2024
TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters
J. Lavington
Ke Zhang
Vasileios Lioutas
Matthew Niedoba
Yunpeng Liu
...
Xiaoxuan Liang
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank D. Wood
22
5
0
07 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
4
0
22 Apr 2024
Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning
Tidiane Camaret Ndir
André Biedenkapp
Noor H. Awad
OffRL
27
1
0
15 Apr 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Rahul G. Krishnan
Vasilis Syrgkanis
33
0
0
10 Apr 2024
A Moreau Envelope Approach for LQR Meta-Policy Estimation
Ashwin Aravind
Taha Toghani
César A. Uribe
21
1
0
26 Mar 2024
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
Sai Prasanna
Karim Farid
Raghu Rajan
André Biedenkapp
45
2
0
16 Mar 2024
MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning
Zohar Rimon
Tom Jurgenson
Orr Krupnik
Gilad Adler
Aviv Tamar
24
8
0
14 Mar 2024
Deep Reinforcement Learning for Modelling Protein Complexes
Ziqi Gao
Tao Feng
Jiaxuan You
Chenyi Zi
Yan Zhou
Chen Zhang
Jia Li
34
5
0
11 Mar 2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
Jacob Beck
Matthew Jackson
Risto Vuorio
Zheng Xiong
Shimon Whiteson
OffRL
19
2
0
05 Mar 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
33
1
0
25 Feb 2024
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Gresa Shala
André Biedenkapp
Josif Grabocka
OffRL
27
4
0
09 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Pheng-Ann Heng
OffRL
26
7
0
04 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
11
6
0
02 Feb 2024
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQR
Leonardo F. Toso
Donglin Zhan
James Anderson
Han Wang
21
9
0
25 Jan 2024