ResearchTrend.AI
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents

15 October 2023
Jake Grigsby
Linxi Fan
Yuke Zhu
    OffRL
    LM&Ro

Papers citing "AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents"

15 / 15 papers shown
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
66
0
0
21 Apr 2025
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
Robby Costales
Stefanos Nikolaidis
AI4CE
23
0
0
07 Nov 2024
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
Ilya Zisman
Alexander Nikulin
Nikita Lyubaykin
Andrei Polubarov
Igor Kiselev
Vladislav Kurenkov
OffRL
44
1
0
04 Nov 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Lyna Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
33
6
0
15 Oct 2024
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Jianliang He
Siyu Chen
Fengzhuo Zhang
Zhuoran Yang
LM&Ro
LLMAG
40
2
0
30 May 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan Luo
Zuolin Tu
Zefang Huang
Yang Yu
OffRL
32
0
0
24 May 2024
Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai
Tatiana Likhomanenko
Etai Littwin
Dan Busbridge
Jason Ramapuram
Yizhe Zhang
Jiatao Gu
J. Susskind
AAML
38
64
0
11 Mar 2023
Structured State Space Models for In-Context Reinforcement Learning
Chris Lu
Yannick Schroecker
Albert Gu
Emilio Parisotto
Jakob N. Foerster
Satinder Singh
Feryal M. P. Behbahani
AI4TS
84
81
0
07 Mar 2023
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
OffRL
133
27
0
31 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. D'Oro
Pierre-Luc Bacon
Aaron C. Courville
OnRL
85
178
0
16 May 2022
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
270
108
0
13 Jul 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
58
14
0
23 Apr 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,944
0
04 May 2020
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
243
11,659
0
09 Mar 2017