v1v2v3 (latest)

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

4 February 2024

Junqiao Zhao

Pheng-Ann Heng

OffRL

ArXiv (abs)PDF HTML Github (24★)

Papers citing "Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning"

50 / 58 papers shown

PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning

388

14 Nov 2025

Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CMDPs

190

02 Oct 2025

Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space

Xinyu Zhang

Aishik Deb

Klaus Mueller

136

30 Sep 2025

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

174

16 Sep 2025

Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective

262

19 Jun 2025

Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision

1.2K

21 Apr 2025

Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers

353

06 Apr 2025

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementNeural Information Processing Systems (NeurIPS), 2024

338

15 Oct 2024

Focus On What Matters: Separated Models For Visual-Based RL GeneralizationNeural Information Processing Systems (NeurIPS), 2024

328

29 Sep 2024

Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning

Anqi Guo

548

20 May 2024

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

424

26 Dec 2023

Context Shift Reduction for Offline Meta-Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

...

Ling Li

202

07 Nov 2023

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised PretrainingInternational Conference on Learning Representations (ICLR), 2023

388

12 Oct 2023

How to Fine-tune the Model: Unified Model Shift and Model Bias Policy OptimizationNeural Information Processing Systems (NeurIPS), 2023

380

22 Sep 2023

Supervised Pretraining Can Learn In-Context Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

346

136

26 Jun 2023

ContraBAR: Contrastive Bayes-Adaptive Deep RLInternational Conference on Machine Learning (ICML), 2023

Era Choshen

Aviv Tamar

BDL OffRL

216

04 Jun 2023

Foundation Models for Decision Making: Problems, Methods, and Opportunities

Pieter Abbeel

LM&Ro OffRL LRM AI4CE

447

228

07 Mar 2023

Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization

Pierre-Luc Bacon

326

13 Sep 2022

Prompting Decision Transformer for Few-Shot Policy GeneralizationInternational Conference on Machine Learning (ICML), 2022

Ding Zhao

Chuang Gan

299

191

27 Jun 2022

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive LearningInternational Conference on Machine Learning (ICML), 2022

Haoqi Yuan

Zongqing Lu

SSL OffRL

255

21 Jun 2022

On the Effectiveness of Fine-tuning Versus Meta-reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

454

07 Jun 2022

Sergio Gomez Colmenarejo

...

651

1,023

12 May 2022

Value Penalized Q-Learning for Recommender Systems

242

15 Oct 2021

Offline Meta-Reinforcement Learning for Industrial InsertionIEEE International Conference on Robotics and Automation (ICRA), 2021

428

08 Oct 2021

RMA: Rapid Motor Adaptation for Legged Robots

1.1K

824

08 Jul 2021

Offline Meta-Reinforcement Learning with Online Self-SupervisionInternational Conference on Machine Learning (ICML), 2021

436

08 Jul 2021

Offline Reinforcement Learning as One Big Sequence Modeling ProblemNeural Information Processing Systems (NeurIPS), 2021

883

832

03 Jun 2021

Decision Transformer: Reinforcement Learning via Sequence ModelingNeural Information Processing Systems (NeurIPS), 2021

Aravind Rajeswaran

Pieter Abbeel

734

2,179

02 Jun 2021

The Power of Scale for Parameter-Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

1.6K

5,344

18 Apr 2021

Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Xiang Lisa Li

Abigail Z. Jacobs

857

5,620

01 Jan 2021

Offline Meta-Reinforcement Learning with Advantage Weighting

464

122

13 Aug 2020

CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Weituo Hao

Lawrence Carin

594

501

22 Jun 2020

Self-supervised Learning: Generative or Contrastive

Xiao Liu

Jing Zhang

895

2,100

15 Jun 2020

MOPO: Model-based Offline Policy OptimizationNeural Information Processing Systems (NeurIPS), 2020

869

906

27 May 2020

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

1.3K

2,510

04 May 2020

Shortcut Learning in Deep Neural NetworksNature Machine Intelligence (NMI), 2020

Wieland Brendel

1.5K

2,717

16 Apr 2020

Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020

Romain Lopez

Pierre Boyeau

Nir Yosef

Michael I. Jordan

Jeffrey Regier

BDL

1.8K

20,656

17 Feb 2020

Deep Reinforcement Learning for Autonomous Driving: A Survey

961

2,240

02 Feb 2020

Scaling Laws for Neural Language Models

2.2K

7,549

23 Jan 2020

Behavior Regularized Offline Reinforcement Learning

676

816

26 Nov 2019

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement LearningConference on Robot Learning (CoRL), 2019

871

1,545

24 Oct 2019

Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerJournal of machine learning research (JMLR), 2019

Sharan Narang

1.8K

25,199

23 Oct 2019

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-LearningInternational Conference on Learning Representations (ICLR), 2019

529

308

18 Oct 2019

Meta-Q-LearningInternational Conference on Learning Representations (ICLR), 2019

345

162

30 Sep 2019

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

573

385

30 Jun 2019

Stabilizing Off-Policy Q-Learning via Bootstrapping Error ReductionNeural Information Processing Systems (NeurIPS), 2019

623

1,247

03 Jun 2019

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context VariablesInternational Conference on Machine Learning (ICML), 2019

423

782

19 Mar 2019

Representation Learning with Contrastive Predictive Coding

2.0K

12,894

10 Jul 2018

1.1K

1,580

27 Mar 2018

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Tuomas Haarnoja

Aurick Zhou

Pieter Abbeel

Sergey Levine

2.9K

10,878

04 Jan 2018