ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.02429
  4. Cited By
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
v1v2v3 (latest)

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

4 February 2024
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Siyue Tao
Junqiao Zhao
Pheng-Ann Heng
    OffRL
ArXiv (abs)PDFHTMLGithub (24★)

Papers citing "Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning"

50 / 58 papers shown
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
Shengjie Sun
Jiafei Lyu
Runze Liu
Mengbei Yan
Bo Liu
Deheng Ye
Xiu Li
OffRL
388
0
0
14 Nov 2025
Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CMDPs
Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CMDPs
Peidong Liu
Junjiang Lin
S. Wang
Yao Xu
Haiqing Li
Xuhao Xie
Siyi Wu
Hao Li
190
0
0
02 Oct 2025
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Xinyu Zhang
Aishik Deb
Klaus Mueller
136
0
0
30 Sep 2025
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Yunchuan Guan
Yu Liu
Ke Zhou
Zhiqi Shen
Jenq-Neng Hwang
Serge Belongie
Lei Li
174
1
0
16 Sep 2025
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon
Eric Elmoznino
Sarthak Mittal
Tom Marty
Tejas Kasetty
Dhanya Sridhar
Guillaume Lajoie
262
0
0
19 Jun 2025
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
1.2K
0
0
21 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
353
1
0
06 Apr 2025
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World
  Model Disentanglement
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementNeural Information Processing Systems (NeurIPS), 2024
Zhi Wang
Li Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
338
21
0
15 Oct 2024
Focus On What Matters: Separated Models For Visual-Based RL
  Generalization
Focus On What Matters: Separated Models For Visual-Based RL GeneralizationNeural Information Processing Systems (NeurIPS), 2024
Di Zhang
Bowen Lv
Hai Zhang
Feifan Yang
Siyue Tao
Hang Yu
Chang Huang
Hongtu Zhou
Chen Ye
Changjun Jiang
328
8
0
29 Sep 2024
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang
Boyuan Zheng
Anqi Guo
Tianying Ji
Anqi Guo
Siyue Tao
Lanqing Li
OffRL
548
0
0
20 May 2024
Generalizable Task Representation Learning for Offline
  Meta-Reinforcement Learning with Data Limitations
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Renzhe Zhou
Chenxiao Gao
Zongzhang Zhang
Yang Yu
OffRL
424
15
0
26 Dec 2023
Context Shift Reduction for Offline Meta-Reinforcement Learning
Context Shift Reduction for Offline Meta-Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Yunkai Gao
Rui Zhang
Jiaming Guo
Fan Wu
Qi Yi
...
Zidong Du
Xingui Hu
Qi Guo
Ling Li
Yunji Chen
OffRL
202
28
0
07 Nov 2023
Transformers as Decision Makers: Provable In-Context Reinforcement
  Learning via Supervised Pretraining
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised PretrainingInternational Conference on Learning Representations (ICLR), 2023
Licong Lin
Yu Bai
Song Mei
OffRL
388
72
0
12 Oct 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy
  Optimization
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy OptimizationNeural Information Processing Systems (NeurIPS), 2023
Hai Zhang
Hang Yu
Siyue Tao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
380
13
0
22 Sep 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Supervised Pretraining Can Learn In-Context Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
346
136
0
26 Jun 2023
ContraBAR: Contrastive Bayes-Adaptive Deep RL
ContraBAR: Contrastive Bayes-Adaptive Deep RLInternational Conference on Machine Learning (ICML), 2023
Era Choshen
Aviv Tamar
BDLOffRL
216
10
0
04 Jun 2023
Foundation Models for Decision Making: Problems, Methods, and
  Opportunities
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&RoOffRLLRMAI4CE
447
228
0
07 Mar 2023
Designing Biological Sequences via Meta-Reinforcement Learning and
  Bayesian Optimization
Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization
Leo Feng
Padideh Nouri
Aneri Muni
Yoshua Bengio
Pierre-Luc Bacon
326
4
0
13 Sep 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
Prompting Decision Transformer for Few-Shot Policy GeneralizationInternational Conference on Machine Learning (ICML), 2022
Mengdi Xu
Songlin Yang
Shun Zhang
Yuchen Lu
Ding Zhao
J. Tenenbaum
Chuang Gan
OffRL
299
191
0
27 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via
  Contrastive Learning
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive LearningInternational Conference on Machine Learning (ICML), 2022
Haoqi Yuan
Zongqing Lu
SSLOffRL
255
53
0
21 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
On the Effectiveness of Fine-tuning Versus Meta-reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
454
39
0
07 Jun 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&RoLLMAGAI4CE
651
1,023
0
12 May 2022
Value Penalized Q-Learning for Recommender Systems
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
242
22
0
15 Oct 2021
Offline Meta-Reinforcement Learning for Industrial Insertion
Offline Meta-Reinforcement Learning for Industrial InsertionIEEE International Conference on Robotics and Automation (ICRA), 2021
Tony Zhao
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
N. Heess
Jonathan Scholz
S. Schaal
Sergey Levine
OffRLOnRL
428
91
0
08 Oct 2021
RMA: Rapid Motor Adaptation for Legged Robots
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
1.1K
824
0
08 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Offline Meta-Reinforcement Learning with Online Self-SupervisionInternational Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
436
78
0
08 Jul 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Offline Reinforcement Learning as One Big Sequence Modeling ProblemNeural Information Processing Systems (NeurIPS), 2021
Michael Janner
Qiyang Li
Sergey Levine
OffRL
883
832
0
03 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence ModelingNeural Information Processing Systems (NeurIPS), 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
734
2,179
0
02 Jun 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
The Power of Scale for Parameter-Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
1.6K
5,344
0
18 Apr 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Abigail Z. Jacobs
857
5,620
0
01 Jan 2021
Offline Meta-Reinforcement Learning with Advantage Weighting
Offline Meta-Reinforcement Learning with Advantage Weighting
E. Mitchell
Rafael Rafailov
Xue Bin Peng
Sergey Levine
Chelsea Finn
OffRL
464
122
0
13 Aug 2020
CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
Pengyu Cheng
Weituo Hao
Shuyang Dai
Jiachang Liu
Zhe Gan
Lawrence Carin
VLM
594
501
0
22 Jun 2020
Self-supervised Learning: Generative or Contrastive
Self-supervised Learning: Generative or Contrastive
Xiao Liu
Fanjin Zhang
Zhenyu Hou
Zhaoyu Wang
Li Mian
Jing Zhang
Jie Tang
SSL
895
2,100
0
15 Jun 2020
MOPO: Model-based Offline Policy Optimization
MOPO: Model-based Offline Policy OptimizationNeural Information Processing Systems (NeurIPS), 2020
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
869
906
0
27 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRLGP
1.3K
2,510
0
04 May 2020
Shortcut Learning in Deep Neural Networks
Shortcut Learning in Deep Neural NetworksNature Machine Intelligence (NMI), 2020
Robert Geirhos
J. Jacobsen
Claudio Michaelis
R. Zemel
Wieland Brendel
Matthias Bethge
Felix Wichmann
1.5K
2,717
0
16 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.8K
20,656
0
17 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
961
2,240
0
02 Feb 2020
Scaling Laws for Neural Language Models
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
2.2K
7,549
0
23 Jan 2020
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
676
816
0
26 Nov 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta
  Reinforcement Learning
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement LearningConference on Robot Learning (CoRL), 2019
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
871
1,545
0
24 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerJournal of machine learning research (JMLR), 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
1.8K
25,199
0
23 Oct 2019
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-LearningInternational Conference on Learning Representations (ICLR), 2019
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
529
308
0
18 Oct 2019
Meta-Q-Learning
Meta-Q-LearningInternational Conference on Learning Representations (ICLR), 2019
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
345
162
0
30 Sep 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human
  Preferences in Dialog
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
573
385
0
30 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Stabilizing Off-Policy Q-Learning via Bootstrapping Error ReductionNeural Information Processing Systems (NeurIPS), 2019
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRLOnRL
623
1,247
0
03 Jun 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic
  Context Variables
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context VariablesInternational Conference on Machine Learning (ICML), 2019
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
423
782
0
19 Mar 2019
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRLSSL
2.0K
12,894
0
10 Jul 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
1.1K
1,580
1
27 Mar 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
2.9K
10,878
0
04 Jan 2018
12
Next
Page 1 of 2