arXiv:2402.02429, v3 (latest)
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
4 February 2024
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Siyue Tao
Junqiao Zhao
Pheng-Ann Heng
OffRL
Github (24★)
Papers citing
"Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning"
50 / 58 papers shown
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
Shengjie Sun
Jiafei Lyu
Runze Liu
Mengbei Yan
Bo Liu
Deheng Ye
Xiu Li
OffRL
388
0
0
14 Nov 2025
Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CMDPs
Peidong Liu
Junjiang Lin
S. Wang
Yao Xu
Haiqing Li
Xuhao Xie
Siyi Wu
Hao Li
190
0
0
02 Oct 2025
Efficient On-Policy Reinforcement Learning via Exploration of Sparse Parameter Space
Xinyu Zhang
Aishik Deb
Klaus Mueller
136
0
0
30 Sep 2025
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
Yunchuan Guan
Yu Liu
Ke Zhou
Zhiqi Shen
Jenq-Neng Hwang
Serge Belongie
Lei Li
174
1
0
16 Sep 2025
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon
Eric Elmoznino
Sarthak Mittal
Tom Marty
Tejas Kasetty
Dhanya Sridhar
Guillaume Lajoie
262
0
0
19 Jun 2025
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
1.2K
0
0
21 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
353
1
0
06 Apr 2025
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Neural Information Processing Systems (NeurIPS), 2024
Zhi Wang
Li Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
338
21
0
15 Oct 2024
Focus On What Matters: Separated Models For Visual-Based RL Generalization
Neural Information Processing Systems (NeurIPS), 2024
Di Zhang
Bowen Lv
Hai Zhang
Feifan Yang
Siyue Tao
Hang Yu
Chang Huang
Hongtu Zhou
Chen Ye
Changjun Jiang
328
8
0
29 Sep 2024
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang
Boyuan Zheng
Anqi Guo
Tianying Ji
Siyue Tao
Lanqing Li
OffRL
548
0
0
20 May 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Renzhe Zhou
Chenxiao Gao
Zongzhang Zhang
Yang Yu
OffRL
424
15
0
26 Dec 2023
Context Shift Reduction for Offline Meta-Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Yunkai Gao
Rui Zhang
Jiaming Guo
Fan Wu
Qi Yi
...
Zidong Du
Xingui Hu
Qi Guo
Ling Li
Yunji Chen
OffRL
202
28
0
07 Nov 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
International Conference on Learning Representations (ICLR), 2023
Licong Lin
Yu Bai
Song Mei
OffRL
388
72
0
12 Oct 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Neural Information Processing Systems (NeurIPS), 2023
Hai Zhang
Hang Yu
Siyue Tao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
380
13
0
22 Sep 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
346
136
0
26 Jun 2023
ContraBAR: Contrastive Bayes-Adaptive Deep RL
International Conference on Machine Learning (ICML), 2023
Era Choshen
Aviv Tamar
BDL
OffRL
216
10
0
04 Jun 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
447
228
0
07 Mar 2023
Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization
Leo Feng
Padideh Nouri
Aneri Muni
Yoshua Bengio
Pierre-Luc Bacon
326
4
0
13 Sep 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
International Conference on Machine Learning (ICML), 2022
Mengdi Xu
Songlin Yang
Shun Zhang
Yuchen Lu
Ding Zhao
J. Tenenbaum
Chuang Gan
OffRL
299
191
0
27 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
International Conference on Machine Learning (ICML), 2022
Haoqi Yuan
Zongqing Lu
SSL
OffRL
255
53
0
21 Jun 2022
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Mandi Zhao
Pieter Abbeel
Stephen James
OffRL
454
39
0
07 Jun 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
651
1,023
0
12 May 2022
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
242
22
0
15 Oct 2021
Offline Meta-Reinforcement Learning for Industrial Insertion
IEEE International Conference on Robotics and Automation (ICRA), 2021
Tony Zhao
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
N. Heess
Jonathan Scholz
S. Schaal
Sergey Levine
OffRL
OnRL
428
91
0
08 Oct 2021
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
1.1K
824
0
08 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
International Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
436
78
0
08 Jul 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Neural Information Processing Systems (NeurIPS), 2021
Michael Janner
Qiyang Li
Sergey Levine
OffRL
883
832
0
03 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Neural Information Processing Systems (NeurIPS), 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
734
2,179
0
02 Jun 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
1.6K
5,344
0
18 Apr 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Percy Liang
857
5,620
0
01 Jan 2021
Offline Meta-Reinforcement Learning with Advantage Weighting
E. Mitchell
Rafael Rafailov
Xue Bin Peng
Sergey Levine
Chelsea Finn
OffRL
464
122
0
13 Aug 2020
CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
Pengyu Cheng
Weituo Hao
Shuyang Dai
Jiachang Liu
Zhe Gan
Lawrence Carin
VLM
594
501
0
22 Jun 2020
Self-supervised Learning: Generative or Contrastive
Xiao Liu
Fanjin Zhang
Zhenyu Hou
Zhaoyu Wang
Li Mian
Jing Zhang
Jie Tang
SSL
895
2,100
0
15 Jun 2020
MOPO: Model-based Offline Policy Optimization
Neural Information Processing Systems (NeurIPS), 2020
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
869
906
0
27 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
1.3K
2,510
0
04 May 2020
Shortcut Learning in Deep Neural Networks
Nature Machine Intelligence (NMI), 2020
Robert Geirhos
J. Jacobsen
Claudio Michaelis
R. Zemel
Wieland Brendel
Matthias Bethge
Felix Wichmann
1.5K
2,717
0
16 Apr 2020
Decision-Making with Auto-Encoding Variational Bayes
Neural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.8K
20,656
0
17 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
961
2,240
0
02 Feb 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
2.2K
7,549
0
23 Jan 2020
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
676
816
0
26 Nov 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Conference on Robot Learning (CoRL), 2019
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
871
1,545
0
24 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of Machine Learning Research (JMLR), 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
1.8K
25,199
0
23 Oct 2019
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
International Conference on Learning Representations (ICLR), 2019
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
529
308
0
18 Oct 2019
Meta-Q-Learning
International Conference on Learning Representations (ICLR), 2019
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
345
162
0
30 Sep 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
573
385
0
30 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Neural Information Processing Systems (NeurIPS), 2019
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
623
1,247
0
03 Jun 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
International Conference on Machine Learning (ICML), 2019
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
423
782
0
19 Mar 2019
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
2.0K
12,894
0
10 Jul 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
1.1K
1,580
1
27 Mar 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
2.9K
10,878
0
04 Jan 2018