ResearchTrend.AI
Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

15 / 465 papers shown
Assistive Tele-op: Leveraging Transformers to Collect Robotic Task Demonstrations
Henry M. Clever
Ankur Handa
H. Mazhar
Kevin Parker
Omer Shapira
Qian Wan
Yashraj S. Narang
Iretiayo Akinola
Maya Cakmak
D. Fox
15
18
0
09 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
26
38
0
06 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
23
6
0
26 Nov 2021
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSL
OffRL
30
132
0
28 Oct 2021
Transfer learning with causal counterfactual reasoning in Decision Transformers
Ayman Boustati
Hana Chockler
Daniel C. McNamee
CML
OffRL
LRM
15
9
0
27 Oct 2021
What Would Jiminy Cricket Do? Towards Agents That Behave Morally
Dan Hendrycks
Mantas Mazeika
Andy Zou
Sahil Patel
Christine Zhu
Jesus Navarro
D. Song
Bo-wen Li
Jacob Steinhardt
14
58
0
25 Oct 2021
Towards Instance-Optimal Offline Reinforcement Learning with Pessimism
Ming Yin
Yu-Xiang Wang
OffRL
24
82
0
17 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
35
33
0
12 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
838
0
12 Oct 2021
An Offline Deep Reinforcement Learning for Maintenance Decision-Making
H. Khorasgani
Haiyan Wang
Chetan Gupta
Ahmed K. Farahat
KELM
OffRL
16
5
0
28 Sep 2021
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
18
24
0
01 Sep 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
16
28
0
18 Jul 2021
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
6
17
0
09 Nov 2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Deep Dynamics Models for Learning Dexterous Manipulation
Anusha Nagabandi
K. Konolige
Sergey Levine
Vikash Kumar
143
407
0
25 Sep 2019