ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.14379
  4. Cited By
Offline Reinforcement Learning Hands-On

Offline Reinforcement Learning Hands-On

29 November 2020
L. Monier
Jakub Kmec
Alexandre Laterre
Thomas Pierrot
Valentin Courgeau
Olivier Sigaud
Karim Beguir
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning Hands-On"

7 / 7 papers shown
Title
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs
  and Practical Solutions
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions
Yicheng Luo
Jackie Kay
Edward Grefenstette
M. Deisenroth
OffRL
OnRL
13
15
0
30 Mar 2023
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed
  Distribution Matching
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching
Lantao Yu
Tianhe Yu
Jiaming Song
W. Neiswanger
Stefano Ermon
OffRL
68
16
0
05 Mar 2023
A Dataset Perspective on Offline Reinforcement Learning
A Dataset Perspective on Offline Reinforcement Learning
Kajetan Schweighofer
Andreas Radler
Marius-Constantin Dinu
M. Hofmarcher
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
27
17
0
08 Nov 2021
On Multi-objective Policy Optimization as a Tool for Reinforcement
  Learning: Case Studies in Offline RL and Finetuning
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning
A. Abdolmaleki
Sandy H. Huang
Giulia Vezzani
Bobak Shahriari
Jost Tobias Springenberg
...
András Gyorgy
Csaba Szepesvári
R. Hadsell
N. Heess
Martin Riedmiller
OffRL
15
5
0
15 Jun 2021
Offline Policy Comparison under Limited Historical Agent-Environment
  Interactions
Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Anton Dereventsov
Joseph Daws
Clayton Webster
OffRL
31
3
0
07 Jun 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline
  and Online RL
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
1