On overfitting and asymptotic bias in batch reinforcement learning with partial observability

22 September 2017
Vincent François-Lavet
Guillaume Rabusseau
Joelle Pineau
D. Ernst
R. Fonteneau
    OffRL
arXiv: 1709.07796

Papers citing "On overfitting and asymptotic bias in batch reinforcement learning with partial observability"

19 / 19 papers shown
Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows
Zhecheng Liu
Jeff D. Eldredge
57
0
0
11 Jun 2025
Agent-state based policies in POMDPs: Beyond belief-state MDPs
Amit Sinha
Aditya Mahajan
53
3
0
24 Sep 2024
Leveraging Knowledge Graph-Based Human-Like Memory Systems to Solve Partially Observable Markov Decision Processes
Taewoon Kim
Vincent François-Lavet
Michael Cochez
RALM
125
2
0
11 Aug 2024
On shallow planning under partial observability
Randy Lefebvre
Audrey Durand
OffRL
71
1
0
22 Jul 2024
Model approximation in MDPs with unbounded per-step cost
Berk Bozkurt
Aditya Mahajan
A. Nayyar
Ouyang Yi
25
2
0
13 Feb 2024
Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming
Giorgio Angelotti
Caroline Ponzoni Carvalho Chanel
Adam H. M. Pinto
Christophe Lounis
C. Chauffaut
Nicolas Drougard
OffRL
29
2
0
08 Feb 2024
Semi-Offline Reinforcement Learning for Optimized Text Generation
Changyu Chen
Xiting Wang
Yiqiao Jin
Victor Ye Dong
Li Dong
Jie Cao
Yi Liu
Rui Yan
OffRL
81
15
0
16 Jun 2023
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
Khimya Khetarpal
Claire Vernade
Brendan O'Donoghue
Satinder Singh
Tom Zahavy
OffRL
68
0
0
30 Dec 2022
Rethinking Value Function Learning for Generalization in Reinforcement Learning
Seungyong Moon
JunYeong Lee
Hyun Oh Song
OOD, OffRL
67
16
0
18 Oct 2022
Semi-Markov Offline Reinforcement Learning for Healthcare
Mehdi Fatemi
Mary Wu
J. Petch
Walter Nelson
S. Connolly
Alexander Benz
A. Carnicelli
Marzyeh Ghassemi
OffRL
66
14
0
17 Mar 2022
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
126
180
0
08 Dec 2021
Medical Dead-ends and Learning to Identify High-risk States and Treatments
Mehdi Fatemi
Taylor W. Killian
J. Subramanian
Marzyeh Ghassemi
OffRL
94
40
0
08 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
135
60
0
28 Sep 2021
Approximate information state for approximate planning and reinforcement learning in partially observed systems
Jayakumar Subramanian
Amit Sinha
Raihan Seraj
Aditya Mahajan
155
86
0
17 Oct 2020
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
97
72
0
04 Jul 2020
Counterfactually Guided Off-policy Transfer in Clinical Settings
Taylor W. Killian
Marzyeh Ghassemi
Shalmali Joshi
CML, OffRL, OOD
66
12
0
20 Jun 2020
Advantage Amplification in Slowly Evolving Latent-State Environments
Martin Mladenov
Ofer Meshi
Jayden Ooi
Dale Schuurmans
Craig Boutilier
OffRL
89
9
0
29 May 2019
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL, AI4CE
173
1,279
0
30 Nov 2018
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL, OffRL
113
180
0
20 Jun 2018