On overfitting and asymptotic bias in batch reinforcement learning with partial observability

22 September 2017

Vincent François-Lavet

Papers citing "On overfitting and asymptotic bias in batch reinforcement learning with partial observability"

Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access

356

30 Sep 2025

Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows

Zhecheng Liu

Jeff D. Eldredge

395

11 Jun 2025

Agent-state based policies in POMDPs: Beyond belief-state MDPs

IEEE Conference on Decision and Control (CDC), 2024

Amit Sinha

Aditya Mahajan

317

24 Sep 2024

Temporal Knowledge-Graph Memory in a Partially Observable Environment

Taewoon Kim

Vincent François-Lavet

Michael Cochez

RALM

309

11 Aug 2024

On shallow planning under partial observability

Randy Lefebvre

Audrey Durand

OffRL

268

22 Jul 2024

Model approximation in MDPs with unbounded per-step cost

188

13 Feb 2024

Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming

Giorgio Angelotti

Caroline Ponzoni Carvalho Chanel

165

08 Feb 2024

Semi-Offline Reinforcement Learning for Optimized Text Generation

International Conference on Machine Learning (ICML), 2023

Rui Yan

256

16 Jun 2023

POMRL: No-Regret Learning-to-Plan with Increasing Horizons

194

30 Dec 2022

Rethinking Value Function Learning for Generalization in Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022

266

18 Oct 2022

Semi-Markov Offline Reinforcement Learning for Healthcare

ACM Conference on Health, Inference, and Learning (ACM CHIL), 2022

267

17 Mar 2022

Recent Advances in Reinforcement Learning in Finance

619

264

08 Dec 2021

Medical Dead-ends and Learning to Identify High-risk States and Treatments

Neural Information Processing Systems (NeurIPS), 2021

279

08 Oct 2021

Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey

Amjad Yousef Majid

Serge Saaybi

Tomas van Rietbergen

Vincent François-Lavet

R. V. Prasad

Chris Verhoeven

OffRL

301

28 Sep 2021

Approximate information state for approximate planning and reinforcement learning in partially observed systems

Journal of Machine Learning Research (JMLR), 2020

Jayakumar Subramanian

Amit Sinha

Raihan Seraj

Aditya Mahajan

410

108

17 Oct 2020

Discount Factor as a Regularizer in Reinforcement Learning

269

04 Jul 2020

Counterfactually Guided Off-policy Transfer in Clinical Settings

354

20 Jun 2020

Advantage Amplification in Slowly Evolving Latent-State Environments

International Joint Conference on Artificial Intelligence (IJCAI), 2019

205

29 May 2019

An Introduction to Deep Reinforcement Learning

Vincent François-Lavet

512

1,447

30 Nov 2018

A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

311

195

20 Jun 2018