On learning history based policies for controlling Markov decision processes

6 November 2022

Papers citing "On learning history based policies for controlling Markov decision processes"

3 / 3 papers shown

Title
Bridging State and History Representations: Understanding Self-Predictive RL Tianwei Ni Benjamin Eysenbach Erfan Seyedsalehi Michel Ma Clement Gehring Aditya Mahajan Pierre-Luc Bacon AI4TS AI4CE 17 20 0 17 Jan 2024
Policy Gradient Algorithms Implicitly Optimize by Continuation Adrien Bolland Gilles Louppe D. Ernst 24 3 0 11 May 2023
Approximate Information States for Worst-Case Control and Learning in Uncertain Systems Aditya Dave N. Venkatesh Andreas A. Malikopoulos 22 7 0 12 Jan 2023