ResearchTrend.AI
On learning history based policies for controlling Markov decision processes

6 November 2022
Gandharv Patil
Aditya Mahajan
Doina Precup
    OffRL

Papers citing "On learning history based policies for controlling Markov decision processes"

3 / 3 papers shown
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
17
20
0
17 Jan 2024
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
24
3
0
11 May 2023
Approximate Information States for Worst-Case Control and Learning in Uncertain Systems
Aditya Dave
N. Venkatesh
Andreas A. Malikopoulos
22
7
0
12 Jan 2023