Harnessing Structures for Value-Based Planning and Reinforcement Learning

International Conference on Learning Representations (ICLR), 2019

26 September 2019

Papers citing "Harnessing Structures for Value-Based Planning and Reinforcement Learning"

29 / 29 papers shown

Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents

196

15 Oct 2025

ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty

168

13 Oct 2025

NetArena: Dynamic Benchmarks for AI Agents in Network Automation

400

03 Jun 2025

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

245

31 May 2025

Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming

Zhiqiang He

Zhi Liu

371

14 Apr 2025

Solving Finite-Horizon MDPs via Low-Rank Tensors

Sergio Rozada

Jose Luis Orejuela

Antonio G. Marques

312

17 Jan 2025

Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning

272

10 Jan 2025

Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation

Neural Information Processing Systems (NeurIPS), 2024

Stefan Stojanovic

Yassir Jedra

Alexandre Proutiere

448

30 Oct 2024

Tensor Low-rank Approximation of Finite-horizon Value Functions

Sergio Rozada

Antonio G. Marques

303

27 May 2024

Matrix Low-Rank Approximation For Policy Gradient Methods

Sergio Rozada

A. Marques

333

27 May 2024

No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO

385

01 May 2024

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence

Marcel Hussing

C. Voelcker

Igor Gilitschenski

Amir-massoud Farahmand

Eric Eaton

456

09 Mar 2024

Directions of Curvature as an Explanation for Loss of Plasticity

527

30 Nov 2023

The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting

Hongyao Tang

Hao Fei

Jianye Hao

329

02 Mar 2023

Reinforcement Learning for Resilient Power Grids

160

08 Dec 2022

Detection and Evaluation of Clusters within Sequential Data

Data Mining and Knowledge Discovery (DMKD), 2022

244

04 Oct 2022

Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure

Measurement and Modeling of Computer Systems (SIGMETRICS), 2022

496

07 Jun 2022

Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning

IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2022

Sergio Rozada

Santiago Paternain

A. Marques

364

21 Jan 2022

A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions

AAAI Conference on Artificial Intelligence (AAAI), 2022

245

05 Jan 2022

Conditional Imitation Learning for Multi-Agent Games

IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2022

Andy Shih

Stefano Ermon

Dorsa Sadigh

310

05 Jan 2022

Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning

International Conference on Distributed Artificial Intelligence (DAI), 2021

Tong Sang

Hongyao Tang

Jianye Hao

Yan Zheng

Zhaopeng Meng

155

19 Nov 2021

Low-rank State-action Value-function Approximation

European Signal Processing Conference (EUSIPCO), 2021

202

18 Apr 2021

On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension

475

11 Nov 2020

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

International Conference on Learning Representations (ICLR), 2020

418

153

27 Oct 2020

Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation

Neural Information Processing Systems (NeurIPS), 2020

331

11 Jun 2020

Stable Reinforcement Learning with Unbounded State Space

213

08 Jun 2020

On Reinforcement Learning for Turn-based Zero-sum Markov Games

Foundations of Data Science Conference (FODS), 2020

149

25 Feb 2020

On Robustness of Principal Component Regression

Neural Information Processing Systems (NeurIPS), 2019

973

28 Feb 2019

Non-Asymptotic Analysis of Monte Carlo Tree Search

Devavrat Shah

Qiaomin Xie

Zhi Xu

365

14 Feb 2019