Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.12255
Cited By
v1
v2
v3 (latest)
Harnessing Structures for Value-Based Planning and Reinforcement Learning
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Yuzhe Yang
Guo Zhang
Zhi Xu
Dina Katabi
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (34★)
Papers citing
"Harnessing Structures for Value-Based Planning and Reinforcement Learning"
29 / 29 papers shown
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
J. Obando-Ceron
Walter Mayor
Samuel Lavoie
Scott Fujimoto
Aaron Courville
Pablo Samuel Castro
196
7
0
15 Oct 2025
ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty
Chenliang Li
Junyu Leng
Jiaxiang Li
Youbang Sun
Shixiang Chen
Shahin Shahrampour
Alfredo García
168
0
0
13 Oct 2025
NetArena: Dynamic Benchmarks for AI Agents in Network Automation
Yajie Zhou
Jiajun Ruan
Eric S. Wang
Sadjad Fouladi
Francis Y. Yan
Kevin Hsieh
Zaoxing Liu
400
9
0
03 Jun 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
245
12
0
31 May 2025
Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming
Zhiqiang He
Zhi Liu
371
1
0
14 Apr 2025
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
312
1
0
17 Jan 2025
Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning
Sergio Rozada
Hoi-To Wai
Antonio G. Marques
OffRL
272
2
0
10 Jan 2025
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
Neural Information Processing Systems (NeurIPS), 2024
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
448
0
0
30 Oct 2024
Tensor Low-rank Approximation of Finite-horizon Value Functions
Sergio Rozada
Antonio G. Marques
303
7
0
27 May 2024
Matrix Low-Rank Approximation For Policy Gradient Methods
Sergio Rozada
A. Marques
333
3
0
27 May 2024
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Skander Moalla
Andrea Miele
Razvan Pascanu
Çağlar Gülçehre
385
26
0
01 May 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
456
16
0
09 Mar 2024
Directions of Curvature as an Explanation for Loss of Plasticity
Alex Lewandowski
Haruto Tanaka
Dale Schuurmans
Marlos C. Machado
527
21
0
30 Nov 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
329
2
0
02 Mar 2023
Reinforcement Learning for Resilient Power Grids
Zhenting Zhao
Po-Yen Chen
Yucheng Jin
AI4CE
160
4
0
08 Dec 2022
Detection and Evaluation of Clusters within Sequential Data
Data mining and knowledge discovery (DMKD), 2022
Alexander Van Werde
Albert Senen-Cerda
Gianluca Kosmella
J. Sanders
244
2
0
04 Oct 2022
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure
Measurement and Modeling of Computer Systems (SIGMETRICS), 2022
Tyler Sam
Yudong Chen
Chao Yu
OffRL
496
9
0
07 Jun 2022
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2022
Sergio Rozada
Santiago Paternain
A. Marques
364
21
0
21 Jan 2022
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
AAAI Conference on Artificial Intelligence (AAAI), 2022
Anthony GX-Chen
Veronica Chelu
Blake A. Richards
Joelle Pineau
TTA
245
1
0
05 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2022
Andy Shih
Stefano Ermon
Dorsa Sadigh
310
16
0
05 Jan 2022
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning
International Conference on Distributed Artificial Intelligence (DAI), 2021
Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
155
2
0
19 Nov 2021
Low-rank State-action Value-function Approximation
European Signal Processing Conference (EUSIPCO), 2021
Sergio Rozada
Victor M. Tenorio
A. Marques
OffRL
202
11
0
18 Apr 2021
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension
Udari Madhushani
Biswadip Dey
Naomi Ehrich Leonard
Amit Chakraborty
OffRL
475
2
0
11 Nov 2020
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
International Conference on Learning Representations (ICLR), 2020
Aviral Kumar
Rishabh Agarwal
Dibya Ghosh
Sergey Levine
OffRL
418
153
0
27 Oct 2020
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
Neural Information Processing Systems (NeurIPS), 2020
Devavrat Shah
Dogyoon Song
Zhi Xu
Yuzhe Yang
331
34
0
11 Jun 2020
Stable Reinforcement Learning with Unbounded State Space
Devavrat Shah
Qiaomin Xie
Zhi Xu
OffRL
213
18
0
08 Jun 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Foundations of Data Science Conference (FODS), 2020
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
149
12
0
25 Feb 2020
On Robustness of Principal Component Regression
Neural Information Processing Systems (NeurIPS), 2019
Anish Agarwal
Devavrat Shah
Dennis Shen
Dogyoon Song
973
91
0
28 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
365
9
0
14 Feb 2019
1
Page 1 of 1