ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.12255
  4. Cited By
Harnessing Structures for Value-Based Planning and Reinforcement
  Learning
v1v2v3 (latest)

Harnessing Structures for Value-Based Planning and Reinforcement Learning

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Yuzhe Yang
Guo Zhang
Zhi Xu
Dina Katabi
    OffRL
ArXiv (abs)PDFHTMLGithub (34★)

Papers citing "Harnessing Structures for Value-Based Planning and Reinforcement Learning"

29 / 29 papers shown
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
J. Obando-Ceron
Walter Mayor
Samuel Lavoie
Scott Fujimoto
Aaron Courville
Pablo Samuel Castro
196
7
0
15 Oct 2025
ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty
ADARL: Adaptive Low-Rank Structures for Robust Policy Learning under Uncertainty
Chenliang Li
Junyu Leng
Jiaxiang Li
Youbang Sun
Shixiang Chen
Shahin Shahrampour
Alfredo García
168
0
0
13 Oct 2025
NetArena: Dynamic Benchmarks for AI Agents in Network Automation
NetArena: Dynamic Benchmarks for AI Agents in Network Automation
Yajie Zhou
Jiajun Ruan
Eric S. Wang
Sadjad Fouladi
Francis Y. Yan
Kevin Hsieh
Zaoxing Liu
400
9
0
03 Jun 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
245
12
0
31 May 2025
Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming
Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming
Zhiqiang He
Zhi Liu
371
1
0
14 Apr 2025
Solving Finite-Horizon MDPs via Low-Rank Tensors
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
312
1
0
17 Jan 2025
Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning
Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning
Sergio Rozada
Hoi-To Wai
Antonio G. Marques
OffRL
272
2
0
10 Jan 2025
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise
  Matrix Estimation
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix EstimationNeural Information Processing Systems (NeurIPS), 2024
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
448
0
0
30 Oct 2024
Tensor Low-rank Approximation of Finite-horizon Value Functions
Tensor Low-rank Approximation of Finite-horizon Value Functions
Sergio Rozada
Antonio G. Marques
303
7
0
27 May 2024
Matrix Low-Rank Approximation For Policy Gradient Methods
Matrix Low-Rank Approximation For Policy Gradient Methods
Sergio Rozada
A. Marques
333
3
0
27 May 2024
No Representation, No Trust: Connecting Representation, Collapse, and
  Trust Issues in PPO
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Skander Moalla
Andrea Miele
Razvan Pascanu
Çağlar Gülçehre
385
26
0
01 May 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
456
16
0
09 Mar 2024
Directions of Curvature as an Explanation for Loss of Plasticity
Directions of Curvature as an Explanation for Loss of Plasticity
Alex Lewandowski
Haruto Tanaka
Dale Schuurmans
Marlos C. Machado
527
21
0
30 Nov 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
329
2
0
02 Mar 2023
Reinforcement Learning for Resilient Power Grids
Reinforcement Learning for Resilient Power Grids
Zhenting Zhao
Po-Yen Chen
Yucheng Jin
AI4CE
160
4
0
08 Dec 2022
Detection and Evaluation of Clusters within Sequential Data
Detection and Evaluation of Clusters within Sequential DataData mining and knowledge discovery (DMKD), 2022
Alexander Van Werde
Albert Senen-Cerda
Gianluca Kosmella
J. Sanders
244
2
0
04 Oct 2022
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement
  Learning with Latent Low-Rank Structure
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank StructureMeasurement and Modeling of Computer Systems (SIGMETRICS), 2022
Tyler Sam
Yudong Chen
Chao Yu
OffRL
496
9
0
07 Jun 2022
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement
  Learning
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement LearningIEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2022
Sergio Rozada
Santiago Paternain
A. Marques
364
21
0
21 Jan 2022
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining
  Value and Feature Predictions
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature PredictionsAAAI Conference on Artificial Intelligence (AAAI), 2022
Anthony GX-Chen
Veronica Chelu
Blake A. Richards
Joelle Pineau
TTA
245
1
0
05 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
Conditional Imitation Learning for Multi-Agent GamesIEEE/ACM International Conference on Human-Robot Interaction (HRI), 2022
Andy Shih
Stefano Ermon
Dorsa Sadigh
310
16
0
05 Jan 2022
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement
  Learning
Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement LearningInternational Conference on Distributed Artificial Intelligence (DAI), 2021
Tong Sang
Hongyao Tang
Jianye Hao
Yan Zheng
Zhaopeng Meng
155
2
0
19 Nov 2021
Low-rank State-action Value-function Approximation
Low-rank State-action Value-function ApproximationEuropean Signal Processing Conference (EUSIPCO), 2021
Sergio Rozada
Victor M. Tenorio
A. Marques
OffRL
202
11
0
18 Apr 2021
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning
  Problems in High-dimension
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension
Udari Madhushani
Biswadip Dey
Naomi Ehrich Leonard
Amit Chakraborty
OffRL
475
2
0
11 Nov 2020
Implicit Under-Parameterization Inhibits Data-Efficient Deep
  Reinforcement Learning
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2020
Aviral Kumar
Rishabh Agarwal
Dibya Ghosh
Sergey Levine
OffRL
418
153
0
27 Oct 2020
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
Sample Efficient Reinforcement Learning via Low-Rank Matrix EstimationNeural Information Processing Systems (NeurIPS), 2020
Devavrat Shah
Dogyoon Song
Zhi Xu
Yuzhe Yang
331
34
0
11 Jun 2020
Stable Reinforcement Learning with Unbounded State Space
Stable Reinforcement Learning with Unbounded State Space
Devavrat Shah
Qiaomin Xie
Zhi Xu
OffRL
213
18
0
08 Jun 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
On Reinforcement Learning for Turn-based Zero-sum Markov GamesFoundations of Data Science Conference (FODS), 2020
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
149
12
0
25 Feb 2020
On Robustness of Principal Component Regression
On Robustness of Principal Component RegressionNeural Information Processing Systems (NeurIPS), 2019
Anish Agarwal
Devavrat Shah
Dennis Shen
Dogyoon Song
973
91
0
28 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
365
9
0
14 Feb 2019
1
Page 1 of 1