2409.04792
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Neural Information Processing Systems (NeurIPS), 2024
7 September 2024
Hongyao Tang
Glen Berseth
OffRL
Papers citing
"Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn"
4 / 4 papers shown
The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
Phuc Minh Nguyen
Chinh D. La
Duy M. Nguyen
Nitesh Chawla
Binh T. Nguyen
Khoa D. Doan
ReLM
LRM
811
1
1
02 Oct 2025
How Should We Meta-Learn Reinforcement Learning Algorithms?
Alexander David Goldie
Zilin Wang
Jakob N. Foerster
Shimon Whiteson
OffRL
278
3
0
23 Jul 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
200
3
0
31 May 2025
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
489
2
0
19 Oct 2024