Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn

Neural Information Processing Systems (NeurIPS), 2024
7 September 2024
Hongyao Tang
Glen Berseth
    OffRL
arXiv: 2409.04792

Papers citing "Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn"

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
Phuc Minh Nguyen
Chinh D. La
Duy M. Nguyen
Nitesh Chawla
Binh T. Nguyen
Khoa D. Doan
ReLM, LRM
811
1
1
02 Oct 2025
How Should We Meta-Learn Reinforcement Learning Algorithms?
Alexander David Goldie
Zilin Wang
Jakob N. Foerster
Shimon Whiteson
OffRL
278
3
0
23 Jul 2025
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
Hongyao Tang
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
Glen Berseth
200
3
0
31 May 2025
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL, OnRL
489
2
0
19 Oct 2024