Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2409.04792
Cited By

Improving Deep Reinforcement Learning by Reducing the Chain Effect of
Value and Policy Churn

Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn

Neural Information Processing Systems (NeurIPS), 2024

7 September 2024

ArXiv (abs)PDF HTML

Papers citing "Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn"

4 / 4 papers shown

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models

Phuc Minh Nguyen

811

1

1

02 Oct 2025

How Should We Meta-Learn Reinforcement Learning Algorithms?

How Should We Meta-Learn Reinforcement Learning Algorithms?

Alexander David Goldie

Jakob N. Foerster

Shimon Whiteson

278

3

0

23 Jul 2025

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn

J. Obando-Ceron

Pablo Samuel Castro

Aaron Courville

200

3

0

31 May 2025

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

489

2

0

19 Oct 2024