v1v2 (latest)

Dropout Q-Functions for Doubly Efficient Reinforcement Learning

5 October 2021

Papers citing "Dropout Q-Functions for Doubly Efficient Reinforcement Learning"

50 / 83 papers shown

Dexterous Robotic Piano Playing at Scale

171

04 Nov 2025

Off-policy Reinforcement Learning with Model-based Exploration Augmentation

172

29 Oct 2025

XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning

157

29 Sep 2025

An Investigation of Batch Normalization in Off-Policy Actor-Critic Algorithms

153

28 Sep 2025

Solving Robotics Tasks with Prior Demonstration via Exploration-Efficient Deep Reinforcement Learning

Chengyandan Shen

Christoffer Sloth

OffRL

113

04 Sep 2025

A Tutorial: An Intuitive Explanation of Offline Reinforcement Learning Theory

Fengdi Che

OffRL

136

11 Aug 2025

Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies

177

05 Aug 2025

Scaling Algorithm Distillation for Continuous Control with Mamba

Samuel Beaussant

Mehdi Mounsif

191

16 Jun 2025

Scaling CrossQ with Weight Normalization

Daniel Palenicek

Florian Vogt

Jan Peters

287

04 Jun 2025

Growable and Interpretable Neural Control with Online Continual Learning for Autonomous Lifelong Locomotion Learning MachinesThe international journal of robotics research (IJRR), 2025

Arthicha Srisuchinnawong

Poramate Manoonpong

CLL LRM

296

17 May 2025

Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss

Ukjo Hwang

Songnam Hong

OffRL

228

14 Apr 2025

Learning to Play Piano in the Real World

Yves-Simon Zeulner

Sandeep Selvaraj

Roberto Calandra

279

19 Mar 2025

Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion

261

11 Mar 2025

Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design

Yasir Zubayr Barlas

Kizito Salako

228

07 Mar 2025

Hyperspherical Normalization for Scalable Deep Reinforcement Learning

465

21 Feb 2025

Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-TuningInternational Conference on Learning Representations (ICLR), 2025

434

04 Feb 2025

Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations

489

19 Oct 2024

Traversability-Aware Legged Navigation by Learning from Real-World Visual Data

...

265

14 Oct 2024

Reinforcement Learning For Quadrupedal Locomotion: Current Advancements And Future Perspectives

347

14 Oct 2024

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024

464

13 Oct 2024

Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics ModelsConference on Robot Learning (CoRL), 2024

Jacob Levy

T. Westenbroek

David Fridovich-Keil

347

11 Oct 2024

Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

Xinran Li

Ling Pan

Jun Zhang

207

11 Oct 2024

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RLInternational Conference on Learning Representations (ICLR), 2024

C. Voelcker

Marcel Hussing

Eric Eaton

Amir-massoud Farahmand

Igor Gilitschenski

405

11 Oct 2024

The Role of Deep Learning Regularizations on Actors in Offline RL

354

11 Sep 2024

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot HandsConference on Robot Learning (CoRL), 2024

Le Chen

200

20 Aug 2024

Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning

206

04 Aug 2024

HiLMa-Res: A General Hierarchical Framework via Residual RL for Combining Quadrupedal Locomotion and Manipulation

Laura Smith

220

09 Jul 2024

Augmented Bayesian Policy Search

179

05 Jul 2024

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

Sebastian Dittert

Vincent Moens

Gianni De Fabritiis

249

25 Jun 2024

Learning-based legged locomotion; state of the art and future perspectives

Wenhao Yu

324

03 Jun 2024

OMPO: A Unified Framework for RL under Policy and Dynamics Shifts

292

29 May 2024

Oracle-Efficient Reinforcement Learning for Max Value Ensembles

212

27 May 2024

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

253

25 May 2024

Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

264

23 May 2024

An Efficient Learning Control Framework With Sim-to-Real for String-Type Artificial Muscle-Driven Robotic SystemsIEEE/ASME transactions on mechatronics (TAM), 2024

Jiyue Tao

Yunsong Zhang

Sunil Kumar Rajendran

Feitian Zhang

448

17 May 2024

Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-LearningThe Florida AI Research Society (FLAIRS), 2024

M. Khan

Syed Hammad Ahmed

G. Sukthankar

165

14 May 2024

AFU: Actor-Free critic Updates in off-policy RL for continuous control

Nicolas Perrin-Gilbert

OffRL

297

24 Apr 2024

Rank2Reward: Learning Shaped Reward Functions from Passive Video

Dima Damen

Abhishek Gupta

229

23 Apr 2024

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning

Zhen Wang

314

09 Apr 2024

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration

Yibo Wang

Jiang Zhao

OffRL OnRL

232

31 Mar 2024

Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2024

165

12 Mar 2024

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence

Marcel Hussing

C. Voelcker

Igor Gilitschenski

Amir-massoud Farahmand

Eric Eaton

346

09 Mar 2024

A Case for Validation Buffer in Pessimistic Actor-Critic

Michal Nauman

M. Ostaszewski

Marek Cygan

227

01 Mar 2024

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

343

01 Mar 2024

In value-based deep reinforcement learning, a pruned network is a good network

480

19 Feb 2024

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid AlgorithmsIEEE Transactions on Evolutionary Computation (IEEE Trans. Evol. Comput.), 2024

Jianye Hao

Yan Zheng

339

22 Jan 2024

ReACT: Reinforcement Learning for Controller Parametrization using B-Spline GeometriesIEEE International Conference on Systems, Man and Cybernetics (SMC), 2023

204

10 Jan 2024

A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty

Parvin Malekzadeh

Ming Hou

Konstantinos N. Plataniotis

210

05 Jan 2024

Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization

Takuya Hiraoka

OffRL

269

10 Dec 2023

Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning

158

30 Nov 2023