ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.02034
  4. Cited By
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
v1v2 (latest)

Dropout Q-Functions for Doubly Efficient Reinforcement Learning

5 October 2021
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
ArXiv (abs)PDFHTML

Papers citing "Dropout Q-Functions for Doubly Efficient Reinforcement Learning"

33 / 83 papers shown
Imitation Bootstrapped Reinforcement Learning
Imitation Bootstrapped Reinforcement Learning
Hengyuan Hu
Suvir Mirchandani
Dorsa Sadigh
480
47
0
03 Nov 2023
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment
Annie S. Chen
Govind Chada
Laura M. Smith
Archit Sharma
Zipeng Fu
Sergey Levine
Chelsea Finn
474
9
0
02 Nov 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio
  Minimization
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio MinimizationInternational Conference on Learning Representations (ICLR), 2023
Guowei Xu
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Zhecheng Yuan
...
Shuzhen Li
Yanjie Ze
Hal Daumé
Furong Huang
Huazhe Xu
303
42
0
30 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
406
5
0
30 Oct 2023
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic
  Locomotion
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic LocomotionIEEE International Conference on Robotics and Automation (ICRA), 2023
Laura M. Smith
Yunhao Cao
Sergey Levine
OffRL
185
31
0
26 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Mind the Model, Not the Agent: The Primacy Bias in Model-based RLEuropean Conference on Artificial Intelligence (ECAI), 2023
Zhongjian Qiao
Jiafei Lyu
Xiu Li
237
4
0
23 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient
  DRL
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
293
0
0
21 Oct 2023
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
Nico Bohlinger
Klaus Dorer
188
5
0
20 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware AbstractionInternational Conference on Learning Representations (ICLR), 2023
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
384
66
0
13 Oct 2023
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
Antonin Raffin
Olivier Sigaud
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
181
4
0
09 Oct 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient
  Reinforcement Learning
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
360
40
0
19 Jun 2023
Normalization Enhances Generalization in Visual Reinforcement Learning
Normalization Enhances Generalization in Visual Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Lu Li
Jiafei Lyu
Guozheng Ma
Zilin Wang
Zhen Yang
Xiu Li
Zhiheng Li
OOD
201
12
0
01 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseInformation Sciences (Inf. Sci.), 2023
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
191
15
0
29 May 2023
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression
Distributional Reinforcement Learning with Dual Expectile-Quantile RegressionConference on Uncertainty in Artificial Intelligence (UAI), 2023
Sami Jullien
Romain Deffayet
J. Renders
Paul T. Groth
Maarten de Rijke
OOD
297
1
0
26 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Revisiting the Minimalist Approach to Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
326
76
0
16 May 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating OverfittingInternational Conference on Learning Representations (ICLR), 2023
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
284
42
0
20 Apr 2023
Learning and Adapting Agile Locomotion Skills by Transferring Experience
Learning and Adapting Agile Locomotion Skills by Transferring Experience
Laura M. Smith
J. Kew
Tianyu Li
Linda Luu
Xue Bin Peng
Sehoon Ha
Jie Tan
Sergey Levine
218
61
0
19 Apr 2023
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning
RoboPianist: Dexterous Piano Playing with Deep Reinforcement LearningConference on Robot Learning (CoRL), 2023
Kevin Zakka
Philipp Wu
Laura M. Smith
Nimrod Gileadi
Taylor A. Howell
...
Sumeet Singh
Yuval Tassa
Pete Florence
Andy Zeng
Pieter Abbeel
401
49
0
09 Apr 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor
  Feature-Based Concurrent Composition
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent CompositionEuropean Control Conference (ECC), 2023
Y. Liu
Aamir Ahmad
238
5
0
24 Mar 2023
Understanding the Synergies between Quality-Diversity and Deep
  Reinforcement Learning
Understanding the Synergies between Quality-Diversity and Deep Reinforcement LearningAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2023
Bryan Lim
Manon Flageat
Antoine Cully
OnRL
212
9
0
10 Mar 2023
A Survey on Uncertainty Quantification Methods for Deep Learning
A Survey on Uncertainty Quantification Methods for Deep Learning
Wenchong He
Zhe Jiang
Tingsong Xiao
Zelin Xu
Yukun Li
BDLUQCVAI4CE
701
57
0
26 Feb 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
289
129
0
24 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline DataInternational Conference on Machine Learning (ICML), 2023
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRLOnRL
530
257
0
06 Feb 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
213
0
0
26 Jan 2023
Sample-Efficient Multi-Objective Learning via Generalized Policy
  Improvement Prioritization
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement PrioritizationAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
L. N. Alegre
A. Bazzan
D. Roijers
Ann Nowé
Bruno C. da Silva
296
46
0
18 Jan 2023
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep
  Guidance
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep GuidanceIEEE International Conference on Robotics and Automation (ICRA), 2022
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
171
30
0
19 Dec 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
258
19
0
20 Nov 2022
How to Enable Uncertainty Estimation in Proximal Policy Optimization
How to Enable Uncertainty Estimation in Proximal Policy Optimization
Eugene Bykovets
Yannick Metz
Mennatallah El-Assady
Daniel A. Keim
J. M. Buhmann
UQCV
198
2
0
07 Oct 2022
Learning to Exploit Elastic Actuators for Quadruped Locomotion
Learning to Exploit Elastic Actuators for Quadruped Locomotion
Antonin Raffin
D. Seidel
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
177
6
0
15 Sep 2022
A Review of Uncertainty for Deep Reinforcement Learning
A Review of Uncertainty for Deep Reinforcement LearningArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Owen Lockwood
Mei Si
218
67
0
18 Aug 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free
  Reinforcement Learning
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
145
122
0
16 Aug 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for
  Improved Sample Efficiency in Continuous Control Tasks
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control TasksConference on Learning for Dynamics & Control (L4DC), 2022
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
198
3
0
18 May 2022
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater
  Sample Efficiency and Simplicity
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
Aditya Bhatt
Daniel Palenicek
Boris Belousov
Max Argus
Artemij Amiranashvili
Thomas Brox
Jan Peters
316
84
0
14 Feb 2019
Previous
12