2110.02034
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
5 October 2021
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
Papers citing
"Dropout Q-Functions for Doubly Efficient Reinforcement Learning"
50 / 74 papers shown
Title
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
31
0
0
14 Apr 2025
Learning to Play Piano in the Real World
Yves-Simon Zeulner
Sandeep Selvaraj
Roberto Calandra
38
0
0
19 Mar 2025
Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion
Nico Bohlinger
Jonathan Kinzel
Daniel Palenicek
Lukasz Antczak
Jan Peters
41
1
0
11 Mar 2025
Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design
Yasir Zubayr Barlas
Kizito Salako
32
0
0
07 Mar 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kolobov
Abhishek Gupta
70
2
0
04 Feb 2025
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
52
0
0
19 Oct 2024
Traversability-Aware Legged Navigation by Learning from Real-World Visual Data
Hongbo Zhang
Zhongyu Li
Xuanqi Zeng
Laura Smith
Kyle Stachowicz
...
Zhitao Song
Weipeng Xia
Sergey Levine
K. Sreenath
Yun-hui Liu
34
2
0
14 Oct 2024
Reinforcement Learning For Quadrupedal Locomotion: Current Advancements And Future Perspectives
Maurya Gurram
Prakash Kumar Uttam
Shantipal S. Ohol
OffRL
34
0
0
14 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
62
6
0
13 Oct 2024
Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models
Jacob Levy
T. Westenbroek
David Fridovich-Keil
23
0
0
11 Oct 2024
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning
Xinran Li
Ling Pan
Jun Zhang
18
1
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
39
1
0
11 Oct 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
48
1
0
11 Sep 2024
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
Yi Zhao
Le Chen
Jan Schneider
Quankai Gao
Juho Kannala
Bernhard Schölkopf
J. Pajarinen
Dieter Büchler
25
1
0
20 Aug 2024
Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning
Thomas Rudolf
Philip Muhl
Sören Hohmann
Lutz Eckstein
21
0
0
04 Aug 2024
HiLMa-Res: A General Hierarchical Framework via Residual RL for Combining Quadrupedal Locomotion and Manipulation
Xiaoyu Huang
Qiayuan Liao
Yiming Ni
Zhongyu Li
Laura Smith
Sergey Levine
Xue Bin Peng
K. Sreenath
33
3
0
09 Jul 2024
Augmented Bayesian Policy Search
Mahdi Kallel
Debabrota Basu
R. Akrour
Carlo D'Eramo
35
2
0
05 Jul 2024
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
Sebastian Dittert
Vincent Moens
Gianni de Fabritiis
26
1
0
25 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
45
16
0
03 Jun 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Fuchun Sun
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
56
3
0
29 May 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
35
0
0
27 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Miłoś
Marek Cygan
OffRL
37
16
0
25 May 2024
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences
Takuya Hiraoka
Guanquan Wang
Takashi Onishi
Yoshimasa Tsuruoka
37
0
0
23 May 2024
Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
M. Khan
Syed Hammad Ahmed
G. Sukthankar
28
0
0
14 May 2024
AFU: Actor-Free critic Updates in off-policy RL for continuous control
Nicolas Perrin-Gilbert
OffRL
27
0
0
24 Apr 2024
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
OffRL
20
4
0
23 Apr 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
OffRL
32
0
0
09 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
25
0
0
31 Mar 2024
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
Motoki Omura
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
22
0
0
12 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
34
3
0
09 Mar 2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
M. Ostaszewski
Marek Cygan
34
0
0
01 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Miłoś
Tomasz Trzciński
M. Ostaszewski
Marek Cygan
OffRL
22
16
0
01 Mar 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Aaron C. Courville
Pablo Samuel Castro
OffRL
36
18
0
19 Feb 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
29
9
0
22 Jan 2024
ReACT: Reinforcement Learning for Controller Parametrization using B-Spline Geometries
Thomas Rudolf
Daniel Flögel
Tobias Schürmann
Simon Süß
S. Schwab
Sören Hohmann
AI4CE
28
1
0
10 Jan 2024
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
UD
16
2
0
05 Jan 2024
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
25
1
0
10 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
15
0
0
30 Nov 2023
Imitation Bootstrapped Reinforcement Learning
Hengyuan Hu
Suvir Mirchandani
Dorsa Sadigh
30
24
0
03 Nov 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
Guowei Xu
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Zhecheng Yuan
...
Shuzhen Li
Yanjie Ze
Hal Daumé
Furong Huang
Huazhe Xu
34
27
0
30 Oct 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
27
1
0
30 Oct 2023
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion
Laura M. Smith
Yunhao Cao
Sergey Levine
OffRL
18
19
0
26 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
16
3
0
23 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
30
0
0
21 Oct 2023
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
Nico Bohlinger
Klaus Dorer
22
4
0
20 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
28
34
0
13 Oct 2023
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
Antonin Raffin
Olivier Sigaud
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
19
2
0
09 Oct 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
76
25
0
19 Jun 2023
Normalization Enhances Generalization in Visual Reinforcement Learning
Lu Li
Jiafei Lyu
Guozheng Ma
Zilin Wang
Zhen Yang
Xiu Li
Zhiheng Li
OOD
17
8
0
01 Jun 2023