Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.05905
Cited By
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 480 papers shown
Title
Attention Based Communication and Control for Multi-UAV Path Planning
Hamid Shiri
Hyowoon Seo
Jihong Park
M. Bennis
21
14
0
20 Dec 2021
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma
Kelvin Xu
Nikhil Sardana
Abhishek Gupta
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
60
26
0
17 Dec 2021
On Optimizing Interventions in Shared Autonomy
Weihao Tan
David Koleczek
Siddhant Pradhan
Nicholas Perello
Vivek Chettiar
Vishal Rohra
Aaslesha Rajaram
Soundararajan Srinivasan
H. M. S. Hossain
Yash Chandak
31
4
0
16 Dec 2021
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning
Trevor Ablett
Bryan Chan
Jonathan Kelly
37
4
0
16 Dec 2021
Invariance Through Latent Alignment
Takuma Yoneda
Ge Yang
Matthew R. Walter
Bradly C. Stadie
OOD
23
9
0
15 Dec 2021
Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration
Ziwei Luo
Jing Hu
Xin Wang
Shu Hu
Bin Kong
Youbing Yin
Qi Song
Xi Wu
Siwei Lyu
MedIm
27
12
0
14 Dec 2021
Stochastic Actor-Executor-Critic for Image-to-Image Translation
Ziwei Luo
Jing Hu
Xin Wang
Siwei Lyu
Bin Kong
Youbing Yin
Qi Song
Xi Wu
BDL
EGVM
30
5
0
14 Dec 2021
Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
32
1
0
10 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
29
167
0
08 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
28
38
0
06 Dec 2021
Residual Pathway Priors for Soft Equivariance Constraints
Marc Finzi
Gregory W. Benton
A. Wilson
BDL
UQCV
24
54
0
02 Dec 2021
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings
Kingsley Nweye
Bo Liu
Peter Stone
Zoltán Nagy
OffRL
AI4CE
37
37
0
25 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
Nicolai Dorka
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
OffRL
30
9
0
24 Nov 2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
34
79
0
22 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
95
59
0
09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
26
44
0
03 Nov 2021
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples
Daniel Felipe Ordoñez Apraez
Antonio Agudo
Francesc Moreno-Noguer
Mario Martin
44
8
0
28 Oct 2021
RoMA: Robust Model Adaptation for Offline Model-based Optimization
Sihyun Yu
Sungsoo Ahn
Le Song
Jinwoo Shin
OffRL
37
31
0
27 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control
Vinay Hanumaiah
Sahika Genc
AI4CE
16
6
0
26 Oct 2021
Learning Insertion Primitives with Discrete-Continuous Hybrid Action Space for Robotic Assembly Tasks
Yongyu Wang
Shiyu Jin
Changhao Wang
Xinghao Zhu
Masayoshi Tomizuka
28
42
0
25 Oct 2021
Hierarchical Skills for Efficient Exploration
Jonas Gehring
Gabriel Synnaeve
Andreas Krause
Nicolas Usunier
28
40
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
44
17
0
07 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
On the Privacy Risks of Deploying Recurrent Neural Networks in Machine Learning Models
Yunhao Yang
Parham Gohari
Ufuk Topcu
AAML
35
3
0
06 Oct 2021
Adaptive control of a mechatronic system using constrained residual reinforcement learning
Tom Staessens
Tom Lefebvre
Guillaume Crevecoeur
22
16
0
06 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
13
105
0
05 Oct 2021
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation
Soojung Yang
Doyeong Hwang
Seul Lee
Seongok Ryu
Sung Ju Hwang
36
67
0
04 Oct 2021
The
f
f
f
-Divergence Reinforcement Learning Framework
Chen Gong
Qiang He
Yunpeng Bai
Zhouyi Yang
Xiaoyu Chen
Xinwen Hou
Xianjie Zhang
Yu Liu
Guoliang Fan
36
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
19
30
0
24 Sep 2021
Accessibility-Based Clustering for Efficient Learning of Locomotion Skills
Chong Zhang
Wanming Yu
Zhibin Li
31
9
0
23 Sep 2021
Shape Control of Deformable Linear Objects with Offline and Online Learning of Local Linear Deformation Models
Mingrui Yu
Hanzhong Zhong
Xiang-Yang Li
OffRL
AI4CE
38
41
0
23 Sep 2021
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach
Minghao Li
Yingrui Jie
Yang Kong
Hui Cheng
43
9
0
17 Sep 2021
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless Powered MEC Networks with Fairness Constraint
Xiaoyi Zhou
Liang Huang
Tong Ye
Weiqiang Sun
11
1
0
13 Sep 2021
Learning to Navigate Sidewalks in Outdoor Environments
Maks Sorokin
Jie Tan
Karen Liu
Sehoon Ha
26
41
0
12 Sep 2021
Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios
Jingliang Duan
Yangang Ren
Fawang Zhang
Yang Guan
Dongjie Yu
Shengbo Eben Li
B. Cheng
Lin Zhao
23
7
0
12 Sep 2021
Optimizing Quantum Variational Circuits with Deep Reinforcement Learning
Owen Lockwood
22
9
0
07 Sep 2021
APPLE: Adaptive Planner Parameter Learning from Evaluative Feedback
Zizhao Wang
Xuesu Xiao
Garrett A. Warnell
Peter Stone
25
44
0
22 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
35
9
0
04 Aug 2021
Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications
Junya Ikemoto
T. Ushio
21
3
0
03 Aug 2021
A Reinforcement Learning Approach for Scheduling in mmWave Networks
M. Dogan
Yahya H. Ezzeldin
Christina Fragouli
Addison W. Bohannon
22
10
0
01 Aug 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu
Z. Ling
Fanbo Xiang
Derek Yang
Xuanlin Li
Stone Tao
Zhiao Huang
Zhiwei Jia
Hao Su
41
132
0
30 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao
Qi Yu
Yu Kong
FAtt
27
39
0
21 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Krishan Rana
Vibhavari Dasagi
Jesse Haviland
Ben Talbot
Michael Milford
Niko Sünderhauf
BDL
OffRL
27
31
0
21 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
36
338
0
20 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
29
11
0
10 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
34
66
0
08 Jul 2021
Previous
1
2
3
...
10
6
7
8
9
Next