1812.05905
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 480 papers shown
Title
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
51
109
0
17 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
21
0
0
13 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
37
31
0
10 Jun 2022
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy
Hongzhi Hua
Kaigui Wu
Guixuan Wen
24
0
0
10 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
43
17
0
08 Jun 2022
Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer
Minrui Wang
Ming Feng
Wen-gang Zhou
Houqiang Li
33
9
0
08 Jun 2022
FishGym: A High-Performance Physics-based Simulation Framework for Underwater Robot Learning
Wenji Liu
Kai-Yi Bai
Xuming He
Shuran Song
Changxi Zheng
Xiaopei Liu
AI4CE
35
12
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
35
35
0
02 Jun 2022
Control of Two-way Coupled Fluid Systems with Differentiable Solvers
B. Ramos
Felix Trost
Nils Thuerey
AI4CE
19
5
0
01 Jun 2022
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
D. Mguni
Aivar Sootla
Juliusz Ziomek
Oliver Slumbers
Zipeng Dai
Kun Shao
Jun Wang
42
6
0
31 May 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
50
7
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert W. Platt
29
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
32
39
0
27 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. D'Oro
Pierre-Luc Bacon
Rameswar Panda
OnRL
96
182
0
16 May 2022
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
20
0
0
16 May 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
26
15
0
06 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
32
12
0
02 May 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
13
19
0
11 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand
Leon Sievers
Johannes Pitz
Berthold Bäuml
29
38
0
07 Apr 2022
Visual-Tactile Multimodality for Following Deformable Linear Objects Using Reinforcement Learning
Leszek Pecyna
Siyuan Dong
Shan Luo
19
21
0
31 Mar 2022
Monte Carlo Tree Search based Hybrid Optimization of Variational Quantum Circuits
Jiahao Yao
Haoya Li
Marin Bukov
Lin Lin
Lexing Ying
16
15
0
30 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
33
19
0
23 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
24
40
0
19 Mar 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning
Jinxin Liu
Hongyin Zhang
Donglin Wang
OffRL
38
33
0
13 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
38
226
0
09 Mar 2022
Investigation of Factorized Optical Flows as Mid-Level Representations
Hsuan-Kung Yang
Tsu-Ching Hsiao
Tingbo Liao
Hsu-Shen Liu
Li-Yuan Tsao
Tzu-Wen Wang
Shan Yang
Yu-Wen Chen
Huang-ru Liao
Chun-Yi Lee
35
3
0
09 Mar 2022
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Xiaobai Ma
David Isele
Jayesh K. Gupta
K. Fujimura
Mykel J. Kochenderfer
14
5
0
06 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He Wang
Li Yi
47
173
0
03 Mar 2022
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization
Nikolaos Kourtzanidis
Sajad Saeedi
37
2
0
26 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
Brandon Trabucco
Xinyang Geng
Aviral Kumar
Sergey Levine
OffRL
34
95
0
17 Feb 2022
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control
Killian Dally
E. Kampen
27
16
0
16 Feb 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
30
0
0
14 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
65
0
13 Feb 2022
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Nate Gruver
Marc Finzi
Samuel Stanton
A. Wilson
AI4CE
26
40
0
10 Feb 2022
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
Stephen James
Pieter Abbeel
35
9
0
08 Feb 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
18
3
0
07 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
69
31
0
28 Jan 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
44
0
28 Jan 2022
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven Learning in Artificial Intelligence Tasks
Chenyu Sun
Hangwei Qian
Chunyan Miao
18
10
0
20 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Mixture of basis for interpretable continual learning with distribution shifts
Mengda Xu
Sumitra Ganesh
Pranay Pasula
OOD
29
1
0
05 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
29
15
0
01 Jan 2022
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Tianhao Wu
Yunchang Yang
Han Zhong
Liwei Wang
S. Du
Jiantao Jiao
55
14
0
21 Dec 2021
Variational Quantum Soft Actor-Critic
Qingfeng Lan
22
20
0
20 Dec 2021