ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05905
  4. Cited By
Soft Actor-Critic Algorithms and Applications

Soft Actor-Critic Algorithms and Applications

13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 480 papers shown
Title
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement
  Learning
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
51
109
0
17 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
21
0
0
13 Jun 2022
Does Self-supervised Learning Really Improve Reinforcement Learning from
  Pixels?
Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li
Jinghuan Shang
Srijan Das
Michael S. Ryoo
SSL
37
31
0
10 Jun 2022
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based
  on Maximum Entropy
Deep Multi-Agent Reinforcement Learning with Hybrid Action Spaces based on Maximum Entropy
Hongzhi Hua
Kaigui Wu
Guixuan Wen
24
0
0
10 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from
  Visual Observations
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on
  Exploration and Performance
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
43
17
0
08 Jun 2022
Stabilizing Voltage in Power Distribution Networks via Multi-Agent
  Reinforcement Learning with Transformer
Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer
Minrui Wang
Ming Feng
Wen-gang Zhou
Houqiang Li
33
9
0
08 Jun 2022
FishGym: A High-Performance Physics-based Simulation Framework for
  Underwater Robot Learning
FishGym: A High-Performance Physics-based Simulation Framework for Underwater Robot Learning
Wenji Liu
Kai-Yi Bai
Xuming He
Shuran Song
Changxi Zheng
Xiaopei Liu
AI4CE
35
12
0
03 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement
  Learning
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
35
35
0
02 Jun 2022
Control of Two-way Coupled Fluid Systems with Differentiable Solvers
Control of Two-way Coupled Fluid Systems with Differentiable Solvers
B. Ramos
Felix Trost
Nils Thuerey
AI4CE
19
5
0
01 Jun 2022
Timing is Everything: Learning to Act Selectively with Costly Actions
  and Budgetary Constraints
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
D. Mguni
Aivar Sootla
Juliusz Ziomek
Oliver Slumbers
Zipeng Dai
Kun Shao
Jun Wang
42
6
0
31 May 2022
Critic Sequential Monte Carlo
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
50
7
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning
  Framework
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert W. Platt
29
9
0
28 May 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in
  World Models
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
32
39
0
27 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
41
8
0
20 May 2022
The Primacy Bias in Deep Reinforcement Learning
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. DÓro
Pierre-Luc Bacon
Rameswar Panda
OnRL
96
182
0
16 May 2022
Qualitative Differences Between Evolutionary Strategies and
  Reinforcement Learning Methods for Control of Autonomous Agents
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
20
0
0
16 May 2022
How to Spend Your Robot Time: Bridging Kickstarting and Offline
  Reinforcement Learning for Vision-based Robotic Manipulation
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation
Alex X. Lee
Coline Devin
Jost Tobias Springenberg
Yuxiang Zhou
Thomas Lampe
A. Abdolmaleki
Konstantinos Bousmalis
OffRL
OnRL
26
15
0
06 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for
  Sample-Efficient Reinforcement Learning
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
32
12
0
02 May 2022
Evaluating Vision Transformer Methods for Deep Reinforcement Learning
  from Pixels
Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao
Daniele Reda
M. van de Panne
ViT
13
19
0
11 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled
  Hand
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand
Leon Sievers
Johannes Pitz
Berthold Bäuml
29
38
0
07 Apr 2022
Visual-Tactile Multimodality for Following Deformable Linear Objects
  Using Reinforcement Learning
Visual-Tactile Multimodality for Following Deformable Linear Objects Using Reinforcement Learning
Leszek Pecyna
Siyuan Dong
Shan Luo
19
21
0
31 Mar 2022
Monte Carlo Tree Search based Hybrid Optimization of Variational Quantum
  Circuits
Monte Carlo Tree Search based Hybrid Optimization of Variational Quantum Circuits
Jiahao Yao
Haoya Li
Marin Bukov
Lin Lin
Lexing Ying
16
15
0
30 Mar 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical
  Robots
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
33
19
0
23 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a
  soft actor-critic approach
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
24
40
0
19 Mar 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement
  Learning
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning
Jinxin Liu
Hongyin Zhang
Donglin Wang
OffRL
38
33
0
13 Mar 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
38
226
0
09 Mar 2022
Investigation of Factorized Optical Flows as Mid-Level Representations
Investigation of Factorized Optical Flows as Mid-Level Representations
Hsuan-Kung Yang
Tsu-Ching Hsiao
Tingbo Liao
Hsu-Shen Liu
Li-Yuan Tsao
Tzu-Wen Wang
Shan Yang
Yu-Wen Chen
Huang-ru Liao
Chun-Yi Lee
35
3
0
09 Mar 2022
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Xiaobai Ma
David Isele
Jayesh K. Gupta
K. Fujimura
Mykel J. Kochenderfer
14
5
0
06 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object
  Interaction
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He Wang
Li Yi
47
173
0
03 Mar 2022
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization
RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization
Nikolaos Kourtzanidis
Sajad Saeedi
37
2
0
26 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Design-Bench: Benchmarks for Data-Driven Offline Model-Based
  Optimization
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
Brandon Trabucco
Xinyang Geng
Aviral Kumar
Sergey Levine
OffRL
34
95
0
17 Feb 2022
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight
  Control
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control
Killian Dally
E. Kampen
27
16
0
16 Feb 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous
  Demonstration
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
30
0
0
14 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
65
0
13 Feb 2022
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Nate Gruver
Marc Finzi
Samuel Stanton
A. Wilson
AI4CE
26
40
0
10 Feb 2022
Bingham Policy Parameterization for 3D Rotations in Reinforcement
  Learning
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
Stephen James
Pieter Abbeel
35
9
0
08 Feb 2022
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
Soft Actor-Critic with Inhibitory Networks for Faster Retraining
J. Ide
Daria Mićović
Michael J. Guarino
K. Alcedo
D. Rosenbluth
Adrian P. Pope
18
3
0
07 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
69
31
0
28 Jan 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement
  for Value Error
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
24
44
0
28 Jan 2022
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven
  Learning in Artificial Intelligence Tasks
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven Learning in Artificial Intelligence Tasks
Chenyu Sun
Hangwei Qian
Chunyan Miao
18
10
0
20 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Mixture of basis for interpretable continual learning with distribution
  shifts
Mixture of basis for interpretable continual learning with distribution shifts
Mengda Xu
Sumitra Ganesh
Pranay Pasula
OOD
29
1
0
05 Jan 2022
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement
  Learning
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning
Yuxing Wang
Tiantian Zhang
Yongzhe Chang
Bin Liang
Xueqian Wang
Bo Yuan
29
15
0
01 Jan 2022
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Tianhao Wu
Yunchang Yang
Han Zhong
Liwei Wang
S. Du
Jiantao Jiao
55
14
0
21 Dec 2021
Variational Quantum Soft Actor-Critic
Variational Quantum Soft Actor-Critic
Qingfeng Lan
22
20
0
20 Dec 2021
Previous
123...1056789
Next