ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05905
  4. Cited By
Soft Actor-Critic Algorithms and Applications

Soft Actor-Critic Algorithms and Applications

13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 480 papers shown
Title
Characterising the Robustness of Reinforcement Learning for Continuous
  Control using Disturbance Injection
Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection
Catherine R. Glossop
Jacopo Panerati
A. Krishnan
Zhaocong Yuan
Angela P. Schoellig
24
6
0
27 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online
  Reinforcement Learning
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
28
39
0
25 Oct 2022
On Many-Actions Policy Gradient
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
19
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Solving Continuous Control via Q-learning
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
39
22
0
22 Oct 2022
Robust Imitation via Mirror Descent Inverse Reinforcement Learning
Robust Imitation via Mirror Descent Inverse Reinforcement Learning
Dong-Sig Han
Hyunseok Kim
Hyun-Dong Lee
Je-hwan Ryu
Byoung-Tak Zhang
28
2
0
20 Oct 2022
Limited or Biased: Modeling Sub-Rational Human Investors in Financial
  Markets
Limited or Biased: Modeling Sub-Rational Human Investors in Financial Markets
Penghang Liu
Kshama Dwarakanath
Svitlana Vyetrenko
Tucker Balch
AIFin
34
5
0
16 Oct 2022
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic
  Reinforcement Learning at Scale
PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale
Kuang-Huei Lee
Ted Xiao
A. Li
Paul Wohlhart
Ian S. Fischer
Yao Lu
53
10
0
15 Oct 2022
CUP: Critic-Guided Policy Reuse
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
34
8
0
15 Oct 2022
Deep Reinforcement Learning-based Rebalancing Policies for Profit
  Maximization of Relay Nodes in Payment Channel Networks
Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks
Nikolaos Papadis
Leandros Tassiulas
23
5
0
13 Oct 2022
Self-Validated Physics-Embedding Network: A General Framework for
  Inverse Modelling
Self-Validated Physics-Embedding Network: A General Framework for Inverse Modelling
Ruiyuan Kang
D. Kyritsis
P. Liatsis
AI4CE
PINN
18
5
0
12 Oct 2022
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in
  Embodied Rearrangement
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement
Erik Wijmans
Irfan Essa
Dhruv Batra
OffRL
52
13
0
11 Oct 2022
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS
  Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and
  Imperfect CSI
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI
Baturay Saglam
Doğa Gürgünoğlu
Suleyman Serdar Kozat
24
12
0
10 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
Tianli Ding
L. Graesser
Saminda Abeyruwan
David B. DÁmbrosio
Anish Shankar
P. Sermanet
Pannag R. Sanketi
Corey Lynch
59
20
0
07 Oct 2022
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive
  Control
Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control
Murad Dawood
Nils Dengler
Jorge de Heuvel
Maren Bennewitz
26
9
0
04 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent
  Representation
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
21
10
0
02 Oct 2022
Throwing Objects into A Moving Basket While Avoiding Obstacles
Throwing Objects into A Moving Basket While Avoiding Obstacles
Hamidreza Kasaei
Mohammadreza Kasaei
40
5
0
02 Oct 2022
Bayesian Q-learning With Imperfect Expert Demonstrations
Bayesian Q-learning With Imperfect Expert Demonstrations
Fengdi Che
Xiru Zhu
Doina Precup
David Meger
Gregory Dudek
19
2
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing
  Plausible Novel States
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
28
3
0
01 Oct 2022
Guiding Safe Exploration with Weakest Preconditions
Guiding Safe Exploration with Weakest Preconditions
Greg Anderson
Swarat Chaudhuri
Işıl Dillig
36
6
0
28 Sep 2022
Disentangling Transfer in Continual Reinforcement Learning
Disentangling Transfer in Continual Reinforcement Learning
Maciej Wołczyk
Michal Zajkac
Razvan Pascanu
Lukasz Kuciñski
Piotr Milo's
CLL
81
27
0
28 Sep 2022
Modern Machine Learning Tools for Monitoring and Control of Industrial
  Processes: A Survey
Modern Machine Learning Tools for Monitoring and Control of Industrial Processes: A Survey
R. Bhushan Gopaluni
Aditya Tulsyan
Benoît Chachuat
Biao Huang
J. M. Lee
Faraz Amjad
S. Damarla
Jong Woo Kim
Nathan P. Lawrence
AI4CE
21
38
0
22 Sep 2022
Revisiting Discrete Soft Actor-Critic
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
51
12
0
21 Sep 2022
Honor of Kings Arena: an Environment for Generalization in Competitive
  Reinforcement Learning
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning
Hua Wei
Jingxiao Chen
Xiyang Ji
Hongyang Qin
Minwen Deng
...
Lin Liu
Lanxiao Huang
Deheng Ye
Qiang Fu
Wei Yang
43
28
0
18 Sep 2022
Neuromuscular Reinforcement Learning to Actuate Human Limbs through FES
Neuromuscular Reinforcement Learning to Actuate Human Limbs through FES
Nat Wannawas
A. Shafti
Aldo A. Faisal
OffRL
16
9
0
16 Sep 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble
  of Deep Networks
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
27
16
0
16 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical
  Approach to Address Backtest Overfitting
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
32
13
0
12 Sep 2022
Instruction-driven history-aware policies for robotic manipulations
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
113
102
0
11 Sep 2022
Variational Inference for Model-Free and Model-Based Reinforcement
  Learning
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
23
0
0
04 Sep 2022
Co-Imitation: Learning Design and Behaviour by Imitation
Co-Imitation: Learning Design and Behaviour by Imitation
C. Rajani
Karol Arndt
David Blanco Mulero
K. Luck
Ville Kyrki
26
4
0
02 Sep 2022
Sample Efficient Learning of Factored Embeddings of Tensor Fields
Sample Efficient Learning of Factored Embeddings of Tensor Fields
Taemin Heo
Minh Nguyen
MedIm
23
0
0
01 Sep 2022
A further exploration of deep Multi-Agent Reinforcement Learning with
  Hybrid Action Space
A further exploration of deep Multi-Agent Reinforcement Learning with Hybrid Action Space
Hongzhi Hua
Guixuan Wen
Kaigui Wu
27
1
0
30 Aug 2022
Augmenting Reinforcement Learning with Transformer-based Scene
  Representation Learning for Decision-making of Autonomous Driving
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous Driving
Haochen Liu
Zhiyu Huang
Xiaoyu Mo
Chen Lv
ViT
OffRL
35
34
0
24 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph
  Learning for Continuous Action Space
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
40
27
0
19 Aug 2022
Unified Policy Optimization for Continuous-action Reinforcement Learning
  in Non-stationary Tasks and Games
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Rongjun Qin
Fan Luo
Hong Qian
Yang Yu
30
2
0
19 Aug 2022
Entropy Augmented Reinforcement Learning
Entropy Augmented Reinforcement Learning
Jianfei Ma
36
0
0
19 Aug 2022
Towards Augmented Microscopy with Reinforcement Learning-Enhanced
  Workflows
Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows
Michael Xu
Abinash Kumar
J. Lebeau
18
7
0
04 Aug 2022
A Survey on Masked Autoencoder for Self-supervised Learning in Vision
  and Beyond
A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond
Chaoning Zhang
Chenshuang Zhang
Junha Song
John Seon Keun Yi
Kang Zhang
In So Kweon
SSL
57
72
0
30 Jul 2022
Improved Policy Optimization for Online Imitation Learning
Improved Policy Optimization for Online Imitation Learning
J. Lavington
Sharan Vaswani
Mark Schmidt
OffRL
25
6
0
29 Jul 2022
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive
  Information Representations
PI-ARS: Accelerating Evolution-Learned Visual-Locomotion with Predictive Information Representations
Kuang-Huei Lee
Ofir Nachum
Tingnan Zhang
S. Guadarrama
Jie Tan
Wenhao Yu
24
16
0
27 Jul 2022
Abstract Demonstrations and Adaptive Exploration for Efficient and
  Stable Multi-step Sparse Reward Reinforcement Learning
Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning
Xintong Yang
Ze Ji
Jing Wu
Yunyu Lai
OffRL
27
5
0
19 Jul 2022
Online Game Level Generation from Music
Online Game Level Generation from Music
Ziqi Wang
Jialin Liu
37
2
0
12 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
35
36
0
03 Jul 2022
Learning fast and agile quadrupedal locomotion over complex terrain
Learning fast and agile quadrupedal locomotion over complex terrain
Xu Chang
Zhitong Zhang
Honglei An
Hongxu Ma
Qing Wei
29
0
0
02 Jul 2022
q-Learning in Continuous Time
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
51
70
0
02 Jul 2022
Colonoscopy Navigation using End-to-End Deep Visuomotor Control: A User
  Study
Colonoscopy Navigation using End-to-End Deep Visuomotor Control: A User Study
Ameya Pore
M. Finocchiaro
Diego DallÁlba
A. Hernansanz
G. Ciuti
A. Arezzo
A. Menciassi
A. Casals
Paolo Fiorini
32
14
0
30 Jun 2022
DayDreamer: World Models for Physical Robot Learning
DayDreamer: World Models for Physical Robot Learning
Philipp Wu
Alejandro Escontrela
Danijar Hafner
Ken Goldberg
Pieter Abbeel
61
277
0
28 Jun 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
32
10
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning
  Environments
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
26
4
0
17 Jun 2022
Previous
123456...8910
Next