ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05905
  4. Cited By
Soft Actor-Critic Algorithms and Applications

Soft Actor-Critic Algorithms and Applications

13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 475 papers shown
Title
Evolving Populations of Diverse RL Agents with MAP-Elites
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
35
8
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
51
3
0
08 Mar 2023
Graph Decision Transformer
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
36
15
0
07 Mar 2023
Constrained Reinforcement Learning and Formal Verification for Safe
  Colonoscopy Navigation
Constrained Reinforcement Learning and Formal Verification for Safe Colonoscopy Navigation
Davide Corsi
Luca Marzari
Ameya Pore
Alessandro Farinelli
A. Casals
Paolo Fiorini
Diego DallÁlba
27
9
0
06 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
28
1
0
05 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
The In-Sample Softmax for Offline Reinforcement Learning
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
29
26
0
28 Feb 2023
Active Reward Learning from Online Preferences
Active Reward Learning from Online Preferences
Vivek Myers
Erdem Biyik
Dorsa Sadigh
OffRL
37
12
0
27 Feb 2023
Minimax-Bayes Reinforcement Learning
Minimax-Bayes Reinforcement Learning
Thomas Kleine Buening
Christos Dimitrakakis
Hannes Eriksson
Divya Grover
Emilio Jorge
OffRL
16
5
0
21 Feb 2023
Differentiable Arbitrating in Zero-sum Markov Games
Differentiable Arbitrating in Zero-sum Markov Games
Jing Wang
Meichen Song
Feng Gao
Boyi Liu
Zhaoran Wang
Yi Wu
43
2
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration
  for Task Automation of Surgical Robot
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based
  Reinforcement Learning
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
39
4
0
17 Feb 2023
Investigating the role of model-based learning in exploration and
  transfer
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
40
163
0
06 Feb 2023
Target-based Surrogates for Stochastic Optimization
Target-based Surrogates for Stochastic Optimization
J. Lavington
Sharan Vaswani
Reza Babanezhad
Mark Schmidt
Nicolas Le Roux
57
5
0
06 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
52
0
0
04 Feb 2023
MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion
  Control in Real Networks
MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks
Raffaele Galliera
A. Morelli
Roberto Fronteddu
N. Suri
32
4
0
02 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
CRC-RL: A Novel Visual Feature Representation Architecture for
  Unsupervised Reinforcement Learning
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
34
1
0
31 Jan 2023
Transferring Multiple Policies to Hotstart Reinforcement Learning in an
  Air Compressor Management Problem
Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem
Hélène Plisnier
Denis Steckelmacher
Jeroen Willems
B. Depraetere
Ann Nowé
OffRL
32
1
0
30 Jan 2023
Learning passive policies with virtual energy tanks in robotics
Learning passive policies with virtual energy tanks in robotics
R. Zanella
G. Palli
Stefano Stramigioli
Federico Califano
30
3
0
30 Jan 2023
Zero-Shot Transfer of Haptics-Based Object Insertion Policies
Zero-Shot Transfer of Haptics-Based Object Insertion Policies
Samarth Brahmbhatt
A. Deka
Andrew Spielberg
M. Muller
9
6
0
29 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration
  Without A-Priori Maps
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration Without A-Priori Maps
Shathushan Sivashangaran
A. Eskandarian
32
4
0
10 Jan 2023
Hint assisted reinforcement learning: an application in radio astronomy
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
30
1
0
10 Jan 2023
On The Fragility of Learned Reward Functions
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
33
20
0
09 Jan 2023
MERLIN: Multi-agent offline and transfer learning for occupant-centric
  energy flexible operation of grid-interactive communities using smart meter
  data and CityLearn
MERLIN: Multi-agent offline and transfer learning for occupant-centric energy flexible operation of grid-interactive communities using smart meter data and CityLearn
Kingsley Nweye
S. Sankaranarayanan
Zoltán Nagy
OffRL
AI4CE
25
25
0
31 Dec 2022
On Pathologies in KL-Regularized Reinforcement Learning from Expert
  Demonstrations
On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
Tim G. J. Rudner
Cong Lu
Michael A. Osborne
Yarin Gal
Yee Whye Teh
OffRL
38
27
0
28 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
37
14
0
24 Dec 2022
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep
  Guidance
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
32
19
0
19 Dec 2022
Cross-Domain Transfer via Semantic Skill Imitation
Cross-Domain Transfer via Semantic Skill Imitation
Karl Pertsch
Ruta Desai
Vikash Kumar
Franziska Meier
Joseph J. Lim
Dhruv Batra
Akshara Rai
LM&Ro
16
19
0
14 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with
  Demonstrations
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Generalizing LTL Instructions via Future Dependent Options
Generalizing LTL Instructions via Future Dependent Options
Duo Xu
Faramarz Fekri
OffRL
AI4CE
24
1
0
08 Dec 2022
RLogist: Fast Observation Strategy on Whole-slide Images with Deep
  Reinforcement Learning
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
31
9
0
04 Dec 2022
A Hierarchical Approach for Strategic Motion Planning in Autonomous
  Racing
A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing
Rudolf Reiter
Jasper Hoffmann
Joschka Boedecker
Moritz Diehl
30
13
0
03 Dec 2022
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task
  Environments
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter
Timo Thun
Tobias Meisen
36
1
0
01 Dec 2022
Domain Generalization for Robust Model-Based Offline Reinforcement
  Learning
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
29
0
0
27 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement
  Learning
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
34
13
0
21 Nov 2022
Building a Subspace of Policies for Scalable Continual Learning
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya
T. Doan
Lucas Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
42
29
0
18 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
39
27
0
16 Nov 2022
ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow
  from Point Clouds
ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow from Point Clouds
Daniel Seita
Yufei Wang
Sarthak J. Shetty
Edward Li
Zackory M. Erickson
David Held
3DPC
30
49
0
16 Nov 2022
Model Based Residual Policy Learning with Applications to Antenna
  Control
Model Based Residual Policy Learning with Applications to Antenna Control
Viktor Eriksson Mollerstedt
Alessio Russo
Maxime Bouton
OffRL
31
3
0
16 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
27
1
0
15 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards
  global optimality
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
30
15
0
12 Nov 2022
Progress and summary of reinforcement learning on energy management of
  MPS-EV
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
23
12
0
08 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial
  Observability
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert W. Platt
OffRL
30
19
0
03 Nov 2022
Characterising the Robustness of Reinforcement Learning for Continuous
  Control using Disturbance Injection
Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection
Catherine R. Glossop
Jacopo Panerati
A. Krishnan
Zhaocong Yuan
Angela P. Schoellig
24
6
0
27 Oct 2022
Previous
12345...8910
Next