ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05905
  4. Cited By
Soft Actor-Critic Algorithms and Applications

Soft Actor-Critic Algorithms and Applications

13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 475 papers shown
Title
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision
  Making with Large Language Models
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
122
7
0
27 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
32
128
0
25 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient
  DRL
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
40
0
0
21 Oct 2023
Offline Reinforcement Learning for Optimizing Production Bidding
  Policies
Offline Reinforcement Learning for Optimizing Production Bidding Policies
D. Korenkevych
Frank Cheng
Artsiom Balakir
Alex Nikulkov
Lingnan Gao
Zhihao Cen
Zuobing Xu
Zheqing Zhu
OffRL
31
1
0
13 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Energy-Efficient Visual Search by Eye Movement and Low-Latency Spiking
  Neural Network
Energy-Efficient Visual Search by Eye Movement and Low-Latency Spiking Neural Network
Yunhui Zhou
Dongqi Han
Yuguo Yu
3DH
39
0
0
10 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on
  Personalization Tasks
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced
  Datasets
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
29
1
0
06 Oct 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
35
1
0
28 Sep 2023
V2X-Lead: LiDAR-based End-to-End Autonomous Driving with
  Vehicle-to-Everything Communication Integration
V2X-Lead: LiDAR-based End-to-End Autonomous Driving with Vehicle-to-Everything Communication Integration
Zhi-Guo Deng
Yanjun Shi
Weiming Shen
31
0
0
26 Sep 2023
Adapting Double Q-Learning for Continuous Reinforcement Learning
Adapting Double Q-Learning for Continuous Reinforcement Learning
Arsenii Kuznetsov
OffRL
OnRL
24
0
0
25 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
36
0
0
21 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement
  Learning
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous
  Manipulation
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu
Aaron Rovinsky
Jianlan Luo
Vikash Kumar
Abhishek Gupta
Sergey Levine
OffRL
27
9
0
06 Sep 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
28
0
0
31 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for
  Multi-Policy Reuse
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
33
1
0
14 Aug 2023
Worrisome Properties of Neural Network Controllers and Their Symbolic
  Representations
Worrisome Properties of Neural Network Controllers and Their Symbolic Representations
J. Cyranka
Kevin E. M. Church
J. Lessard
42
0
0
28 Jul 2023
Reinforcement Learning by Guided Safe Exploration
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
42
5
0
20 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource
  Allocation
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
19
6
0
06 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
44
0
0
05 Jul 2023
Sim-to-real transfer of active suspension control using deep
  reinforcement learning
Sim-to-real transfer of active suspension control using deep reinforcement learning
Viktor Wiberg
Erik Wallin
Arvid Fälldin
Tobias Semberg
Morgan Rossander
E. Wadbro
Martin Servin
36
7
0
19 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal
  Summarization in Online Domains
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
26
0
0
17 Jun 2023
On the Efficacy of 3D Point Cloud Reinforcement Learning
On the Efficacy of 3D Point Cloud Reinforcement Learning
Z. Ling
Yuan Yao
Xuanlin Li
H. Su
3DPC
34
13
0
11 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote
  Teleoperation with Stochastic Time Delays
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays
Lucy McCutcheon
Saber Fallah
36
0
0
26 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
What Matters in Reinforcement Learning for Tractography
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in
  Linear Markov Decision Processes
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
35
26
0
15 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
39
3
0
11 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy
  Optimization
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular
  Procedures: A Systematic Review
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
Ameya Pore
Zhen Li
Diego DallÁlba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
24
29
0
06 May 2023
Learning Generalizable Pivoting Skills
Learning Generalizable Pivoting Skills
Xiang Zhang
Siddarth Jain
Baichuan Huang
Masayoshi Tomizuka
Diego Romeres
42
14
0
04 May 2023
TorchBench: Benchmarking PyTorch with High API Surface Coverage
TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao
Xu Zhao
Bin Bao
David Berard
William Constable
Adnan Aziz
Xu Liu
38
5
0
27 Apr 2023
Fulfilling Formal Specifications ASAP by Model-free Reinforcement
  Learning
Fulfilling Formal Specifications ASAP by Model-free Reinforcement Learning
Mengyu Liu
Pengyuan Lu
Xin Chen
Fanxin Kong
O. Sokolsky
Insup Lee
31
3
0
25 Apr 2023
Reclaimer: A Reinforcement Learning Approach to Dynamic Resource
  Allocation for Cloud Microservices
Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Quintin Fettes
Avinash Karanth
Razvan Bunescu
Brandon Beckwith
S. Subramoney
27
3
0
17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
34
34
0
03 Apr 2023
Utilizing Reinforcement Learning for de novo Drug Design
Utilizing Reinforcement Learning for de novo Drug Design
Hampus Gummesson Svensson
C. Tyrchan
Ola Engkvist
M. Chehreghani
43
17
0
30 Mar 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
34
0
0
27 Mar 2023
Inverse Reinforcement Learning without Reinforcement Learning
Inverse Reinforcement Learning without Reinforcement Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
21
34
0
26 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and
  Global Optimality
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
33
0
0
22 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic
  Local Planner and Polar State Representations
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
20
4
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy
  Constraint Q Learning for Propofol Infusion Control
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
38
5
0
17 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics
  Approaches for Satellite-to-Ground Laser Communication
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication
Payam Parvizi
Runnan Zou
C. Bellinger
R. Cheriton
D. Spinello
6
2
0
13 Mar 2023
Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite
  Communications
Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite Communications
Steffen Gracla
Alea Schröder
Maik Röper
C. Bockelmann
D. Wübben
Armin Dekorsy
11
4
0
13 Mar 2023
Visual-Policy Learning through Multi-Camera View to Single-Camera View
  Knowledge Distillation for Robot Manipulation Tasks
Visual-Policy Learning through Multi-Camera View to Single-Camera View Knowledge Distillation for Robot Manipulation Tasks
C. Acar
Kuluhan Binici
Alp Tekirdag
Yan Wu
37
1
0
13 Mar 2023
Previous
123456...8910
Next