Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.05905
Cited By
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 475 papers shown
Title
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
40
1
0
30 Oct 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
122
7
0
27 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
32
128
0
25 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
40
0
0
21 Oct 2023
Offline Reinforcement Learning for Optimizing Production Bidding Policies
D. Korenkevych
Frank Cheng
Artsiom Balakir
Alex Nikulkov
Lingnan Gao
Zhihao Cen
Zuobing Xu
Zheqing Zhu
OffRL
31
1
0
13 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Energy-Efficient Visual Search by Eye Movement and Low-Latency Spiking Neural Network
Yunhui Zhou
Dongqi Han
Yuguo Yu
3DH
39
0
0
10 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
29
1
0
06 Oct 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
35
1
0
28 Sep 2023
V2X-Lead: LiDAR-based End-to-End Autonomous Driving with Vehicle-to-Everything Communication Integration
Zhi-Guo Deng
Yanjun Shi
Weiming Shen
31
0
0
26 Sep 2023
Adapting Double Q-Learning for Continuous Reinforcement Learning
Arsenii Kuznetsov
OffRL
OnRL
24
0
0
25 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
36
0
0
21 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu
Aaron Rovinsky
Jianlan Luo
Vikash Kumar
Abhishek Gupta
Sergey Levine
OffRL
27
9
0
06 Sep 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
28
0
0
31 Aug 2023
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Siyuan Li
Haoyang Li
Jin Zhang
Zhen Wang
Peng Liu
Chongjie Zhang
OffRL
33
1
0
14 Aug 2023
Worrisome Properties of Neural Network Controllers and Their Symbolic Representations
J. Cyranka
Kevin E. M. Church
J. Lessard
42
0
0
28 Jul 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
42
5
0
20 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
19
6
0
06 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
44
0
0
05 Jul 2023
Sim-to-real transfer of active suspension control using deep reinforcement learning
Viktor Wiberg
Erik Wallin
Arvid Fälldin
Tobias Semberg
Morgan Rossander
E. Wadbro
Martin Servin
36
7
0
19 Jun 2023
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
26
0
0
17 Jun 2023
On the Efficacy of 3D Point Cloud Reinforcement Learning
Z. Ling
Yuan Yao
Xuanlin Li
H. Su
3DPC
34
13
0
11 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays
Lucy McCutcheon
Saber Fallah
36
0
0
26 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
35
26
0
15 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
39
3
0
11 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
Ameya Pore
Zhen Li
Diego DallÁlba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
24
29
0
06 May 2023
Learning Generalizable Pivoting Skills
Xiang Zhang
Siddarth Jain
Baichuan Huang
Masayoshi Tomizuka
Diego Romeres
42
14
0
04 May 2023
TorchBench: Benchmarking PyTorch with High API Surface Coverage
Yueming Hao
Xu Zhao
Bin Bao
David Berard
William Constable
Adnan Aziz
Xu Liu
38
5
0
27 Apr 2023
Fulfilling Formal Specifications ASAP by Model-free Reinforcement Learning
Mengyu Liu
Pengyuan Lu
Xin Chen
Fanxin Kong
O. Sokolsky
Insup Lee
31
3
0
25 Apr 2023
Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Quintin Fettes
Avinash Karanth
Razvan Bunescu
Brandon Beckwith
S. Subramoney
27
3
0
17 Apr 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
38
6
0
14 Apr 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
34
34
0
03 Apr 2023
Utilizing Reinforcement Learning for de novo Drug Design
Hampus Gummesson Svensson
C. Tyrchan
Ola Engkvist
M. Chehreghani
43
17
0
30 Mar 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
34
0
0
27 Mar 2023
Inverse Reinforcement Learning without Reinforcement Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
21
34
0
26 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
33
0
0
22 Mar 2023
SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations
Khaled Nakhleh
Minahil Raza
Mack Tang
M. Andrews
Rinu Boney
I. Hadžić
Jeongran Lee
Atefeh Mohajeri
Karina Palyutina
20
4
0
21 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
38
5
0
17 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication
Payam Parvizi
Runnan Zou
C. Bellinger
R. Cheriton
D. Spinello
6
2
0
13 Mar 2023
Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite Communications
Steffen Gracla
Alea Schröder
Maik Röper
C. Bockelmann
D. Wübben
Armin Dekorsy
11
4
0
13 Mar 2023
Visual-Policy Learning through Multi-Camera View to Single-Camera View Knowledge Distillation for Robot Manipulation Tasks
C. Acar
Kuluhan Binici
Alp Tekirdag
Yan Wu
37
1
0
13 Mar 2023
Previous
1
2
3
4
5
6
...
8
9
10
Next