Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.05905
Cited By
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 475 papers shown
Title
Evolving Populations of Diverse RL Agents with MAP-Elites
Thomas Pierrot
Arthur Flajolet
35
8
0
09 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
51
3
0
08 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
36
15
0
07 Mar 2023
Constrained Reinforcement Learning and Formal Verification for Safe Colonoscopy Navigation
Davide Corsi
Luca Marzari
Ameya Pore
Alessandro Farinelli
A. Casals
Paolo Fiorini
Diego DallÁlba
27
9
0
06 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
28
1
0
05 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
29
26
0
28 Feb 2023
Active Reward Learning from Online Preferences
Vivek Myers
Erdem Biyik
Dorsa Sadigh
OffRL
37
12
0
27 Feb 2023
Minimax-Bayes Reinforcement Learning
Thomas Kleine Buening
Christos Dimitrakakis
Hannes Eriksson
Divya Grover
Emilio Jorge
OffRL
16
5
0
21 Feb 2023
Differentiable Arbitrating in Zero-sum Markov Games
Jing Wang
Meichen Song
Feng Gao
Boyi Liu
Zhaoran Wang
Yi Wu
43
2
0
20 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
39
4
0
17 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
40
163
0
06 Feb 2023
Target-based Surrogates for Stochastic Optimization
J. Lavington
Sharan Vaswani
Reza Babanezhad
Mark Schmidt
Nicolas Le Roux
57
5
0
06 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
52
0
0
04 Feb 2023
MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks
Raffaele Galliera
A. Morelli
Roberto Fronteddu
N. Suri
32
4
0
02 Feb 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
26
1
0
01 Feb 2023
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
34
1
0
31 Jan 2023
Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem
Hélène Plisnier
Denis Steckelmacher
Jeroen Willems
B. Depraetere
Ann Nowé
OffRL
32
1
0
30 Jan 2023
Learning passive policies with virtual energy tanks in robotics
R. Zanella
G. Palli
Stefano Stramigioli
Federico Califano
30
3
0
30 Jan 2023
Zero-Shot Transfer of Haptics-Based Object Insertion Policies
Samarth Brahmbhatt
A. Deka
Andrew Spielberg
M. Muller
9
6
0
29 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
23
0
0
19 Jan 2023
Deep Reinforcement Learning for Autonomous Ground Vehicle Exploration Without A-Priori Maps
Shathushan Sivashangaran
A. Eskandarian
32
4
0
10 Jan 2023
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
30
1
0
10 Jan 2023
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
33
20
0
09 Jan 2023
MERLIN: Multi-agent offline and transfer learning for occupant-centric energy flexible operation of grid-interactive communities using smart meter data and CityLearn
Kingsley Nweye
S. Sankaranarayanan
Zoltán Nagy
OffRL
AI4CE
25
25
0
31 Dec 2022
On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
Tim G. J. Rudner
Cong Lu
Michael A. Osborne
Yarin Gal
Yee Whye Teh
OffRL
38
27
0
28 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi Ma
Sergey Levine
37
14
0
24 Dec 2022
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
32
19
0
19 Dec 2022
Cross-Domain Transfer via Semantic Skill Imitation
Karl Pertsch
Ruta Desai
Vikash Kumar
Franziska Meier
Joseph J. Lim
Dhruv Batra
Akshara Rai
LM&Ro
16
19
0
14 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
32
49
0
12 Dec 2022
Generalizing LTL Instructions via Future Dependent Options
Duo Xu
Faramarz Fekri
OffRL
AI4CE
24
1
0
08 Dec 2022
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
31
9
0
04 Dec 2022
A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing
Rudolf Reiter
Jasper Hoffmann
Joschka Boedecker
Moritz Diehl
30
13
0
03 Dec 2022
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter
Timo Thun
Tobias Meisen
36
1
0
01 Dec 2022
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
29
0
0
27 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
34
13
0
21 Nov 2022
Building a Subspace of Policies for Scalable Continual Learning
Jean-Baptiste Gaya
T. Doan
Lucas Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
42
29
0
18 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
39
27
0
16 Nov 2022
ToolFlowNet: Robotic Manipulation with Tools via Predicting Tool Flow from Point Clouds
Daniel Seita
Yufei Wang
Sarthak J. Shetty
Edward Li
Zackory M. Erickson
David Held
3DPC
30
49
0
16 Nov 2022
Model Based Residual Policy Learning with Applications to Antenna Control
Viktor Eriksson Mollerstedt
Alessio Russo
Maxime Bouton
OffRL
31
3
0
16 Nov 2022
Offline Reinforcement Learning with Adaptive Behavior Regularization
Yunfan Zhou
Xijun Li
Qingyu Qu
OffRL
27
1
0
15 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
30
15
0
12 Nov 2022
Progress and summary of reinforcement learning on energy management of MPS-EV
Jincheng Hu
Yang Lin
Liang Chu
Zhuoran Hou
Jihan Li
Jingjing Jiang
Yuanjian Zhang
23
12
0
08 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert W. Platt
OffRL
30
19
0
03 Nov 2022
Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection
Catherine R. Glossop
Jacopo Panerati
A. Krishnan
Zhaocong Yuan
Angela P. Schoellig
24
6
0
27 Oct 2022
Previous
1
2
3
4
5
...
8
9
10
Next