Soft Actor-Critic Algorithms and Applications

13 December 2018

Jie Tan

Vikash Kumar

Henry Zhu

Abhishek Gupta

Pieter Abbeel

Sergey Levine

ArXiv PDF HTML

Papers citing "Soft Actor-Critic Algorithms and Applications"

50 / 480 papers shown

Title
Attention Based Communication and Control for Multi-UAV Path Planning Hamid Shiri Hyowoon Seo Jihong Park M. Bennis 21 14 0 20 Dec 2021
Autonomous Reinforcement Learning: Formalism and Benchmarking Archit Sharma Kelvin Xu Nikhil Sardana Abhishek Gupta Karol Hausman Sergey Levine Chelsea Finn OffRL 60 26 0 17 Dec 2021
On Optimizing Interventions in Shared Autonomy Weihao Tan David Koleczek Siddhant Pradhan Nicholas Perello Vivek Chettiar Vishal Rohra Aaslesha Rajaram Soundararajan Srinivasan H. M. S. Hossain Yash Chandak 31 4 0 16 Dec 2021
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Trevor Ablett Bryan Chan Jonathan Kelly 37 4 0 16 Dec 2021
Invariance Through Latent Alignment Takuma Yoneda Ge Yang Matthew R. Walter Bradly C. Stadie OOD 23 9 0 15 Dec 2021
Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration Ziwei Luo Jing Hu Xin Wang Shu Hu Bin Kong Youbing Yin Qi Song Xi Wu Siwei Lyu MedIm 27 12 0 14 Dec 2021
Stochastic Actor-Executor-Critic for Image-to-Image Translation Ziwei Luo Jing Hu Xin Wang Siwei Lyu Bin Kong Youbing Yin Qi Song Xi Wu BDL EGVM 30 5 0 14 Dec 2021
Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots Krishan Rana Vibhavari Dasagi Jesse Haviland Ben Talbot Michael Milford Niko Sünderhauf 32 1 0 10 Dec 2021
Recent Advances in Reinforcement Learning in Finance B. Hambly Renyuan Xu Huining Yang OffRL 29 167 0 08 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks Linghui Meng Muning Wen Yaodong Yang Chenyang Le Xiyun Li Weinan Zhang Ying Wen Haifeng Zhang Jun Wang Bo Xu OffRL 28 38 0 06 Dec 2021
Residual Pathway Priors for Soft Equivariance Constraints Marc Finzi Gregory W. Benton A. Wilson BDL UQCV 24 54 0 02 Dec 2021
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings Kingsley Nweye Bo Liu Peter Stone Zoltán Nagy OffRL AI4CE 37 37 0 25 Nov 2021
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning Nicolai Dorka Tim Welschehold Joschka Boedecker Wolfram Burgard OffRL 30 9 0 24 Nov 2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms Yanwei Jia X. Zhou OffRL 34 79 0 22 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance Yanqiu Wu Xinyue Chen Che Wang Yiming Zhang Keith Ross OffRL 17 9 0 17 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale Yao Lu Karol Hausman Yevgen Chebotar Mengyuan Yan Eric Jang ... Ted Xiao A. Irpan Mohi Khansari Dmitry Kalashnikov Sergey Levine OffRL 95 59 0 09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library Takuma Seno M. Imai OffRL GP 65 100 0 06 Nov 2021
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies Tim Seyde Igor Gilitschenski Wilko Schwarting Bartolomeo Stellato Martin Riedmiller Markus Wulfmeier Daniela Rus 26 44 0 03 Nov 2021
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples Daniel Felipe Ordoñez Apraez Antonio Agudo Francesc Moreno-Noguer Mario Martin 44 8 0 28 Oct 2021
RoMA: Robust Model Adaptation for Offline Model-based Optimization Sihyun Yu Sungsoo Ahn Le Song Jinwoo Shin OffRL 37 31 0 27 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control Vinay Hanumaiah Sahika Genc AI4CE 16 6 0 26 Oct 2021
Learning Insertion Primitives with Discrete-Continuous Hybrid Action Space for Robotic Assembly Tasks Yongyu Wang Shiyu Jin Changhao Wang Xinghao Zhu Masayoshi Tomizuka 28 42 0 25 Oct 2021
Hierarchical Skills for Efficient Exploration Jonas Gehring Gabriel Synnaeve Andreas Krause Nicolas Usunier 28 40 0 20 Oct 2021
Continuous Control with Action Quantization from Demonstrations Robert Dadashi Léonard Hussenot Damien Vincent Sertan Girgin Anton Raichuk M. Geist Olivier Pietquin OffRL 33 23 0 19 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning Edoardo Cetin Oya Celiktutan OffRL 44 17 0 07 Oct 2021
Evaluating model-based planning and planner amortization for continuous control Arunkumar Byravan Leonard Hasenclever Piotr Trochim M. Berk Mirza Alessandro Davide Ialongo ... Jost Tobias Springenberg A. Abdolmaleki N. Heess J. Merel Martin Riedmiller 55 17 0 07 Oct 2021
On the Privacy Risks of Deploying Recurrent Neural Networks in Machine Learning Models Yunhao Yang Parham Gohari Ufuk Topcu AAML 35 3 0 06 Oct 2021
Adaptive control of a mechatronic system using constrained residual reinforcement learning Tom Staessens Tom Lefebvre Guillaume Crevecoeur 22 16 0 06 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning Takuya Hiraoka Takahisa Imagawa Taisei Hashimoto Takashi Onishi Yoshimasa Tsuruoka 13 105 0 05 Oct 2021
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation Soojung Yang Doyeong Hwang Seul Lee Seongok Ryu Sung Ju Hwang 36 67 0 04 Oct 2021
The $f$ -Divergence Reinforcement Learning Framework Chen Gong Qiang He Yunpeng Bai Zhouyi Yang Xiaoyu Chen Xinwen Hou Xianjie Zhang Yu Liu Guoliang Fan 36 3 0 24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience C. Banerjee Zhiyong Chen N. Noman 19 30 0 24 Sep 2021
Accessibility-Based Clustering for Efficient Learning of Locomotion Skills Chong Zhang Wanming Yu Zhibin Li 31 9 0 23 Sep 2021
Shape Control of Deformable Linear Objects with Offline and Online Learning of Local Linear Deformation Models Mingrui Yu Hanzhong Zhong Xiang-Yang Li OffRL AI4CE 38 41 0 23 Sep 2021
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach Minghao Li Yingrui Jie Yang Kong Hui Cheng 43 9 0 17 Sep 2021
Computation Rate Maximum for Mobile Terminals in UAV-assisted Wireless Powered MEC Networks with Fairness Constraint Xiaoyi Zhou Liang Huang Tong Ye Weiqiang Sun 11 1 0 13 Sep 2021
Learning to Navigate Sidewalks in Outdoor Environments Maks Sorokin Jie Tan Karen Liu Sehoon Ha 26 41 0 12 Sep 2021
Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios Jingliang Duan Yangang Ren Fawang Zhang Yang Guan Dongjie Yu Shengbo Eben Li B. Cheng Lin Zhao 23 7 0 12 Sep 2021
Optimizing Quantum Variational Circuits with Deep Reinforcement Learning Owen Lockwood 22 9 0 07 Sep 2021
APPLE: Adaptive Planner Parameter Learning from Evaluative Feedback Zizhao Wang Xuesu Xiao Garrett A. Warnell Peter Stone 25 44 0 22 Aug 2021
Implicitly Regularized RL with Implicit Q-Values Nino Vieillard Marcin Andrychowicz Anton Raichuk Olivier Pietquin M. Geist OffRL 24 9 0 16 Aug 2021
A Pragmatic Look at Deep Imitation Learning Kai Arulkumaran D. Lillrank 35 9 0 04 Aug 2021
Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications Junya Ikemoto T. Ushio 21 3 0 03 Aug 2021
A Reinforcement Learning Approach for Scheduling in mmWave Networks M. Dogan Yahya H. Ezzeldin Christina Fragouli Addison W. Bohannon 22 10 0 01 Aug 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations Tongzhou Mu Z. Ling Fanbo Xiang Derek Yang Xuanlin Li Stone Tao Zhiao Huang Zhiwei Jia Hao Su 41 132 0 30 Jul 2021
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation Wentao Bao Qi Yu Yu Kong FAtt 27 39 0 21 Jul 2021
Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics Krishan Rana Vibhavari Dasagi Jesse Haviland Ben Talbot Michael Milford Niko Sünderhauf BDL OffRL 27 31 0 21 Jul 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning Denis Yarats Rob Fergus A. Lazaric Lerrel Pinto OffRL 36 338 0 20 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks Albert Wilcox Ashwin Balakrishna Brijen Thananjeyan Joseph E. Gonzalez Ken Goldberg 29 11 0 10 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision Vitchyr H. Pong Ashvin Nair Laura M. Smith Catherine Huang Sergey Levine OffRL 34 66 0 08 Jul 2021