1812.05905
Cited By
Soft Actor-Critic Algorithms and Applications
13 December 2018
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
Jie Tan
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
Papers citing
"Soft Actor-Critic Algorithms and Applications"
50 / 475 papers shown
Title
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
43
2
0
13 Sep 2024
IR2: Implicit Rendezvous for Robotic Exploration Teams under Sparse Intermittent Connectivity
Derek Ming Siang Tan
Yixiao Ma
Jingsong Liang
Yi Cheng Chng
Yuhong Cao
Guillaume Sartoretti
61
3
0
07 Sep 2024
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU
Peng Zhu
Yuante Li
Yifan Hu
Qinyuan Liu
Dawei Cheng
Yuqi Liang
AIFin
AI4TS
46
4
0
26 Aug 2024
Physics-Driven AI Correction in Laser Absorption Sensing Quantification
Ruiyuan Kang
P. Liatsis
Meixia Geng
Qingjie Yang
45
0
0
20 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
38
3
0
11 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
46
6
0
06 Aug 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia Herbert
42
6
0
12 Jul 2024
RoboMorph: Evolving Robot Morphology using Large Language Models
Kevin Qiu
Krzysztof Ciebiera
Marek Cygan
Łukasz Kuciński
LM&Ro
54
0
0
11 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
39
0
0
05 Jul 2024
CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving
Jonathan Booher
Khashayar Rohanimanesh
Junhong Xu
Vladislav Isenbaev
Ashwin Balakrishna
Ishan Gupta
Wei Liu
Aleksandr Petiushko
OffRL
34
7
0
13 Jun 2024
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control
Rudolf Reiter
Andrea Ghezzi
Katrin Baumgärtner
Jasper Hoffmann
Robert D. McAllister
Moritz Diehl
36
6
0
06 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
OffRL
33
0
0
03 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
3
0
31 May 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
50
2
0
30 May 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
51
5
0
29 May 2024
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Nicklas Hansen
V. JyothirS
Vlad Sobal
Yann LeCun
Xiaolong Wang
Hao Su
VGen
54
10
0
28 May 2024
A Pontryagin Perspective on Reinforcement Learning
Onno Eberhard
Claire Vernade
Michael Muehlebach
45
2
0
28 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Miłoś
Marek Cygan
OffRL
49
17
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
30
9
0
24 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
42
2
0
23 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
22
6
0
07 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
44
1
0
07 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
Safa Messaoud
Billel Mokeddem
Zhenghai Xue
Linsey Pang
Bo An
Haipeng Chen
Sanjay Chawla
46
3
0
02 May 2024
Employing Federated Learning for Training Autonomous HVAC Systems
Fredrik Hagström
Vikas K. Garg
Fabricio Oliveira
AI4CE
70
0
0
01 May 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
40
0
0
25 Apr 2024
SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring
Kaichen Huang
Minghao Shao
Shenghua Wan
Hai-Hang Sun
Shuai Feng
Le Gan
De-Chuan Zhan
40
0
0
04 Apr 2024
K-percent Evaluation for Lifelong RL
Golnaz Mesbahi
Parham Mohammad Panahi
Olya Mastikhina
Martha White
Adam White
CLL
OffRL
42
0
0
02 Apr 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
42
8
0
14 Mar 2024
Bayesian Constraint Inference from User Demonstrations Based on Margin-Respecting Preference Models
Dimitris Papadimitriou
Daniel S. Brown
50
1
0
04 Mar 2024
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
40
6
0
04 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Miłoś
Tomasz Trzciński
M. Ostaszewski
Marek Cygan
OffRL
37
17
0
01 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
42
0
0
29 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction
Jie Xu
Hanbo Zhang
Xinghang Li
Huaping Liu
Xuguang Lan
Tao Kong
LM&Ro
38
3
0
19 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
19
2
0
08 Feb 2024
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning
Shathushan Sivashangaran
Apoorva Khairnar
A. Eskandarian
OffRL
45
0
0
07 Feb 2024
Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks
Mehdi Heydari Shahna
Seyed Adel Alizadeh Kolagar
Jouni Mattila
24
4
0
04 Feb 2024
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
59
0
0
31 Jan 2024
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator
Ryoma Furuyama
Daiki Kuyoshi
Satoshi Yamane
23
0
0
30 Jan 2024
Tacit algorithmic collusion in deep reinforcement learning guided price competition: A study using EV charge pricing game
Diwas Paudel
Tapas K. Das
20
0
0
25 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
36
3
0
30 Dec 2023
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob J. Hollenstein
Georg Martius
J. Piater
22
3
0
18 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning
Sergio A. Serrano
J. Martínez-Carranza
L. Sucar
30
2
0
05 Dec 2023
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)
Bumgeun Park
Taeyoung Kim
Quoc-Vinh Lai-Dang
Dongsoo Har
19
1
0
06 Nov 2023
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula
Aryaman Reddi
Maximilian Tölle
Jan Peters
Georgia Chalvatzaki
Carlo D'Eramo
45
4
0
03 Nov 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
36
1
0
31 Oct 2023