Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1801.01290
Cited By
v1
v2 (latest)
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
4 January 2018
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor"
50 / 4,552 papers shown
Advancing Safe Mechanical Ventilation Using Offline RL With Hybrid Actions and Clinically Aligned Rewards
Muhammad Hamza Yousuf
Jason Li
S. Vahdati
Raphael Theilen
Jakob Wittenstein
Jens Lehmann
OffRL
206
1
0
17 Jun 2025
Scaling Algorithm Distillation for Continuous Control with Mamba
Samuel Beaussant
Mehdi Mounsif
199
0
0
16 Jun 2025
Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy
Symposium on Software Performance (SP), 2025
Amornyos Horprasert
Esa Apriaskar
Xingyu Liu
Lanlan Su
Lyudmila S. Mihaylova
152
1
0
16 Jun 2025
Learning Swing-up Maneuvers for a Suspended Aerial Manipulation Platform in a Hierarchical Control Framework
Hemjyoti Das
Minh Nhat Vu
Christian Ott
120
0
0
16 Jun 2025
A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method
IEEE Transactions on Industrial Informatics (IEEE TII), 2025
Zhanhua Xin
Zhihao Wang
Shenghao Zhang
Wanchao Chi
Yan Meng
Shihan Kong
Yan Xiong
Chong Zhang
Yuzhen Liu
Junzhi Yu
110
0
0
16 Jun 2025
Flow-Based Policy for Online Reinforcement Learning
Lei Lv
Y. Li
Yu-Juan Luo
F. Sun
Tao Kong
Jiafeng Xu
Xiao Ma
346
9
0
15 Jun 2025
Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models
Tung M. Luu
Younghwan Lee
Donghoon Lee
Sunho Kim
Min Jun Kim
Chang D. Yoo
ALM
VLM
202
8
0
15 Jun 2025
Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients
IEEE Transactions on Medical Robotics and Bionics (TMRB), 2025
Chapa Sirithunge
Yue Xie
Saitarun Nadipineni
Fumiya Iida
Thilina Dulantha Lalitharatne
152
0
0
13 Jun 2025
CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
Federico Zocco
Monica Malvezzi
158
0
0
13 Jun 2025
DoublyAware: Dual Planning and Policy Awareness for Temporal Difference Learning in Humanoid Locomotion
Khang Nguyen
An T. Le
Jan Peters
Minh Nhat Vu
219
0
0
12 Jun 2025
Bipedal Balance Control with Whole-body Musculoskeletal Standing and Falling Simulations
Chengtian Ma
Yunyue Wei
Chenhui Zuo
Chen Zhang
Yanan Sui
226
1
0
11 Jun 2025
Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
Andreas Schlaginhaufen
Reda Ouhamma
Maryam Kamgarpour
249
1
0
11 Jun 2025
Wasserstein Barycenter Soft Actor-Critic
Zahra Shahrooei
Ali Baheri
OffRL
288
1
0
11 Jun 2025
On a few pitfalls in KL divergence gradient estimation for RL
Yunhao Tang
Rémi Munos
263
10
0
11 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
304
2
0
10 Jun 2025
Your Agent Can Defend Itself against Backdoor Attacks
Li Changjiang
Liang Jiacheng
Cao Bochuan
Chen Jinghui
Wang Ting
AAML
LLMAG
359
5
0
10 Jun 2025
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
International Conference on Learning Representations (ICLR), 2025
Qingmao Yao
Zhichao Lei
Tianyuan Chen
Ziyue Yuan
Xuefan Chen
Jianxiang Liu
Faguo Wu
Xiao Zhang
OffRL
202
2
0
10 Jun 2025
Time-Aware World Model for Adaptive Prediction and Control
Anh N. Nhu
Sanghyun Son
Ming-Chyuan Lin
AI4TS
TTA
222
1
0
10 Jun 2025
Dynamical System Optimization
Emo Todorov
164
0
0
10 Jun 2025
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning
Neset Unver Akmandor
Sarvesh Prajapati
Mark Zolotas
T. Padır
138
0
0
10 Jun 2025
Deep Reinforcement Learning-Based Motion Planning and PDE Control for Flexible Manipulators
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Amir Hossein Barjini
Seyed Adel Alizadeh Kolagar
Sadeq Yaqubi
Jouni Mattila
118
2
0
10 Jun 2025
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
Yihong Guo
Yu Yang
Pan Xu
Anqi Liu
OffRL
217
1
0
10 Jun 2025
Realistic Urban Traffic Generator using Decentralized Federated Learning for the SUMO simulator
IEEE Open Journal of the Communications Society (IEEE Open J. Commun. Soc.), 2025
Alberto Bazán-Guillén
Carlos Beis-Penedo
Diego Cajaraville-Aboy
Pablo Barbecho-Bautista
R. Redondo
Luis J. de la Cruz Llopis
Ana Fernández-Vilas
Mónica Aguilar Igartua
M. Fernández-Veiga
AI4TS
182
0
0
09 Jun 2025
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
Seungho Baek
Taegeon Park
Jongchan Park
Seungjun Oh
Yusung Kim
OffRL
287
2
0
09 Jun 2025
Monotone and Conservative Policy Iteration Beyond the Tabular Case
Eshwar S. R.
Gugan Thoppe
Ananyabrata Barua
Aditya Gopalan
Gal Dalal
196
1
0
08 Jun 2025
Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain
Dimitris Panagopoulos
Adolfo Perrusquía
Weisi Guo
135
0
0
07 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
310
0
0
06 Jun 2025
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Motoki Omura
Kazuki Ota
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
307
0
0
06 Jun 2025
Self driving algorithm for an active four wheel drive racecar
Gergely Bari
Laszlo Palkovics
216
0
0
06 Jun 2025
AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization
Saeed Hedayatian
Stefanos Nikolaidis
124
1
0
05 Jun 2025
When Maximum Entropy Misleads Policy Optimization
Ruipeng Zhang
Ya-Chien Chang
Sicun Gao
173
6
0
05 Jun 2025
Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Kyungsoo Kim
Jeongsoo Ha
Yusung Kim
BDL
168
10
0
05 Jun 2025
Latent Guided Sampling for Combinatorial Optimization
Sobihan Surendran
Adeline Fermanian
Sylvain Le Corff
BDL
OffRL
221
0
0
04 Jun 2025
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration
Chengdong Wu
Sven Kirchner
Nils Purschke
Alois Knoll
172
2
0
04 Jun 2025
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
632
15
0
04 Jun 2025
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yangyang Zhao
Ben Niu
L. Qin
Shihan Wang
250
3
0
04 Jun 2025
FLIP: Flowability-Informed Powder Weighing
Nikola Radulov
Alex Wright
Thomas Little
Andrew I. Cooper
Gabriella Pizzuto
270
1
0
04 Jun 2025
Verification-Guided Falsification for Safe RL via Explainable Abstraction and Risk-Aware Exploration
Tuan Le
Risal Shahriar Shefin
Debashis Gupta
Thai Le
Sarra Alqahtani
OffRL
242
0
0
04 Jun 2025
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning
Théo Vincent
Yogesh Tripathi
Tim Lukas Faust
Yaniv Oren
Jan Peters
Carlo DÉramo
CLL
238
2
0
04 Jun 2025
Self-Composing Policies for Scalable Continual Reinforcement Learning
Mikel Malagón
Josu Ceberio
Jose A. Lozano
CLL
338
10
0
04 Jun 2025
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving
Li Zeqiao
Wang Yijing
Wang Haoyu
Li Zheng
Li Peng
Zuo zhiqiang
Hu Chuan
343
0
0
04 Jun 2025
Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods
Tom Danino
Nahum Shimkin
212
1
0
03 Jun 2025
A Hybrid Approach to Indoor Social Navigation: Integrating Reactive Local Planning and Proactive Global Planning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Arnab Debnath
Gregory J. Stein
Jana Kosecka
180
1
0
03 Jun 2025
Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making
Xu Wan
Wenyue Xu
Chao Yang
Mingyang Sun
221
3
0
03 Jun 2025
Efficient Manipulation-Enhanced Semantic Mapping With Uncertainty-Informed Action Selection
Nils Dengler
Jesper Mucke
Rohit Menon
Maren Bennewitz
239
5
0
02 Jun 2025
Reinforcement Learning with Data Bootstrapping for Dynamic Subgoal Pursuit in Humanoid Robot Navigation
Chengyang Peng
Zhihao Zhang
Shiting Gong
Sankalp Agrawal
Keith A. Redmill
Ayonga Hereid
166
1
0
02 Jun 2025
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
Yixian Zhang
Huaze Tang
Changxu Wei
Wenbo Ding
179
0
0
02 Jun 2025
MAGIK: Mapping to Analogous Goals via Imagination-enabled Knowledge Transfer
Ajsal Shereef Palattuparambil
Thommen George Karimpanal
Santu Rana
OffRL
308
0
0
02 Jun 2025
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments
Umberto Gonçalves de Sousa
AI4CE
75
0
0
02 Jun 2025
Trajectory First: A Curriculum for Discovering Diverse Policies
Cornelius V. Braun
Sayantan Auddy
Marc Toussaint
300
0
0
02 Jun 2025
Previous
1
2
3
...
8
9
10
...
90
91
92
Next
Page 9 of 92
Page
of 92
Go