Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.08165
Cited By
Reinforcement Learning with Deep Energy-Based Policies
27 February 2017
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reinforcement Learning with Deep Energy-Based Policies"
50 / 241 papers shown
Title
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Safe-FinRL: A Low Bias and Variance Deep Reinforcement Learning Implementation for High-Freq Stock Trading
Zitao Song
Xuyang Jin
Chenliang Li
OffRL
AIFin
17
1
0
13 Jun 2022
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
17
50
0
01 Jun 2022
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank D. Wood
Adam Scibior
44
7
0
30 May 2022
Maximum entropy optimal density control of discrete-time linear systems and Schrödinger bridges
Kaito Ito
Kenji Kashima
11
12
0
11 Apr 2022
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning
Adam Gleave
Sam Toyer
21
13
0
22 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
12
9
0
24 Feb 2022
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane
Aurelien Lucchi
Vihang Patil
19
3
0
21 Feb 2022
Can Interpretable Reinforcement Learning Manage Prosperity Your Way?
Charl Maree
C. Omlin
AIFin
11
6
0
18 Feb 2022
Sequential Bayesian experimental designs via reinforcement learning
Hikaru Asano
OffRL
15
0
0
14 Feb 2022
Reinforcement Learning Your Way: Agent Characterization through Policy Regularization
Charl Maree
C. Omlin
14
8
0
21 Jan 2022
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning
Yuan Tian
Minghao Han
Chetan S. Kulkarni
Olga Fink
25
13
0
20 Jan 2022
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
B. Kerimkulov
J. Leahy
David Siska
Lukasz Szpruch
22
11
0
18 Jan 2022
Entropy-Regularized Partially Observed Markov Decision Processes
Timothy L. Molloy
G. Nair
13
2
0
22 Dec 2021
Variational Quantum Soft Actor-Critic
Qingfeng Lan
14
20
0
20 Dec 2021
ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms
Michele Polese
Leonardo Bonati
Salvatore D’oro
S. Basagni
Tommaso Melodia
26
148
0
17 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
19
30
0
16 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
27
166
0
08 Dec 2021
Optimization-free Ground Contact Force Constraint Satisfaction in Quadrupedal Locomotion
Eric N. Sihite
Pravin Dangol
A. Ramezani
25
15
0
24 Nov 2021
Maximum Entropy Differential Dynamic Programming
Oswin So
Ziyi Wang
Evangelos A. Theodorou
45
14
0
13 Oct 2021
Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning
Yantian Zha
L. Guan
Subbarao Kambhampati
26
5
0
11 Oct 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
Emergence of Theory of Mind Collaboration in Multiagent Systems
Luyao Yuan
Zipeng Fu
Linqi Zhou
Kexin Yang
Song-Chun Zhu
46
10
0
30 Sep 2021
Robust Control Under Uncertainty via Bounded Rationality and Differential Privacy
Vincent Pacelli
Anirudha Majumdar
24
9
0
17 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo
Anran Hu
Junzi Zhang
OffRL
23
6
0
13 Sep 2021
Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios
Jingliang Duan
Yangang Ren
Fawang Zhang
Yang Guan
Dongjie Yu
Shengbo Eben Li
B. Cheng
Lin Zhao
21
6
0
12 Sep 2021
Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments
Zhenhui Ye
Xiaohong Jiang
Guang-hua Song
Bowei Yang
25
1
0
05 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
16
80
0
01 Sep 2021
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
49
368
0
01 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
111
0
19 Aug 2021
Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs
Timothy L. Molloy
G. Nair
22
13
0
19 Aug 2021
Entropy Regularized Motion Planning via Stein Variational Inference
Alexander Lambert
Byron Boots
43
12
0
11 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
19
139
0
01 Jul 2021
Applications of the Free Energy Principle to Machine Learning and Neuroscience
Beren Millidge
DRL
20
7
0
30 Jun 2021
Active Learning in Robotics: A Review of Control Principles
Annalisa T. Taylor
Thomas A. Berrueta
Todd D. Murphey
27
70
0
25 Jun 2021
The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto
Diana Borsa
Shaobo Hou
Gheorghe Comanici
Eser Aygun
...
Daniel Toyama
Jonathan J. Hunt
Shibl Mourad
David Silver
Doina Precup
11
98
0
24 Jun 2021
Sampling with Mirrored Stein Operators
Jiaxin Shi
Chang-rui Liu
Lester W. Mackey
45
19
0
23 Jun 2021
IQ-Learn: Inverse soft-Q Learning for Imitation
Divyansh Garg
Shuvam Chakraborty
Chris Cundy
Jiaming Song
Matthieu Geist
Stefano Ermon
39
178
0
23 Jun 2021
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Emmanuel Bengio
Moksh Jain
Maksym Korablyov
Doina Precup
Yoshua Bengio
24
306
0
08 Jun 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning
Changnan Xiao
Haosen Shi
Jiajun Fan
Shihong Deng
18
5
0
01 Jun 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
63
29
0
26 May 2021
Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Julen Urain
Anqi Li
Puze Liu
Carlo DÉramo
Jan Peters
33
25
0
11 May 2021
Stein's Method Meets Computational Statistics: A Review of Some Recent Developments
Andreas Anastasiou
Alessandro Barp
F. Briol
B. Ebner
Robert E. Gaunt
...
Qiang Liu
Lester W. Mackey
Chris J. Oates
Gesine Reinert
Yvik Swan
22
35
0
07 May 2021
Hierarchical Reinforcement Learning for Air-to-Air Combat
Adrian P. Pope
J. Ide
Daria Mićović
Henry Diaz
D. Rosenbluth
Lee Ritholtz
Jason C. Twedt
Thayne T. Walker
K. Alcedo
D. Javorsek
17
72
0
03 May 2021
Learning to Robustly Negotiate Bi-Directional Lane Usage in High-Conflict Driving Scenarios
Christoph Killing
Adam R. Villaflor
John M. Dolan
25
5
0
22 Mar 2021
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
OffRL
29
300
0
14 Mar 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
22
67
0
14 Mar 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
26
104
0
10 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
26
174
0
10 Mar 2021
Previous
1
2
3
4
5
Next