Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.08165
Cited By
Reinforcement Learning with Deep Energy-Based Policies
27 February 2017
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reinforcement Learning with Deep Energy-Based Policies"
42 / 242 papers shown
Title
Modified Actor-Critics
Erinc Merdivan
S. Hanke
M. Geist
14
2
0
02 Jul 2019
Learning-Driven Exploration for Reinforcement Learning
Muhammad Usama
D. Chang
11
10
0
17 Jun 2019
Learning Representations by Maximizing Mutual Information Across Views
Philip Bachman
R. Devon Hjelm
William Buchwalter
SSL
52
1,452
0
03 Jun 2019
A Regularized Opponent Model with Maximum Entropy Objective
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
24
30
0
17 May 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
40
266
0
16 Apr 2019
Learning Probabilistic Multi-Modal Actor Models for Vision-Based Robotic Grasping
Mengyuan Yan
A. Li
Mrinal Kalakrishnan
P. Pastor
13
18
0
15 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
9
1
0
05 Apr 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
8
28
0
25 Mar 2019
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
11
140
0
26 Feb 2019
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
117
0
29 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
8
4
0
17 Jan 2019
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
12
433
0
26 Dec 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
17
53
0
03 Nov 2018
Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms
Thomas Gabor
Lenz Belzner
Thomy Phan
Kyrill Schmid
9
14
0
30 Oct 2018
Stein Variational Gradient Descent as Moment Matching
Qiang Liu
Dilin Wang
12
37
0
27 Oct 2018
Optimization of Molecules via Deep Reinforcement Learning
Zhenpeng Zhou
S. Kearnes
Li Li
R. Zare
Patrick F. Riley
AI4CE
16
532
0
19 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
30
550
0
12 Oct 2018
Learning to Perform Local Rewriting for Combinatorial Optimization
Xinyun Chen
Yuandong Tian
NAI
OffRL
16
335
0
30 Sep 2018
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning
Gregory Palmer
Rahul Savani
K. Tuyls
20
27
0
13 Sep 2018
Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory
Jianyi Zhang
Ruiyi Zhang
Lawrence Carin
Changyou Chen
12
46
0
05 Sep 2018
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
14
66
0
09 Aug 2018
Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints
Y. Savas
Melkior Ornik
Murat Cubuktepe
Mustafa O. Karabag
Ufuk Topcu
24
38
0
09 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
16
100
0
04 Jul 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
20
469
0
14 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSL
BDL
AIFin
21
142
0
07 Jun 2018
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
14
125
0
29 May 2018
A Unified Particle-Optimization Framework for Scalable Bayesian Sampling
Changyou Chen
Ruiyi Zhang
Wenlin Wang
Bai Li
Liqun Chen
21
86
0
29 May 2018
Maximum Causal Tsallis Entropy Imitation Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
OOD
21
19
0
22 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
22
656
0
02 May 2018
Multiagent Soft Q-Learning
E. Wei
Drew Wicke
David Freelan
S. Luke
13
79
0
25 Apr 2018
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
32
227
0
13 Feb 2018
Balancing Two-Player Stochastic Games with Soft Q-Learning
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
18
42
0
09 Feb 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
24
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
19
46
0
29 Dec 2017
A short variational proof of equivalence between policy gradients and soft Q learning
Pierre Harvey Richemond
B. Maginnis
16
5
0
22 Dec 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
36
300
0
31 Oct 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
36
739
0
30 Oct 2017
Continuous-Time Flows for Efficient Inference and Density Estimation
Changyou Chen
Chunyuan Li
Liquan Chen
Wenlin Wang
Yunchen Pu
Lawrence Carin
TPM
34
57
0
04 Sep 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
24
24
0
06 Aug 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
25
542
0
13 Jul 2017
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
29
339
0
21 Apr 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,502
0
25 Jan 2017
Previous
1
2
3
4
5