ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.08165
  4. Cited By
Reinforcement Learning with Deep Energy-Based Policies

Reinforcement Learning with Deep Energy-Based Policies

27 February 2017
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Reinforcement Learning with Deep Energy-Based Policies"

42 / 242 papers shown
Title
Modified Actor-Critics
Modified Actor-Critics
Erinc Merdivan
S. Hanke
M. Geist
14
2
0
02 Jul 2019
Learning-Driven Exploration for Reinforcement Learning
Learning-Driven Exploration for Reinforcement Learning
Muhammad Usama
D. Chang
11
10
0
17 Jun 2019
Learning Representations by Maximizing Mutual Information Across Views
Learning Representations by Maximizing Mutual Information Across Views
Philip Bachman
R. Devon Hjelm
William Buchwalter
SSL
52
1,452
0
03 Jun 2019
A Regularized Opponent Model with Maximum Entropy Objective
A Regularized Opponent Model with Maximum Entropy Objective
Zheng Tian
Ying Wen
Zhichen Gong
Faiz Punakkath
Shihao Zou
Jun Wang
24
30
0
17 May 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
40
266
0
16 Apr 2019
Learning Probabilistic Multi-Modal Actor Models for Vision-Based Robotic
  Grasping
Learning Probabilistic Multi-Modal Actor Models for Vision-Based Robotic Grasping
Mengyuan Yan
A. Li
Mrinal Kalakrishnan
P. Pastor
13
18
0
15 Apr 2019
Multi-Preference Actor Critic
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
9
1
0
05 Apr 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
8
28
0
25 Mar 2019
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
11
140
0
26 Feb 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
117
0
29 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's
  Mission Execution
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
8
4
0
17 Jan 2019
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
12
433
0
26 Dec 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
17
53
0
03 Nov 2018
Preparing for the Unexpected: Diversity Improves Planning Resilience in
  Evolutionary Algorithms
Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms
Thomas Gabor
Lenz Belzner
Thomy Phan
Kyrill Schmid
9
14
0
30 Oct 2018
Stein Variational Gradient Descent as Moment Matching
Stein Variational Gradient Descent as Moment Matching
Qiang Liu
Dilin Wang
12
37
0
27 Oct 2018
Optimization of Molecules via Deep Reinforcement Learning
Optimization of Molecules via Deep Reinforcement Learning
Zhenpeng Zhou
S. Kearnes
Li Li
R. Zare
Patrick F. Riley
AI4CE
16
532
0
19 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
30
550
0
12 Oct 2018
Learning to Perform Local Rewriting for Combinatorial Optimization
Learning to Perform Local Rewriting for Combinatorial Optimization
Xinyun Chen
Yuandong Tian
NAI
OffRL
16
335
0
30 Sep 2018
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning
Gregory Palmer
Rahul Savani
K. Tuyls
20
27
0
13 Sep 2018
Stochastic Particle-Optimization Sampling and the Non-Asymptotic
  Convergence Theory
Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory
Jianyi Zhang
Ruiyi Zhang
Lawrence Carin
Changyou Chen
12
46
0
05 Sep 2018
Policy Optimization as Wasserstein Gradient Flows
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
14
66
0
09 Aug 2018
Entropy Maximization for Markov Decision Processes Under Temporal Logic
  Constraints
Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints
Y. Savas
Melkior Ornik
Murat Cubuktepe
Mustafa O. Karabag
Ufuk Topcu
24
38
0
09 Jul 2018
Using Reinforcement Learning with Partial Vehicle Detection for
  Intelligent Traffic Signal Control
Using Reinforcement Learning with Partial Vehicle Detection for Intelligent Traffic Signal Control
Rusheng Zhang
A. Ishikawa
Wenli Wang
Benjamin Striner
Ozan Tonguz
16
100
0
04 Jul 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
20
469
0
14 Jun 2018
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement
  Learning with Trajectory Embeddings
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John D. Co-Reyes
YuXuan Liu
Abhishek Gupta
Benjamin Eysenbach
Pieter Abbeel
Sergey Levine
SSL
BDL
AIFin
21
142
0
07 Jun 2018
Variational Inverse Control with Events: A General Framework for
  Data-Driven Reward Definition
Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition
Justin Fu
Avi Singh
Dibya Ghosh
Larry Yang
Sergey Levine
BDL
14
125
0
29 May 2018
A Unified Particle-Optimization Framework for Scalable Bayesian Sampling
A Unified Particle-Optimization Framework for Scalable Bayesian Sampling
Changyou Chen
Ruiyi Zhang
Wenlin Wang
Bai Li
Liqun Chen
21
86
0
29 May 2018
Maximum Causal Tsallis Entropy Imitation Learning
Maximum Causal Tsallis Entropy Imitation Learning
Kyungjae Lee
Sungjoon Choi
Songhwai Oh
OOD
21
19
0
22 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
22
656
0
02 May 2018
Multiagent Soft Q-Learning
Multiagent Soft Q-Learning
E. Wei
Drew Wicke
David Freelan
S. Luke
13
79
0
25 Apr 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
32
227
0
13 Feb 2018
Balancing Two-Player Stochastic Games with Soft Q-Learning
Balancing Two-Player Stochastic Games with Soft Q-Learning
Jordi Grau-Moya
Felix Leibfried
Haitham Bou-Ammar
18
42
0
09 Feb 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function
  Approximation
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
24
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
Boosting the Actor with Dual Critic
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
19
46
0
29 Dec 2017
A short variational proof of equivalence between policy gradients and
  soft Q learning
A short variational proof of equivalence between policy gradients and soft Q learning
Pierre Harvey Richemond
B. Maginnis
16
5
0
22 Dec 2017
Backpropagation through the Void: Optimizing control variates for
  black-box gradient estimation
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
36
300
0
31 Oct 2017
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
36
739
0
30 Oct 2017
Continuous-Time Flows for Efficient Inference and Density Estimation
Continuous-Time Flows for Efficient Inference and Density Estimation
Changyou Chen
Chunyuan Li
Liquan Chen
Wenlin Wang
Yunchen Pu
Lawrence Carin
TPM
34
57
0
04 Sep 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement
  Learning
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
24
24
0
06 Aug 2017
Distral: Robust Multitask Reinforcement Learning
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
25
542
0
13 Jul 2017
Equivalence Between Policy Gradients and Soft Q-Learning
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
29
339
0
21 Apr 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,502
0
25 Jan 2017
Previous
12345