ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.08165
  4. Cited By
Reinforcement Learning with Deep Energy-Based Policies

Reinforcement Learning with Deep Energy-Based Policies

27 February 2017
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
ArXivPDFHTML

Papers citing "Reinforcement Learning with Deep Energy-Based Policies"

50 / 242 papers shown
Title
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
46
10
0
28 Aug 2023
Reinforcement Learning by Guided Safe Exploration
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
26
5
0
26 Jul 2023
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Chanwoo Park
Kaipeng Zhang
Asuman Ozdaglar
30
8
0
13 Jul 2023
Provably Fast Finite Particle Variants of SVGD via Virtual Particle
  Stochastic Approximation
Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation
Aniket Das
Dheeraj M. Nagaraj
35
7
0
27 May 2023
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems
  with GFlowNets
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets
Dinghuai Zhang
H. Dai
Nikolay Malkin
Aaron Courville
Yoshua Bengio
L. Pan
24
36
0
26 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax
  Optimality in Linear MDPs: Theory and Practice
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
30
2
0
22 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
28
39
0
22 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
34
8
0
05 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value
  Regularization
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
71
0
28 Mar 2023
Multi-Task Reinforcement Learning in Continuous Control with Successor
  Feature-Based Concurrent Composition
Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition
Y. Liu
Aamir Ahmad
29
4
0
24 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and
  Global Optimality
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
23
0
0
22 Mar 2023
Fast Rates for Maximum Entropy Exploration
Fast Rates for Maximum Entropy Exploration
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
41
17
0
14 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between
  Robustness and Regularization
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
39
1
0
12 Mar 2023
Inference on Optimal Dynamic Policies via Softmax Approximation
Inference on Optimal Dynamic Policies via Softmax Approximation
Qizhao Chen
Morgane Austern
Vasilis Syrgkanis
OffRL
29
1
0
08 Mar 2023
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality
Hsuan-Kung Yang
Tsung-Chih Chiang
Tingxin Liu
Chun-Wei Huang
Jou-Min Liu
Tsu-Ching Hsiao
Chun-Yi Lee
28
1
0
05 Mar 2023
Model-based Constrained MDP for Budget Allocation in Sequential
  Incentive Marketing
Model-based Constrained MDP for Budget Allocation in Sequential Incentive Marketing
Shuai Xiao
Le Guo
Zaifan Jiang
Lei Lv
Yuanbo Chen
Jun Zhu
Shuang Yang
19
21
0
02 Mar 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
15
7
0
24 Feb 2023
Differentiable Arbitrating in Zero-sum Markov Games
Differentiable Arbitrating in Zero-sum Markov Games
Jing Wang
Meichen Song
Feng Gao
Boyi Liu
Zhaoran Wang
Yi Wu
37
2
0
20 Feb 2023
Stochastic Generative Flow Networks
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
44
30
0
19 Feb 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood
  Framework for Offline Inverse Reinforcement Learning
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
34
13
0
15 Feb 2023
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
36
1
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
34
3
0
30 Jan 2023
Learning to Generate All Feasible Actions
Learning to Generate All Feasible Actions
Mirco Theile
Daniele Bernardini
Raphael Trumpp
C. Piazza
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
29
2
0
26 Jan 2023
Automated deep reinforcement learning for real-time scheduling strategy
  of multi-energy system integrated with post-carbon and direct-air carbon
  captured system
Automated deep reinforcement learning for real-time scheduling strategy of multi-energy system integrated with post-carbon and direct-air carbon captured system
Tobi Michael Alabi
Nathan P. Lawrence
Lin Lu
Zaiyue Yang
R. Bhushan Gopaluni
11
28
0
18 Jan 2023
A survey and taxonomy of loss functions in machine learning
A survey and taxonomy of loss functions in machine learning
Lorenzo Ciampiconi
A. Elwood
Marco Leonardi
A. Mohamed
A. Rozza
MU
FaML
9
25
0
13 Jan 2023
A Constrained-Optimization Approach to the Execution of Prioritized
  Stacks of Learned Multi-Robot Tasks
A Constrained-Optimization Approach to the Execution of Prioritized Stacks of Learned Multi-Robot Tasks
Gennaro Notomista
17
2
0
13 Jan 2023
On The Fragility of Learned Reward Functions
On The Fragility of Learned Reward Functions
Lev McKinney
Yawen Duan
David M. Krueger
Adam Gleave
28
20
0
09 Jan 2023
Active Classification of Moving Targets with Learned Control Policies
Active Classification of Moving Targets with Learned Control Policies
Álvaro Serra-Gómez
Eduardo Montijano
Wendelin Bohmer
Javier Alonso-Mora
48
4
0
06 Dec 2022
Curriculum Learning for Relative Overgeneralization
Curriculum Learning for Relative Overgeneralization
Lin Shi
Bei Peng
25
1
0
06 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in
  Entropy-Regularized Reinforcement Learning
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
37
8
0
02 Dec 2022
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Q. Zhang
CML
OffRL
20
5
0
28 Nov 2022
A Finite-Particle Convergence Rate for Stein Variational Gradient
  Descent
A Finite-Particle Convergence Rate for Stein Variational Gradient Descent
Jiaxin Shi
Lester W. Mackey
28
18
0
17 Nov 2022
Policy-Based Reinforcement Learning for Assortative Matching in Human
  Behavior Modeling
Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling
Ou Deng
Qun Jin
17
1
0
08 Nov 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement
  Learning
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
28
4
0
30 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic
  Manipulator Control
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
CUP: Critic-Guided Policy Reuse
CUP: Critic-Guided Policy Reuse
Jin Zhang
Siyuan Li
Chongjie Zhang
23
8
0
15 Oct 2022
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep
  Reinforcement Learning
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
Zih-Yun Chiu
Yi-Lin Tuan
William Yang Wang
Michael C. Yip
OffRL
25
3
0
07 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
34
42
0
04 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing
  Plausible Novel States
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
20
3
0
01 Oct 2022
Regularized Soft Actor-Critic for Behavior Transfer Learning
Regularized Soft Actor-Critic for Behavior Transfer Learning
Mingxi Tan
Andong Tian
Ludovic Denoyer
20
3
0
27 Sep 2022
DEFT: Diverse Ensembles for Fast Transfer in Reinforcement Learning
DEFT: Diverse Ensembles for Fast Transfer in Reinforcement Learning
S. Adebola
Satvik Sharma
K. Shivakumar
OffRL
8
1
0
26 Sep 2022
A Deep Reinforcement Learning-Based Charging Scheduling Approach with
  Augmented Lagrangian for Electric Vehicle
A Deep Reinforcement Learning-Based Charging Scheduling Approach with Augmented Lagrangian for Electric Vehicle
Guibin Chen
Xiaoying Shi
19
3
0
20 Sep 2022
Towards Task-Prioritized Policy Composition
Towards Task-Prioritized Policy Composition
Finn Rietz
Erik Schaffernicht
Todor Stoyanov
J. A. Stork
20
0
0
20 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation
  Towards Real via Offline Reinforcement Learning
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
18
3
0
19 Sep 2022
Variational Inference for Model-Free and Model-Based Reinforcement
  Learning
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
13
0
0
04 Sep 2022
Towards Artificial Virtuous Agents: Games, Dilemmas and Machine Learning
Ajay Vishwanath
E. Bøhn
Ole-Christoffer Granmo
Charl Maree
C. Omlin
AI4CE
30
5
0
30 Aug 2022
Semantic Driven Energy based Out-of-Distribution Detection
Semantic Driven Energy based Out-of-Distribution Detection
Abhishek Joshi
Sathish Chalasani
K. N. Iyer
OODD
29
4
0
23 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph
  Learning for Continuous Action Space
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
20
3
0
23 Aug 2022
Entropy Augmented Reinforcement Learning
Entropy Augmented Reinforcement Learning
Jianfei Ma
28
0
0
19 Aug 2022
Previous
12345
Next