ResearchTrend.AI
Distributed Distributional Deterministic Policy Gradients

23 April 2018
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
    OffRL

Papers citing "Distributed Distributional Deterministic Policy Gradients"

50 / 108 papers shown
Learning State Representations via Retracing in Reinforcement Learning
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
27
7
0
24 Nov 2021
Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning
Daniel J. B. Harrold
Jun Cao
Zhongbo Fan
28
61
0
21 Nov 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
11
39
0
28 Oct 2021
Learning Robotic Manipulation Skills Using an Adaptive Force-Impedance Action Space
Maximilian Ulmer
Elie Aljalbout
Sascha Schwarz
Sami Haddadin
13
6
0
19 Oct 2021
Safety-Critical Learning of Robot Control with Temporal Logic Specifications
Mingyu Cai
C. Vasile
35
4
0
07 Sep 2021
Learning Practically Feasible Policies for Online 3D Bin Packing
Hang Zhao
Chenyang Zhu
Xin Xu
Hui Huang
Kai Xu
OffRL
16
80
0
31 Aug 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
31
145
0
26 Aug 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
70
78
0
12 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning
Stephen Kelly
Tatiana Voegerl
W. Banzhaf
C. Gondro
45
13
0
23 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
42
77
0
01 Jun 2021
On Instrumental Variable Regression for Deep Offline Policy Evaluation
Yutian Chen
Liyuan Xu
Çağlar Gülçehre
T. Paine
A. Gretton
Nando de Freitas
Arnaud Doucet
OffRL
39
18
0
21 May 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
23
37
0
17 Mar 2021
Robust MAML: Prioritization task buffer with adaptive learning process for model-agnostic meta-learning
Thanh Nguyen
Tung M. Luu
T. Pham
Sanzhar Rakhimkul
Chang D. Yoo
19
10
0
15 Mar 2021
Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing
Axel Brunnbauer
Luigi Berducci
Andreas Brandstätter
Mathias Lechner
Ramin Hasani
Daniela Rus
Radu Grosu
LM&Ro
38
37
0
08 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Cheng Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
35
4
0
03 Mar 2021
Task-Agnostic Morphology Evolution
D. Hejna
Pieter Abbeel
Lerrel Pinto
30
26
0
25 Feb 2021
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
22
4
0
24 Dec 2020
Offline Learning from Demonstrations and Unlabeled Experience
Konrad Zolna
Alexander Novikov
Ksenia Konyushkova
Çağlar Gülçehre
Ziyun Wang
Y. Aytar
Misha Denil
Nando de Freitas
Scott E. Reed
SSL
OffRL
32
66
0
27 Nov 2020
Deep Reinforcement Learning for Resource Constrained Multiclass Scheduling in Wireless Networks
Apostolos Avranas
Marios Kountouris
P. Ciblat
19
7
0
27 Nov 2020
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling
Ying Fan
Yifei Ming
25
17
0
20 Nov 2020
LBGP: Learning Based Goal Planning for Autonomous Following in Front
Payam Nikdel
R. Vaughan
Mo Chen
22
13
0
05 Nov 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
24
11
0
20 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
72
86
0
19 Oct 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
34
79
0
17 Sep 2020
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Hang Zhao
Qijin She
Chenyang Zhu
Yifan Yang
Kai Xu
OffRL
19
119
0
26 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
13
18
0
14 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
25
24
0
12 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
20
124
0
08 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
60
225
0
01 Jun 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
21
74
0
15 May 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty
Yongle Luo
Kun Dong
Lili Zhao
Zhiyong Sun
Chao Zhou
Bo Song
34
13
0
05 Mar 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
21
59
0
22 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
17
173
0
09 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
37
188
0
23 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
32
34
0
23 Dec 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
15
64
0
26 Nov 2019
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
13
75
0
09 Nov 2019
Task-Relevant Adversarial Imitation Learning
Konrad Zolna
Scott E. Reed
Alexander Novikov
Sergio Gomez Colmenarejo
David Budden
Serkan Cabi
Misha Denil
Nando de Freitas
Ziyun Wang
GAN
22
61
0
02 Oct 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
20
50
0
01 Oct 2019
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
Adam Stooke
Pieter Abbeel
OffRL
19
96
0
03 Sep 2019
Dynamics-aware Embeddings
William F. Whitney
Rajat Agarwal
Kyunghyun Cho
Abhinav Gupta
SSL
17
53
0
25 Aug 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
OffRL
21
202
0
22 Jul 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRL
BDL
25
371
0
01 Jul 2019
Deep Reinforcement Learning for Cyber Security
Thanh Thi Nguyen
Vijay Janapa Reddi
OffRL
AI4CE
10
313
0
13 Jun 2019
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video
Miguel Jaques
Michael G. Burke
Timothy M. Hospedales
VGen
PINN
21
44
0
27 May 2019
Attention-based Deep Reinforcement Learning for Multi-view Environments
Elaheh Barati
Xuewen Chen
Z. Zhong
14
6
0
10 May 2019
Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Sandy H. Huang
Martina Zambelli
Jackie Kay
M. Martins
Yuval Tassa
P. Pilarski
R. Hadsell
18
50
0
20 Mar 2019
Deep Reinforcement Learning with Decorrelation
B. Mavrin
Hengshuai Yao
Linglong Kong
22
8
0
18 Mar 2019