ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.04296
  4. Cited By
Massively Parallel Methods for Deep Reinforcement Learning

Massively Parallel Methods for Deep Reinforcement Learning

15 July 2015
Arun Nair
Praveen Srinivasan
Sam Blackwell
Cagdas Alcicek
Rory Fearon
A. D. Maria
Vedavyas Panneershelvam
Mustafa Suleyman
Charlie Beattie
Stig Petersen
Shane Legg
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
    OffRL
    AI4CE
    GNN
ArXivPDFHTML

Papers citing "Massively Parallel Methods for Deep Reinforcement Learning"

50 / 204 papers shown
Title
Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot
Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot
Ali Parsaee
Fahim Shahriar
Chuxin He
Ruiqing Tan
OffRL
55
0
0
17 Mar 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
57
0
0
27 Feb 2025
Provably Robust Federated Reinforcement Learning
Provably Robust Federated Reinforcement Learning
Minghong Fang
Xilong Wang
Neil Zhenqiang Gong
FedML
65
0
0
12 Feb 2025
Distributed Multi-Agent Reinforcement Learning with One-hop Neighbors and Compute Straggler Mitigation
Distributed Multi-Agent Reinforcement Learning with One-hop Neighbors and Compute Straggler Mitigation
Baoqian Wang
Junfei Xie
Nikolay A. Atanasov
28
0
0
03 Jan 2025
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
84
0
0
16 Dec 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
20
2
0
08 Nov 2024
QEDCartographer: Automating Formal Verification Using Reward-Free
  Reinforcement Learning
QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning
Alex Sanchez-Stern
Abhishek Varghese
Zhanna Kaufman
Dylan Zhang
Talia Ringer
Yuriy Brun
18
2
0
17 Aug 2024
A survey on secure decentralized optimization and learning
A survey on secure decentralized optimization and learning
Changxin Liu
Nicola Bastianello
Wei Huo
Yang Shi
Karl H. Johansson
40
1
0
16 Aug 2024
Reinforcement Learning based Workflow Scheduling in Cloud and Edge
  Computing Environments: A Taxonomy, Review and Future Directions
Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions
Amanda Jayanetti
Saman K. Halgamuge
Rajkumar Buyya
15
0
0
06 Aug 2024
Finite-Time Analysis of Asynchronous Multi-Agent TD Learning
Finite-Time Analysis of Asynchronous Multi-Agent TD Learning
Nicolò Dal Fabbro
Arman Adibi
Aritra Mitra
George J. Pappas
42
1
0
29 Jul 2024
SAPG: Split and Aggregate Policy Gradients
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRL
OnRL
32
3
0
29 Jul 2024
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement
  Learning
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning
A. S. Nipu
Siming Liu
Anthony Harris
17
4
0
12 Feb 2024
Private Knowledge Sharing in Distributed Learning: A Survey
Private Knowledge Sharing in Distributed Learning: A Survey
Yasas Supeksala
Dinh C. Nguyen
Ming Ding
Thilina Ranbaduge
Calson Chua
Jun Zhang
Jun Li
H. Vincent Poor
27
0
0
08 Feb 2024
High Throughput Training of Deep Surrogates from Large Ensemble Runs
High Throughput Training of Deep Surrogates from Large Ensemble Runs
Lucas Meyer
M. Schouler
R. Caulk
Alejandro Ribés
Bruno Raffin
AI4CE
17
5
0
28 Sep 2023
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Ahmed Haj Yahmed
Altaf Allah Abbassi
Amin Nikanjam
Heng Li
Foutse Khomh
OffRL
24
5
0
23 Aug 2023
A Survey From Distributed Machine Learning to Distributed Deep Learning
A Survey From Distributed Machine Learning to Distributed Deep Learning
Mohammad Dehghani
Zahra Yazdanparast
18
0
0
11 Jul 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRL
LRM
21
5
0
29 Jun 2023
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at
  100k Steps-Per-Second
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Vincent-Pierre Berges
Andrew Szot
Devendra Singh Chaplot
Aaron Gokaslan
Roozbeh Mottaghi
Dhruv Batra
Eric Undersander
LRM
LM&Ro
32
5
0
13 Jun 2023
Design Principles for Model Generalization and Scalable AI Integration
  in Radio Access Networks
Design Principles for Model Generalization and Scalable AI Integration in Radio Access Networks
Pablo Soldati
E. Ghadimi
Burak Demirel
Yu Wang
Raimundas Gaigalas
Mathias Sintorn
11
3
0
09 Jun 2023
Learned spatial data partitioning
Learned spatial data partitioning
Keizo Hori
Yuya Sasaki
Daichi Amagata
Y. Murosaki
Makoto Onizuka
13
3
0
08 Jun 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Yunhao Tang
Tadashi Kozuno
Mark Rowland
A. Harutyunyan
Rémi Munos
Bernardo Avila-Pires
Michal Valko
11
0
0
29 May 2023
Client Selection for Federated Policy Optimization with Environment
  Heterogeneity
Client Selection for Federated Policy Optimization with Environment Heterogeneity
Zhijie Xie
S. H. Song
27
3
0
18 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous
  Communication and Linear Function Approximation
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
FedHQL: Federated Heterogeneous Q-Learning
FedHQL: Federated Heterogeneous Q-Learning
Flint Xiaofeng Fan
Yining Ma
Zhongxiang Dai
Cheston Tan
Bryan Kian Hsiang Low
Roger Wattenhofer
FedML
24
7
0
26 Jan 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player
  Multi-Agent Learning Toolbox
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
20
13
0
01 Dec 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
23
0
0
23 Nov 2022
Simulation-Based Parallel Training
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
23
2
0
08 Nov 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with
  Multi-Agent Reinforcement Learning
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
27
0
0
24 Oct 2022
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
Scaling up Stochastic Gradient Descent for Non-convex Optimisation
S. Mohamad
H. Alamri
A. Bouchachia
42
3
0
06 Oct 2022
Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing
  Local and Remote Computers
Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers
Yan Wang
G. Vasan
A. R. Mahmood
34
15
0
05 Oct 2022
MAN: Multi-Action Networks Learning
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
14
3
0
19 Sep 2022
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free
  Reinforcement Learning
A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
26
102
0
16 Aug 2022
Parametrically Retargetable Decision-Makers Tend To Seek Power
Parametrically Retargetable Decision-Makers Tend To Seek Power
Alexander Matt Turner
Prasad Tadepalli
12
18
0
27 Jun 2022
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Konstantin Mishchenko
Francis R. Bach
Mathieu Even
Blake E. Woodworth
18
57
0
15 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL
  with SAPIENS
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS
Eleni Nisioti
Matéo Mahaut
Pierre-Yves Oudeyer
Ida Momennejad
Clément Moulin-Frier
19
9
0
10 Jun 2022
One Policy is Enough: Parallel Exploration with a Single Policy is
  Near-Optimal for Reward-Free Reinforcement Learning
One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning
Pedro Cisneros-Velarde
Boxiang Lyu
Oluwasanmi Koyejo
Mladen Kolar
OffRL
26
3
0
31 May 2022
HyperTree Proof Search for Neural Theorem Proving
HyperTree Proof Search for Neural Theorem Proving
Guillaume Lample
Marie-Anne Lachaux
Thibaut Lavril
Xavier Martinet
Amaury Hayat
Gabriel Ebner
Aurelien Rodriguez
Timothée Lacroix
AIMat
23
133
0
23 May 2022
Chain of Thought Imitation with Procedure Cloning
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
30
29
0
22 May 2022
Parallel bandit architecture based on laser chaos for reinforcement
  learning
Parallel bandit architecture based on laser chaos for reinforcement learning
Takashi Urushibara
N. Chauvet
Satoshi Kochi
S. Sunada
Kazutaka Kanno
Atsushi Uchida
R. Horisaki
Makoto Naruse
22
0
0
19 May 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical
  Robots
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
20
18
0
23 Mar 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
24
29
0
10 Feb 2022
Group-Agent Reinforcement Learning
Group-Agent Reinforcement Learning
Kaiyue Wu
Xiaoming Zeng
OOD
OffRL
12
3
0
10 Feb 2022
Faster Deep Reinforcement Learning with Slower Online Network
Faster Deep Reinforcement Learning with Slower Online Network
Kavosh Asadi
Rasool Fakoor
Omer Gottesman
Taesup Kim
Michael L. Littman
Alexander J. Smola
OnRL
11
6
0
10 Dec 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on
  Atari
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
20
12
0
19 Nov 2021
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
Qiyue Yin
Jun Yang
Kaiqi Huang
Meijing Zhao
Wancheng Ni
Bin Liang
Yan Huang
Shu Wu
Liangsheng Wang
20
20
0
15 Nov 2021
Human-Level Control without Server-Grade Hardware
Human-Level Control without Server-Grade Hardware
Brett Daley
Chris Amato
BDL
OffRL
8
0
0
01 Nov 2021
Accelerating Distributed Deep Reinforcement Learning by In-Network
  Experience Sampling
Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling
Masaki Furukawa
Hiroki Matsutani
OffRL
14
1
0
26 Oct 2021
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Siyang Wu
Tonghan Wang
Chenghao Li
Yang Hu
Chongjie Zhang
OffRL
21
1
0
15 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL
  Implementations
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
11
7
0
03 Oct 2021
A Cramér Distance perspective on Quantile Regression based
  Distributional Reinforcement Learning
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning
Alix Lhéritier
Nicolas Bondoux
11
5
0
01 Oct 2021
12345
Next