ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.06680
  4. Cited By
Dota 2 with Large Scale Deep Reinforcement Learning

Dota 2 with Large Scale Deep Reinforcement Learning

13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
    GNN
    VLM
    CLL
    AI4CE
    LRM
ArXivPDFHTML

Papers citing "Dota 2 with Large Scale Deep Reinforcement Learning"

50 / 991 papers shown
Title
Training Deep Surrogate Models with Large Scale Online Learning
Training Deep Surrogate Models with Large Scale Online Learning
Lucas Meyer
M. Schouler
R. Caulk
Alejandro Ribés
Bruno Raffin
3DGS
AI4CE
22
4
0
28 Jun 2023
DCT: Dual Channel Training of Action Embeddings for Reinforcement
  Learning with Large Discrete Action Spaces
DCT: Dual Channel Training of Action Embeddings for Reinforcement Learning with Large Discrete Action Spaces
Pranavi Pathakota
Hardik Meisheri
H. Khadilkar
OffRL
8
0
0
28 Jun 2023
Diversity is Strength: Mastering Football Full Game with Interactive
  Reinforcement Learning of Multiple AIs
Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun
Shuo Shen
Sijia Xu
Weidong Zhang
22
1
0
28 Jun 2023
Learning to Modulate pre-trained Models in RL
Learning to Modulate pre-trained Models in RL
Thomas Schmied
M. Hofmarcher
Fabian Paischer
Razvan Pascanu
Sepp Hochreiter
CLL
OffRL
24
14
0
26 Jun 2023
Is RLHF More Difficult than Standard RL?
Is RLHF More Difficult than Standard RL?
Yuanhao Wang
Qinghua Liu
Chi Jin
OffRL
17
57
0
25 Jun 2023
TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent
  Reinforcement Learning
TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning
Xiao Hu
P. Guo
Chuanwei Zhou
Tong Zhang
Zhen Cui
27
0
0
24 Jun 2023
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot
  Policy Imitation
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Massimiliano Patacchiola
Mingfei Sun
Katja Hofmann
Richard E. Turner
OffRL
19
1
0
23 Jun 2023
Transferable Curricula through Difficulty Conditioned Generators
Transferable Curricula through Difficulty Conditioned Generators
Sidney Tio
Pradeep Varakantham
17
4
0
22 Jun 2023
MP3: Movement Primitive-Based (Re-)Planning Policy
MP3: Movement Primitive-Based (Re-)Planning Policy
Fabian Otto
Hongyi Zhou
Onur Celik
Ge Li
Rudolf Lioutikov
Gerhard Neumann
21
5
0
22 Jun 2023
Inroads into Autonomous Network Defence using Explained Reinforcement
  Learning
Inroads into Autonomous Network Defence using Explained Reinforcement Learning
Myles Foley
Miaowei Wang
M. Zoe
Chris Hicks
V. Mavroudis
AAML
11
13
0
15 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
21
9
0
15 Jun 2023
Reward-Free Curricula for Training Robust World Models
Reward-Free Curricula for Training Robust World Models
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
31
6
0
15 Jun 2023
Offline Multi-Agent Reinforcement Learning with Coupled Value
  Factorization
Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization
Xiangsen Wang
Xianyuan Zhan
OffRL
21
5
0
15 Jun 2023
Theoretical Hardness and Tractability of POMDPs in RL with Partial
  Online State Information
Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information
Ming Shi
Yingbin Liang
Ness B. Shroff
29
2
0
14 Jun 2023
Data Poisoning to Fake a Nash Equilibrium in Markov Games
Data Poisoning to Fake a Nash Equilibrium in Markov Games
Young Wu
Jeremy McMahan
Xiaojin Zhu
Qiaomin Xie
OffRL
24
2
0
13 Jun 2023
A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory
  Management
A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management
Xianliang Yang
Zhihao Liu
Wei Jiang
Chuheng Zhang
Li Zhao
Lei Song
Jiang Bian
30
13
0
13 Jun 2023
On the Efficacy of 3D Point Cloud Reinforcement Learning
On the Efficacy of 3D Point Cloud Reinforcement Learning
Z. Ling
Yuan Yao
Xuanlin Li
H. Su
3DPC
31
13
0
11 Jun 2023
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed
  Multi-Agent Reinforcement Learning
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning
Xiyang Wu
Rohan Chandra
Tianrui Guan
Amrit Singh Bedi
Dinesh Manocha
32
4
0
09 Jun 2023
TreeDQN: Learning to minimize Branch-and-Bound tree
TreeDQN: Learning to minimize Branch-and-Bound tree
Dmitry Sorokin
A. Kostin
11
1
0
09 Jun 2023
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward
  Learning for Robotic Manipulation
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation
Runze Liu
Yali Du
Fengshuo Bai
Jiafei Lyu
Xiu Li
27
6
0
06 Jun 2023
Networked Communication for Decentralised Agents in Mean-Field Games
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
40
2
0
05 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive
  Advantages
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
25
5
0
02 Jun 2023
Hyperparameters in Reinforcement Learning and How To Tune Them
Hyperparameters in Reinforcement Learning and How To Tune Them
Theresa Eimer
Marius Lindauer
Roberta Raileanu
OffRL
27
34
0
02 Jun 2023
Investigating Navigation Strategies in the Morris Water Maze through
  Deep Reinforcement Learning
Investigating Navigation Strategies in the Morris Water Maze through Deep Reinforcement Learning
A. Liu
Alla Borisyuk
16
6
0
01 Jun 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Active Vision Reinforcement Learning under Limited Visual Observability
Jinghuan Shang
Michael S. Ryoo
32
0
0
01 Jun 2023
Improving and Benchmarking Offline Reinforcement Learning Algorithms
Improving and Benchmarking Offline Reinforcement Learning Algorithms
Bingyi Kang
Xiao Ma
Yi-Ren Wang
Yang Yue
Shuicheng Yan
OffRL
8
9
0
01 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni de Fabritiis
Vincent Moens
OffRL
AI4CE
22
37
0
01 Jun 2023
On the Linear Convergence of Policy Gradient under Hadamard
  Parameterization
On the Linear Convergence of Policy Gradient under Hadamard Parameterization
Jiacai Liu
Jinchi Chen
Ke Wei
16
2
0
31 May 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
P. S. Castro
OffRL
43
82
0
30 May 2023
Strategic Reasoning with Language Models
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&Ro
LRM
40
36
0
30 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
24
1
0
28 May 2023
A Hierarchical Approach to Population Training for Human-AI
  Collaboration
A Hierarchical Approach to Population Training for Human-AI Collaboration
Yi Loo
Chen Gong
Malika Meghjani
20
7
0
26 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
46
755
0
25 May 2023
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep
  Reinforcement Learning
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning
V. Moschopoulos
Pantelis Kyriakidis
A. Lazaridis
I. Vlahavas
16
0
0
25 May 2023
GUARD: A Safe Reinforcement Learning Benchmark
GUARD: A Safe Reinforcement Learning Benchmark
Weiye Zhao
Rui Chen
Yifan Sun
Ruixuan Liu
Tianhao Wei
Changliu Liu
41
12
0
23 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
29
0
0
21 May 2023
Learning Diverse Risk Preferences in Population-based Self-play
Learning Diverse Risk Preferences in Population-based Self-play
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
54
3
0
19 May 2023
Optimistic Natural Policy Gradient: a Simple Efficient Policy
  Optimization Framework for Online RL
Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
Qinghua Liu
Gellert Weisz
András Gyorgy
Chi Jin
Csaba Szepesvári
OffRL
21
8
0
18 May 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid
  Reinforcement Learning
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
73
12
0
17 May 2023
Exploring the Space of Key-Value-Query Models with Intention
Exploring the Space of Key-Value-Query Models with Intention
M. Garnelo
Wojciech M. Czarnecki
35
7
0
17 May 2023
Trojan Playground: A Reinforcement Learning Framework for Hardware
  Trojan Insertion and Detection
Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
Amin Sarihi
Ahmad Patooghy
Peter Jamieson
Abdel-Hameed A. Badawy
27
7
0
16 May 2023
More Like Real World Game Challenge for Partially Observable Multi-Agent
  Cooperation
More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation
Meng Yao
Xueou Feng
Qiyue Yin
16
0
0
15 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous
  Communication and Linear Function Approximation
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement
  Learning
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
Adam Michalski
Filippos Christianos
Stefano V. Albrecht
13
3
0
09 May 2023
Information Design in Multi-Agent Reinforcement Learning
Information Design in Multi-Agent Reinforcement Learning
Yue Lin
Wenhao Li
H. Zha
Baoxiang Wang
29
10
0
08 May 2023
Stackelberg Games for Learning Emergent Behaviors During Competitive
  Autocurricula
Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula
Boling Yang
Liyuan Zheng
Lillian J. Ratliff
Byron Boots
Joshua R. Smith
33
3
0
04 May 2023
Posterior Sampling for Deep Reinforcement Learning
Posterior Sampling for Deep Reinforcement Learning
Remo Sasso
Michelangelo Conserva
Paulo E. Rauber
OffRL
BDL
35
6
0
30 Apr 2023
From Explicit Communication to Tacit Cooperation:A Novel Paradigm for
  Cooperative MARL
From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL
Dapeng Li
Zhiwei Xu
Bin Zhang
Guoliang Fan
46
7
0
28 Apr 2023
Learning Environment for the Air Domain (LEAD)
Learning Environment for the Air Domain (LEAD)
Andreas Strand
Patrick Ribu Gorton
M. Asprusten
K. Brathen
29
1
0
27 Apr 2023
Games for Artificial Intelligence Research: A Review and Perspectives
Games for Artificial Intelligence Research: A Review and Perspectives
Chengpeng Hu
Yunlong Zhao
Ziqi Wang
Haocheng Du
Jialin Liu
AI4CE
35
12
0
26 Apr 2023
Previous
123...678...181920
Next