ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.06680
  4. Cited By
Dota 2 with Large Scale Deep Reinforcement Learning

Dota 2 with Large Scale Deep Reinforcement Learning

13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
    GNN
    VLM
    CLL
    AI4CE
    LRM
ArXivPDFHTML

Papers citing "Dota 2 with Large Scale Deep Reinforcement Learning"

50 / 991 papers shown
Title
Scaling Is All You Need: Autonomous Driving with JAX-Accelerated
  Reinforcement Learning
Scaling Is All You Need: Autonomous Driving with JAX-Accelerated Reinforcement Learning
Moritz Harmel
Anubhav Paras
Andreas Pasternak
Nicholas Roy
Gary Linscott
LRM
21
1
0
23 Dec 2023
OpenRL: A Unified Reinforcement Learning Framework
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
40
3
0
20 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
24
25
0
19 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
18
10
0
18 Dec 2023
Auto MC-Reward: Automated Dense Reward Design with Large Language Models
  for Minecraft
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
35
32
0
14 Dec 2023
Personalized Decision Supports based on Theory of Mind Modeling and
  Explainable Reinforcement Learning
Personalized Decision Supports based on Theory of Mind Modeling and Explainable Reinforcement Learning
Huao Li
Yao Fan
Keyang Zheng
Michael Lewis
Katia P. Sycara
25
0
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
31
1
0
12 Dec 2023
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement
  Learning
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning
Dianyu Zhong
Yiqin Yang
Qianchuan Zhao
22
6
0
11 Dec 2023
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement
  Learning
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning
Kun-Li Channing Lin
Yufeng Wang
Peihao Chen
Runhao Zeng
Siyuan Zhou
Mingkui Tan
Chuang Gan
AI4CE
29
0
0
10 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
78
10
0
10 Dec 2023
Evolving Reservoirs for Meta Reinforcement Learning
Evolving Reservoirs for Meta Reinforcement Learning
Corentin Léger
Gautier Hamon
Eleni Nisioti
X. Hinaut
Clément Moulin-Frier
23
1
0
09 Dec 2023
Canaries and Whistles: Resilient Drone Communication Networks with (or
  without) Deep Reinforcement Learning
Canaries and Whistles: Resilient Drone Communication Networks with (or without) Deep Reinforcement Learning
Chris Hicks
V. Mavroudis
Myles Foley
Thomas Davies
Kate Highnam
Tim Watson
16
6
0
08 Dec 2023
Is Feedback All You Need? Leveraging Natural Language Feedback in
  Goal-Conditioned Reinforcement Learning
Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning
Sabrina McCallum
Max Taylor-Davies
Stefano V. Albrecht
Alessandro Suglia
21
1
0
07 Dec 2023
Mastering Complex Coordination through Attention-based Dynamic Graph
Mastering Complex Coordination through Attention-based Dynamic Graph
Guangchong Zhou
Zhiwei Xu
Zeren Zhang
Guoliang Fan
GNN
8
0
0
07 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
34
6
0
05 Dec 2023
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
Lukas Schäfer
Logan Jones
Anssi Kanervisto
Yuhan Cao
Tabish Rashid
Raluca Georgescu
David Bignell
Siddhartha Sen
Andrea Trevino Gavito
Sam Devlin
85
3
0
04 Dec 2023
Extreme Event Prediction with Multi-agent Reinforcement Learning-based
  Parametrization of Atmospheric and Oceanic Turbulence
Extreme Event Prediction with Multi-agent Reinforcement Learning-based Parametrization of Atmospheric and Oceanic Turbulence
R. Mojgani
Daniel Waelchli
Yifei Guan
P. Koumoutsakos
P. Hassanzadeh
AI4Cl
AI4CE
20
5
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
25
3
0
01 Dec 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
16
0
0
28 Nov 2023
Reward Shaping for Improved Learning in Real-time Strategy Game Play
Reward Shaping for Improved Learning in Real-time Strategy Game Play
John Kliem
Prithviraj Dasgupta
OffRL
11
1
0
27 Nov 2023
Learning to Cooperate and Communicate Over Imperfect Channels
Learning to Cooperate and Communicate Over Imperfect Channels
Jannis Weil
Gizem Ekinci
Heinz Koeppl
Tobias Meuser
18
0
0
24 Nov 2023
Efficient Open-world Reinforcement Learning via Knowledge Distillation
  and Autonomous Rule Discovery
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery
Ekaterina Nikonova
Cheng Xue
Jochen Renz
CLL
11
1
0
24 Nov 2023
A DRL solution to help reduce the cost in waiting time of securing a
  traffic light for cyclists
A DRL solution to help reduce the cost in waiting time of securing a traffic light for cyclists
Lucas Magnana
H. Rivano
Nicolas Chiabaut
9
0
0
23 Nov 2023
minimax: Efficient Baselines for Autocurricula in JAX
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
19
8
0
21 Nov 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
ADAPTER-RL: Adaptation of Any Agent using Reinforcement Learning
ADAPTER-RL: Adaptation of Any Agent using Reinforcement Learning
Yi-Fan Jin
Greg Slabaugh
Simon Lucas
OnRL
AI4CE
8
0
0
20 Nov 2023
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou
Ayush Sekhari
Yuda Song
Wen Sun
OffRL
OnRL
30
8
0
14 Nov 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy
  Sampling
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
16
1
0
14 Nov 2023
Object-Centric Learning with Slot Mixture Module
Object-Centric Learning with Slot Mixture Module
Daniil E. Kirilenko
Vitaliy Vorobyov
A. Kovalev
Aleksandr I. Panov
OCL
31
3
0
08 Nov 2023
MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters
MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters
Chau Pham
Piotr Teterwak
Soren Nelson
Bryan A. Plummer
9
3
0
07 Nov 2023
The NeurIPS 2022 Neural MMO Challenge: A Massively Multiagent
  Competition with Specialization and Trade
The NeurIPS 2022 Neural MMO Challenge: A Massively Multiagent Competition with Specialization and Trade
Enhong Liu
Joseph Suárez
Chenhui You
Bo Wu
Bingcheng Chen
...
Yuejia Huang
Kun Zhang
Hanhui Yang
Shi-bao Tang
Phillip Isola
13
0
0
07 Nov 2023
A Brain-inspired Theory of Collective Mind Model for Efficient Social
  Cooperation
A Brain-inspired Theory of Collective Mind Model for Efficient Social Cooperation
Zhuoya Zhao
Feifei Zhao
Shiwen Wang
Yinqian Sun
Yi Zeng
23
1
0
06 Nov 2023
Emergence of Collective Open-Ended Exploration from Decentralized
  Meta-Reinforcement Learning
Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning
Richard Bornemann
Gautier Hamon
Eleni Nisioti
Clément Moulin-Frier
LRM
22
1
0
01 Nov 2023
Language Agents with Reinforcement Learning for Strategic Play in the
  Werewolf Game
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
29
78
0
29 Oct 2023
Fair collaborative vehicle routing: A deep multi-agent reinforcement
  learning approach
Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
Stephen Mak
Liming Xu
Tim Pearce
Michael Ostroumov
Alexandra Brintrup
16
11
0
26 Oct 2023
TD-MPC2: Scalable, Robust World Models for Continuous Control
TD-MPC2: Scalable, Robust World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
MU
32
121
0
25 Oct 2023
AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban
  Planning via Consensus-based Multi-Agent Reinforcement Learning
AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban Planning via Consensus-based Multi-Agent Reinforcement Learning
Kejiang Qian
Lingjun Mao
Xin Liang
Yimin Ding
Jin Gao
Xinran Wei
Ziyi Guo
Jiajie Li
9
3
0
25 Oct 2023
Finetuning Offline World Models in the Real World
Finetuning Offline World Models in the Real World
Yunhai Feng
Nicklas Hansen
Ziyan Xiong
Chandramouli Rajagopalan
Xiaolong Wang
OffRL
OnRL
17
20
0
24 Oct 2023
Fact-based Agent modeling for Multi-Agent Reinforcement Learning
Fact-based Agent modeling for Multi-Agent Reinforcement Learning
Baofu Fang
Caiming Zheng
Hao Wang
OffRL
16
0
0
18 Oct 2023
Accelerate Presolve in Large-Scale Linear Programming via Reinforcement
  Learning
Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning
Yufei Kuang
Xijun Li
Jie Wang
Fangzhou Zhu
Meng Lu
Zhihai Wang
Jianguo Zeng
Houqiang Li
Yongdong Zhang
Feng Wu
29
4
0
18 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
33
10
0
15 Oct 2023
Seeking Next Layer Neurons' Attention for Error-Backpropagation-Like
  Training in a Multi-Agent Network Framework
Seeking Next Layer Neurons' Attention for Error-Backpropagation-Like Training in a Multi-Agent Network Framework
Arshia Soltani Moakhar
Mohammad Azizmalayeri
Hossein Mirzaei
M. T. Manzuri
M. Rohban
21
2
0
15 Oct 2023
Robust Multi-Agent Reinforcement Learning by Mutual Information
  Regularization
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Simin Li
Ruixiao Xu
Jingqiao Xiu
Yuwei Zheng
Pu Feng
Yaodong Yang
Xianglong Liu
23
3
0
15 Oct 2023
ELDEN: Exploration via Local Dependencies
ELDEN: Exploration via Local Dependencies
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martin-Martin
35
8
0
12 Oct 2023
Cross-Episodic Curriculum for Transformer Agents
Cross-Episodic Curriculum for Transformer Agents
Lucy Xiaoyang Shi
Yunfan Jiang
Jake Grigsby
Linxi "Jim" Fan
Yuke Zhu
22
4
0
12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
  Sequential Decision Scenarios
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
90
12
0
12 Oct 2023
GameGPT: Multi-agent Collaborative Framework for Game Development
GameGPT: Multi-agent Collaborative Framework for Game Development
Dake Chen
Hanbin Wang
Yunhao Huo
Yuzhao Li
Haoyang Zhang
LLMAG
11
19
0
12 Oct 2023
Measuring Feature Sparsity in Language Models
Measuring Feature Sparsity in Language Models
Mingyang Deng
Lucas Tao
Joe Benton
21
1
0
11 Oct 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
21
1
0
11 Oct 2023
Previous
123456...181920
Next