ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.06680
  4. Cited By
Dota 2 with Large Scale Deep Reinforcement Learning

Dota 2 with Large Scale Deep Reinforcement Learning

13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
    GNN
    VLM
    CLL
    AI4CE
    LRM
ArXivPDFHTML

Papers citing "Dota 2 with Large Scale Deep Reinforcement Learning"

50 / 991 papers shown
Title
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for
  Cost-efficient Generalization
Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization
Yuxin Chen
Chen Tang
Ran Tian
Chenran Li
Jinning Li
Masayoshi Tomizuka
Wei Zhan
34
3
0
11 Oct 2023
RoboHive: A Unified Framework for Robot Learning
RoboHive: A Unified Framework for Robot Learning
Vikash Kumar
Rutav Shah
Gaoyue Zhou
Vincent Moens
Vittorio Caggiano
Jay Vakil
Abhishek Gupta
Aravind Rajeswaran
20
22
0
10 Oct 2023
Diversity from Human Feedback
Diversity from Human Feedback
Ren-Jian Wang
Ke Xue
Yutong Wang
Peng Yang
Haobo Fu
Qiang Fu
Chao Qian
39
3
0
10 Oct 2023
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement
  Learning Models
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Hanjing Wang
Man-Kit Sit
Cong He
Ying Wen
Weinan Zhang
J. Wang
Yaodong Yang
Luo Mai
OffRL
VLM
27
1
0
08 Oct 2023
"A Nova Eletricidade: Aplicações, Riscos e Tendências da IA
  Moderna -- "The New Electricity": Applications, Risks, and Trends in Current
  AI
"A Nova Eletricidade: Aplicações, Riscos e Tendências da IA Moderna -- "The New Electricity": Applications, Risks, and Trends in Current AI
A. Bazzan
Anderson R. Tavares
André G. Pereira
C. R. Jung
Jacob Scharcanski
J. Carbonera
Luís C. Lamb
Mariana Recamonde Mendoza
T. L. T. D. Silveira
V. P. Moreira
29
0
0
08 Oct 2023
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation
  with Parameter-Sharing Versatility
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility
Lang Feng
Dong Xing
Junru Zhang
Gang Pan
21
1
0
08 Oct 2023
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with
  Subgame Curriculum Learning
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
Jiayu Chen
Zelai Xu
Yunfei Li
Chao Yu
Jiaming Song
Huazhong Yang
Fei Fang
Yu Wang
Yi Wu
24
4
0
07 Oct 2023
Digital Twin Assisted Deep Reinforcement Learning for Online Admission
  Control in Sliced Network
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network
Zhenyu Tao
Weihong Xu
Xiaohu You
OffRL
25
3
0
07 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
34
47
0
06 Oct 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed
  Cooperative-Competitive Games
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
13
8
0
05 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial
  Environment Design
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
11
9
0
04 Oct 2023
Differentially Encoded Observation Spaces for Perceptive Reinforcement
  Learning
Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning
Lev Grossman
Brian Plancher
OffRL
15
0
0
03 Oct 2023
Blending Imitation and Reinforcement Learning for Robust Policy
  Improvement
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
Xuefeng Liu
Takuma Yoneda
Rick L. Stevens
Matthew R. Walter
Yuxin Chen
27
10
0
03 Oct 2023
Avalon's Game of Thoughts: Battle Against Deception through Recursive
  Contemplation
Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation
Shenzhi Wang
Chang Liu
Zilong Zheng
Siyuan Qi
Shuo Chen
Qisen Yang
Andrew Zhao
Chaofei Wang
Shiji Song
Gao Huang
LLMAG
31
62
0
02 Oct 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of
  Agents
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
High Throughput Training of Deep Surrogates from Large Ensemble Runs
High Throughput Training of Deep Surrogates from Large Ensemble Runs
Lucas Meyer
M. Schouler
R. Caulk
Alejandro Ribés
Bruno Raffin
AI4CE
17
5
0
28 Sep 2023
Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via
  Adaptive Behavioral Costs in 3D Games
Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games
Kuo-Hao Ho
Ping-Chun Hsieh
Chiu-Chou Lin
You-Ren Luo
Feng-Jian Wang
I-Chen Wu
24
0
0
27 Sep 2023
PlotMap: Automated Layout Design for Building Game Worlds
PlotMap: Automated Layout Design for Building Game Worlds
Yi Wang
Jieliang Luo
Adam Gaier
Evan Atherton
Hilmar Koch
25
1
0
26 Sep 2023
Enhancing data efficiency in reinforcement learning: a novel imagination
  mechanism based on mesh information propagation
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation
Zihang Wang
Maowei Jiang
AI4CE
12
0
0
25 Sep 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence
  Agents
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
38
20
0
23 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement
  Learning in Drone Control
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Ruize Zhang
Yi Wu
Yu Wang
26
28
0
22 Sep 2023
Improving Generalization in Game Agents with Data Augmentation in
  Imitation Learning
Improving Generalization in Game Agents with Data Augmentation in Imitation Learning
Derek Yadgaroff
Alessandro Sestini
Konrad Tollmar
Ayca Ozcelikkale
Linus Gisslén
14
2
0
22 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent
  Reinforcement Learning
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
13
19
0
22 Sep 2023
Tree-Based Reconstructive Partitioning: A Novel Low-Data Level
  Generation Approach
Tree-Based Reconstructive Partitioning: A Novel Low-Data Level Generation Approach
Emily Halina
Matthew J. Guzdial
15
1
0
18 Sep 2023
Parallel Distributional Prioritized Deep Reinforcement Learning for
  Unmanned Aerial Vehicles
Parallel Distributional Prioritized Deep Reinforcement Learning for Unmanned Aerial Vehicles
A. H. Kolling
V. A. Kich
J. C. Jesus
Andressa Cavalcante da Silva
Ricardo B. Grando
Paulo L. J. Drews-Jr
D. T. Gamarra
14
3
0
01 Sep 2023
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles
Alberto Dionigi
Simone Felicioni
Mirko Leomanni
G. Costante
10
9
0
31 Aug 2023
Benchmarking Robustness and Generalization in Multi-Agent Systems: A
  Case Study on Neural MMO
Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO
Yangkun Chen
Joseph Suárez
Junjie Zhang
Chenghui Yu
Bo Wu
...
Sharada Mohanty
Jiaxin Chen
Xiu Li
Xiaolong Zhu
Phillip Isola
24
0
0
30 Aug 2023
Reinforcement Learning Informed Evolutionary Search for Autonomous
  Systems Testing
Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing
D. Humeniuk
Foutse Khomh
G. Antoniol
25
4
0
24 Aug 2023
Diverse Policies Converge in Reward-free Markov Decision Processe
Diverse Policies Converge in Reward-free Markov Decision Processe
Fanqing Lin
Shiyu Huang
Weiwei Tu
24
0
0
23 Aug 2023
Careful at Estimation and Bold at Exploration
Careful at Estimation and Bold at Exploration
Xing Chen
Yijun Liu
Zhaogeng Liu
Hechang Chen
Hengshuai Yao
Yi-Ju Chang
14
0
0
22 Aug 2023
Stabilizing Unsupervised Environment Design with a Learned Adversary
Stabilizing Unsupervised Environment Design with a Learned Adversary
Ishita Mediratta
Minqi Jiang
Jack Parker-Holder
Michael Dennis
Eugene Vinitsky
Tim Rocktaschel
34
14
0
21 Aug 2023
Generating Personas for Games with Multimodal Adversarial Imitation
  Learning
Generating Personas for Games with Multimodal Adversarial Imitation Learning
William Ahlberg
Alessandro Sestini
Konrad Tollmar
Linus Gisslén
GAN
24
2
0
15 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
29
2
0
09 Aug 2023
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
Omer Veysel Cagatan
Barış Akgün
BDL
OffRL
24
3
0
08 Aug 2023
Scope Loss for Imbalanced Classification and RL Exploration
Scope Loss for Imbalanced Classification and RL Exploration
Hasham Burhani
Xiaolong Shi
Jonathan Jaegerman
Daniel Balicki
14
0
0
08 Aug 2023
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Michaël Mathieu
Sherjil Ozair
Srivatsan Srinivasan
Çağlar Gülçehre
Shangtong Zhang
...
Sergio Gomez Colmenarejo
Aaron van den Oord
Wojciech M. Czarnecki
Nando de Freitas
Oriol Vinyals
OffRL
16
10
0
07 Aug 2023
ESP: Exploiting Symmetry Prior for Multi-Agent Reinforcement Learning
ESP: Exploiting Symmetry Prior for Multi-Agent Reinforcement Learning
Xin Yu
Rongye Shi
Pu Feng
Yongkai Tian
Jie Luo
Wenjun Wu
36
7
0
30 Jul 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
30
28
0
28 Jul 2023
Counterfactual Explanation Policies in RL
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
30
0
0
25 Jul 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local
  Value Regularization
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
30
23
0
21 Jul 2023
Towards practical reinforcement learning for tokamak magnetic control
Towards practical reinforcement learning for tokamak magnetic control
Brendan D. Tracey
Andrea Michi
Yuri Chervonyi
Ian Davies
Cosmin Paduraru
...
Jonathan Evens
Paula Kurylowicz
D. Mankowitz
Martin Riedmiller
The Tcv Team
AI4CE
37
10
0
21 Jul 2023
Towards General Game Representations: Decomposing Games Pixels into
  Content and Style
Towards General Game Representations: Decomposing Games Pixels into Content and Style
C. Trivedi
Konstantinos Makantasis
Antonios Liapis
Georgios N. Yannakakis
OCL
35
3
0
20 Jul 2023
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on
  Analyses of Interestingness
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
Pedro Sequeira
Melinda Gervasio
11
2
0
18 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
29
0
0
17 Jul 2023
Image-based Regularization for Action Smoothness in Autonomous Miniature
  Racing Car with Deep Reinforcement Learning
Image-based Regularization for Action Smoothness in Autonomous Miniature Racing Car with Deep Reinforcement Learning
Hoang-Giang Cao
I. Lee
Bo-Jiun Hsu
Zheng-Yi Lee
Yu-Wei Shih
Hsueh-Cheng Wang
I-Chen Wu
27
2
0
17 Jul 2023
Efficient Adversarial Attacks on Online Multi-agent Reinforcement
  Learning
Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
Guanlin Liu
Lifeng Lai
AAML
35
6
0
15 Jul 2023
Rational Neural Network Controllers
Rational Neural Network Controllers
M. Newton
A. Papachristodoulou
OOD
AAML
37
1
0
12 Jul 2023
Comparing Reinforcement Learning and Human Learning using the Game of
  Hidden Rules
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
Eric Pulick
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
OffRL
9
0
0
30 Jun 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
18
3
0
29 Jun 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRL
LRM
19
5
0
29 Jun 2023
Previous
123...567...181920
Next