Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06680
Cited By
Dota 2 with Large Scale Deep Reinforcement Learning
13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dota 2 with Large Scale Deep Reinforcement Learning"
50 / 991 papers shown
Title
Multi-Agent Continuous Control with Generative Flow Networks
Shuang Luo
Yinchuan Li
Shunyu Liu
Xu Zhang
Yunfeng Shao
Chao Wu
AI4CE
27
2
0
13 Aug 2024
Strategy Game-Playing with Size-Constrained State Abstraction
Linjie Xu
Diego Perez-Liebana
Alexander Dockhorn
35
0
0
12 Aug 2024
Achieving Human Level Competitive Robot Table Tennis
David B. DÁmbrosio
Saminda Abeyruwan
L. Graesser
Atil Iscen
H. B. Amor
...
Vikas Sindhwani
Vincent Vanhoucke
Grace Vesom
P. Xu
Pannag R. Sanketi
89
14
0
07 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
LiteEFG: An Efficient Python Library for Solving Extensive-form Games
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
27
2
0
29 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
19
149
0
24 Jul 2024
Learning to Play Foosball: System and Baselines
Janosch Moos
Cedric Derstroff
Niklas Schröder
Debora Clever
19
0
0
23 Jul 2024
Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
Sanyam Vyas
Chris Hicks
V. Mavroudis
AAML
34
0
0
21 Jul 2024
Proximal Policy Distillation
Giacomo Spigler
OffRL
26
1
0
21 Jul 2024
Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning
Yuxuan Jiang
Yujie Yang
Zhiqian Lan
Guojian Zhan
Shengbo Eben Li
Qi Sun
Jian Ma
Tianwen Yu
Changwu Zhang
33
1
0
21 Jul 2024
Model-based Policy Optimization using Symbolic World Model
Andrey Gorodetskiy
Konstantin Mironov
Aleksandr I. Panov
29
0
0
18 Jul 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
18
0
0
18 Jul 2024
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
Chang Lei
Huan Lei
25
0
0
14 Jul 2024
Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments
Zoya Volovikova
A. Skrynnik
Petr Kuderov
Aleksandr I. Panov
LLMAG
LM&Ro
38
0
0
12 Jul 2024
A Review of Nine Physics Engines for Reinforcement Learning Research
Michael Kaup
Cornelius Wolff
Hyerim Hwang
Julius Mayer
Elia Bruni
AI4CE
30
5
0
11 Jul 2024
Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks
Pu Feng
Junkang Liang
Size Wang
Xin Yu
Xin Ji
Yiting Chen
Kui Zhang
Rongye Shi
Wenjun Wu
47
7
0
11 Jul 2024
Structural Design Through Reinforcement Learning
Thomas Rochefort-Beaudoin
Aurelian Vadean
Niels Aage
S. Achiche
AI4CE
21
0
0
10 Jul 2024
Learning With Generalised Card Representations for "Magic: The Gathering"
Timo Bertram
Johannes Fürnkranz
Martin Müller
47
1
0
08 Jul 2024
Neural Network-based Information Set Weighting for Playing Reconnaissance Blind Chess
Timo Bertram
Johannes Fürnkranz
Martin Müller
21
1
0
08 Jul 2024
Gradient-based Regularization for Action Smoothness in Robotic Control with Reinforcement Learning
I. Lee
Hoang-Giang Cao
Cong-Tinh Dao
Yu-Cheng Chen
I-Chen Wu
25
0
0
05 Jul 2024
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
Boyuan Chen
Diego Marti Monso
Yilun Du
Max Simchowitz
Russ Tedrake
Vincent Sitzmann
DiffM
32
73
0
01 Jul 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
27
0
0
30 Jun 2024
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors
Emma Cramer
Bernd Frauenknecht
Ramil Sabirov
Sebastian Trimpe
OffRL
OnRL
38
3
0
28 Jun 2024
Multi-agent Cooperative Games Using Belief Map Assisted Training
Qinwei Huang
Chen Luo
Alex B. Wu
Simon Khan
Hai Helen Li
Qinru Qiu
29
0
0
27 Jun 2024
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
41
7
0
26 Jun 2024
Reinforcement Learning via Auxiliary Task Distillation
Abhinav Harish
Larry Heck
Josiah P. Hanna
Z. Kira
Andrew Szot
31
0
0
24 Jun 2024
Position: Benchmarking is Limited in Reinforcement Learning Research
Scott M. Jordan
Adam White
Bruno Castro da Silva
Martha White
Philip S. Thomas
OffRL
18
5
0
23 Jun 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
Haruka Kita
Sotetsu Koyamada
Yotaro Yamaguchi
Shin Ishii
27
0
0
14 Jun 2024
Reinforcement Learning for High-Level Strategic Control in Tower Defense Games
Joakim Bergdahl
Alessandro Sestini
Linus Gisslén
40
0
0
12 Jun 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Qi Lv
Xiang Deng
Gongwei Chen
Michael Yu Wang
Liqiang Nie
75
7
0
08 Jun 2024
Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe
Ryan Sullivan
Joseph Suárez
AI4CE
32
0
0
07 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
32
18
0
06 Jun 2024
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning
Lin Liu
Jian Zhao
Cheng Hu
Zhengtao Cao
Youpeng Zhao
...
Wenjun Wang
Zhaofeng He
Houqiang Li
Xia Lin
Lanxiao Huang
OffRL
SyDa
29
0
0
06 Jun 2024
Behavior-Targeted Attack on Reinforcement Learning with Limited Access to Victim's Policy
Shojiro Yamabe
Kazuto Fukuchi
Ryoma Senda
Jun Sakuma
AAML
45
0
0
06 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Wenzhe Li
Zihan Ding
Seth Karten
Chi Jin
32
1
0
04 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
31
1
0
03 Jun 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
38
2
0
30 May 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
44
5
0
29 May 2024
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Fengshuo Bai
Rui Zhao
Hongming Zhang
Sijia Cui
Ying Wen
Yaodong Yang
Bo Xu
Lei Han
OffRL
24
6
0
29 May 2024
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
OffRL
OnRL
29
0
0
28 May 2024
PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Martin Balla
G. E. Long
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
OffRL
GP
15
1
0
28 May 2024
A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning
Abdulaziz Almuzairee
Nicklas Hansen
Henrik I. Christensen
40
6
0
27 May 2024
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
25
2
0
24 May 2024
Mixture of Public and Private Distributions in Imperfect Information Games
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
16
1
0
23 May 2024
Deep Reinforcement Learning for Time-Critical Wilderness Search And Rescue Using Drones
Jan‐Hendrik Ewers
David Anderson
Douglas G. Thomson
28
4
0
21 May 2024
Configurable Mirror Descent: Towards a Unification of Decision Making
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Shuyue Hu
Xiao Huang
Hau Chan
Bo An
36
1
0
20 May 2024
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
Jian Hu
Xibin Wu
Weixun Wang
OpenLLMAI Team
Dehao Zhang
Yu Cao
AI4CE
VLM
25
90
0
20 May 2024
The Future of Large Language Model Pre-training is Federated
Lorenzo Sani
Alexandru Iacob
Zeyu Cao
Bill Marino
Yan Gao
...
Wanru Zhao
William F. Shen
Preslav Aleksandrov
Xinchi Qiu
Nicholas D. Lane
AI4CE
33
12
0
17 May 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Yuexiang Zhai
Hao Bai
Zipeng Lin
Jiayi Pan
Shengbang Tong
...
Alane Suhr
Saining Xie
Yann LeCun
Yi-An Ma
Sergey Levine
LLMAG
LRM
39
56
0
16 May 2024
Previous
1
2
3
4
5
6
...
18
19
20
Next