Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06680
Cited By
Dota 2 with Large Scale Deep Reinforcement Learning
13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dota 2 with Large Scale Deep Reinforcement Learning"
50 / 991 papers shown
Title
Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints
B. M. Gonultas
Volkan Isler
37
0
0
08 May 2024
Adversarial Attacks on Reinforcement Learning Agents for Command and Control
Ahaan Dabholkar
James Z. Hare
Mark R. Mittrick
John Richardson
Nick Waytowich
Priya Narayanan
Saurabh Bagchi
AAML
29
1
0
02 May 2024
HUGO -- Highlighting Unseen Grid Options: Combining Deep Reinforcement Learning with a Heuristic Target Topology Approach
Malte Lehna
Clara Holzhuter
Sven Tomforde
Christoph Scholz
37
6
0
01 May 2024
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
46
4
0
28 Apr 2024
Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
Lili Wu
Ben Evans
Riashat Islam
Raihan Seraj
Yonathan Efroni
Alex Lamb
52
1
0
22 Apr 2024
Reducing Redundant Computation in Multi-Agent Coordination through Locally Centralized Execution
Yidong Bai
Toshiharu Sugawara
OffRL
23
0
0
19 Apr 2024
Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning
Linjie Xu
Zichuan Liu
Alexander Dockhorn
Diego Perez-Liebana
Jinyu Wang
Lei Song
Jiang Bian
38
2
0
15 Apr 2024
Mitigating Cascading Effects in Large Adversarial Graph Environments
James Cunningham
Conrad S. Tucker
AI4CE
AAML
19
0
0
12 Apr 2024
On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning
Giuseppe Canonaco
Leo Ardon
Alberto Pozanco
Daniel Borrajo
OffRL
26
1
0
11 Apr 2024
Efficient Multi-Task Reinforcement Learning via Task-Specific Action Correction
Jinyuan Feng
Min Chen
Zhiqiang Pu
Tenghai Qiu
Jianqiang Yi
27
2
0
09 Apr 2024
CNN-based Game State Detection for a Foosball Table
David Hagens
Jan Knaup
Elke Hergenröther
Andreas Weinmann
12
0
0
08 Apr 2024
Efficient Reinforcement Learning of Task Planners for Robotic Palletization through Iterative Action Masking Learning
Zheng Wu
Yichuan Li
Wei Zhan
Changliu Liu
Yun-hui Liu
Masayoshi Tomizuka
42
4
0
07 Apr 2024
Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation
Asad Ali Shahid
Yashraj S. Narang
Vincenzo Petrone
Enrico Ferrentino
Ankur Handa
Dieter Fox
Marco Pavone
L. Roveda
35
3
0
04 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
35
49
0
30 Mar 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
36
5
0
28 Mar 2024
Self-Clustering Hierarchical Multi-Agent Reinforcement Learning with Extensible Cooperation Graph
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
29
1
0
26 Mar 2024
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
Jean-Baptiste Bouvier
Kartik Nagpal
Negar Mehr
39
3
0
20 Mar 2024
A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges
Xinrun Xu
Yuxin Wang
Chaoyi Xu
Ziluo Ding
Jiechuan Jiang
Zhiming Ding
Börje F. Karlsson
LM&Ro
LLMAG
72
14
0
15 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
30
11
0
14 Mar 2024
Scaling Instructable Agents Across Many Simulated Worlds
Sima Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
112
38
0
13 Mar 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li
Zhixuan Fang
Longbo Huang
16
0
0
07 Mar 2024
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Liangzhou Wang
Kaiwen Zhu
Fengming Zhu
Xinghu Yao
Shujie Zhang
Deheng Ye
Haobo Fu
Qiang Fu
Wei Yang
34
0
0
05 Mar 2024
Corruption-Robust Offline Two-Player Zero-Sum Markov Games
Andi Nika
Debmalya Mandal
Adish Singla
Goran Radanović
OffRL
32
1
0
04 Mar 2024
Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL)
Noah Ford
Ryan W. Gardner
Austin Juhl
Nathan Larson
27
0
0
02 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Milo's
Tomasz Trzciñski
M. Ostaszewski
Marek Cygan
OffRL
22
17
0
01 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games
Shiqi Lei
Kanghoon Lee
Linjing Li
Jinkyoo Park
Jiachen Li
OffRL
29
1
0
28 Feb 2024
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
Yifan YangGong
Haojun Pan
Lei Wang
21
0
0
21 Feb 2024
In value-based deep reinforcement learning, a pruned network is a good network
J. Obando-Ceron
Aaron C. Courville
Pablo Samuel Castro
OffRL
36
18
0
19 Feb 2024
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Davide Mambelli
Stephan Bongers
O. Zoeter
M. Spaan
F. Oliehoek
OffRL
19
0
0
19 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
27
5
0
17 Feb 2024
Mixtures of Experts Unlock Parameter Scaling for Deep RL
J. Obando-Ceron
Ghada Sokar
Timon Willi
Clare Lyle
Jesse Farebrother
Jakob N. Foerster
Gintare Karolina Dziugaite
Doina Precup
Pablo Samuel Castro
50
29
0
13 Feb 2024
Scaling Intelligent Agents in Combat Simulations for Wargaming
Scotty Black
Christian J. Darken
8
1
0
08 Feb 2024
Limitations of Agents Simulated by Predictive Models
Raymond Douglas
Jacek Karwowski
Chan Bae
Andis Draguns
Victoria Krakovna
25
0
0
08 Feb 2024
Improving Token-Based World Models with Parallel Observation Prediction
Lior Cohen
Kaixin Wang
Bingyi Kang
Shie Mannor
18
2
0
08 Feb 2024
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
30
0
0
07 Feb 2024
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
Maxime Toquebiau
Nicolas Bredeche
F. Benamar
Jae-Yun Jun
28
1
0
06 Feb 2024
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems
Oubo Ma
Yuwen Pu
L. Du
Yang Dai
Ruo Wang
Xiaolei Liu
Yingcai Wu
Shouling Ji
AAML
27
3
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
55
48
0
06 Feb 2024
Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
Ted Fujimoto
Joshua Suetterlein
Samrat Chatterjee
A. Ganguly
OffRL
22
3
0
05 Feb 2024
V-IRL: Grounding Virtual Intelligence in Real Life
Jihan Yang
Runyu Ding
Ellis L Brown
Xiaojuan Qi
Saining Xie
LM&Ro
53
19
0
05 Feb 2024
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
Qingyuan Wu
S. Zhan
Yixuan Wang
Yuhui Wang
Chung-Wei Lin
Chen Lv
Qi Zhu
Jürgen Schmidhuber
Chao Huang
OffRL
43
1
0
05 Feb 2024
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko
Wendelin Bohmer
Mathijs de Weerdt
19
5
0
02 Feb 2024
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
Guangzheng Hu
Yuanheng Zhu
Haoran Li
Dongbin Zhao
16
3
0
01 Feb 2024
Augmenting Replay in World Models for Continual Reinforcement Learning
Luke Yang
L. Kuhlmann
Gideon Kowadlo
VLM
KELM
CLL
OffRL
19
0
0
30 Jan 2024
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Dehua Zheng
...
Jing Dai
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
39
1
0
28 Jan 2024
Multi-Agent Diagnostics for Robustness via Illuminated Diversity
Mikayel Samvelyan
Davide Paglieri
Minqi Jiang
Jack Parker-Holder
Tim Rocktaschel
AAML
27
4
0
24 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CE
LRM
19
17
0
19 Jan 2024
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination
Jiawei Wang
Jian Zhao
Zhengtao Cao
Ruili Feng
Rongjun Qin
Yang Yu
27
1
0
25 Dec 2023
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
22
16
0
24 Dec 2023
Previous
1
2
3
4
5
...
18
19
20
Next