Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06680
Cited By
Dota 2 with Large Scale Deep Reinforcement Learning
13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dota 2 with Large Scale Deep Reinforcement Learning"
50 / 991 papers shown
Title
Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning
Ardian Selmonaj
Oleg Szehr
Giacomo Del Rio
Alessandro Antonucci
Adrian Schneider
Michael Rüegsegger
21
0
0
13 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian-jun Sun
Gang Wang
AI4CE
54
1
0
08 May 2025
Program Semantic Inequivalence Game with Large Language Models
Antonio Valerio Miceli-Barone
Vaishak Belle
Ali Payani
LRM
25
0
0
02 May 2025
Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning
Feiyu Lu
Mengyu Chen
Hsiang Hsu
Pranav Deshpande
Cheng Yao Wang
Blair MacIntyre
23
3
0
30 Apr 2025
CaRL: Learning Scalable Planning Policies with Simple Rewards
Bernhard Jaeger
D. Dauner
Jens Beißwenger
Simon Gerstenecker
Kashyap Chitta
Andreas Geiger
49
0
0
24 Apr 2025
Policy-Based Radiative Transfer: Solving the
2
2
2
-Level Atom Non-LTE Problem using Soft Actor-Critic Reinforcement Learning
Brandon Panos
Ivan Milic
OffRL
18
0
0
22 Apr 2025
Improving Human-AI Coordination through Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
64
0
0
21 Apr 2025
Adapting a World Model for Trajectory Following in a 3D Game
Marko Tot
Shu Ishida
Abdelhak Lemkhenter
David Bignell
Pallavi Choudhury
...
Tarun Gupta
Darren Gehring
Sam Devlin
Sergio Valcarcel Macua
Raluca Georgescu
41
0
0
16 Apr 2025
Vision based driving agent for race car simulation environments
Gergely Bári
László Palkovics
21
1
0
14 Apr 2025
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
613
0
0
09 Apr 2025
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw
Eli Bronstein
Timothy Guo
Dylan Feng
Lukas Berglund
Justin Svegliato
Stuart J. Russell
Anca Dragan
29
1
0
09 Apr 2025
An Information-Geometric Approach to Artificial Curiosity
Alexander Nedergaard
Pablo A. Morales
21
0
0
08 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
38
1
0
07 Apr 2025
Playing Non-Embedded Card-Based Games with Reinforcement Learning
Tianyang Wu
Lipeng Wan
Yuhang Wang
Qiang Wan
Xuguang Lan
OffRL
25
0
0
07 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
26
0
0
06 Apr 2025
How to Adapt Control Barrier Functions? A Learning-Based Approach with Applications to a VTOL Quadplane
Taekyung Kim
Randal W. Beard
Dimitra Panagou
29
0
0
03 Apr 2025
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Mirko Stappert
Bernhard Lutz
Niklas Goby
Dirk Neumann
OffRL
31
0
0
03 Apr 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
J. Dong
Manuel Rigger
46
0
0
28 Mar 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
45
0
0
24 Mar 2025
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles
Ruoqi Zhang
Ziwei Luo
Jens Sjölund
Per Mattsson
Linus Gisslén
Alessandro Sestini
42
1
0
21 Mar 2025
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat
Joseph Emmanuel DL Dayo
Michel Onasis S. Ogbinar
Prospero C. Naval Jr
51
0
0
19 Mar 2025
Agents Play Thousands of 3D Video Games
Zhongwen Xu
Xianliang Wang
Siyi Li
Tao Yu
Liang Wang
Qiang Fu
Wei Yang
LM&Ro
52
0
0
17 Mar 2025
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
56
0
0
15 Mar 2025
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents
Tristan Tomilin
Meng Fang
Mykola Pechenizkiy
55
0
0
11 Mar 2025
Automatic Curriculum Design for Zero-Shot Human-AI Coordination
Won-Sang You
Tae-Gwan Ha
Seo-Young Lee
Kyung-Joong Kim
49
0
0
10 Mar 2025
Controllable Complementarity: Subjective Preferences in Human-AI Collaboration
Chase McDonald
Cleotilde Gonzalez
65
0
0
07 Mar 2025
Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Adam Labiosa
Josiah P. Hanna
49
0
0
07 Mar 2025
Factorio Learning Environment
Jack Hopkins
Mart Bakler
Akbir Khan
LRM
AI4CE
LLMAG
50
0
0
06 Mar 2025
PokéChamp: an Expert-level Minimax Language Agent
Seth Karten
Andy Luu Nguyen
Chi Jin
AI4MH
LLMAG
ELM
75
2
0
06 Mar 2025
Flying on Point Clouds with Reinforcement Learning
Guangtong Xu
Tianyue Wu
Zihan Wang
Qianhao Wang
Fei Gao
3DPC
45
0
0
01 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Reinforcement Learning with Curriculum-inspired Adaptive Direct Policy Guidance for Truck Dispatching
Shi Meng
Bin Tian
Xiaotong Zhang
OffRL
33
0
0
28 Feb 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
57
0
0
27 Feb 2025
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Toru Lin
Kartik Sachdev
Linxi Fan
Jitendra Malik
Yuke Zhu
44
7
0
27 Feb 2025
Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric
Zhenyu Tao
Wei Xu
Xiaohu You
OffRL
59
0
0
25 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
73
4
0
21 Feb 2025
Investigating Non-Transitivity in LLM-as-a-Judge
Yi Xu
Laura Ruis
Tim Rocktaschel
Robert Kirk
38
0
0
19 Feb 2025
Learning Strategy Representation for Imitation Learning in Multi-Agent Games
Shiqi Lei
Kanghon Lee
Linjing Li
Jinkyoo Park
OffRL
42
0
0
17 Feb 2025
Economics of Sourcing Human Data
Sebastin Santy
Prasanta Bhattacharya
Manoel Horta Ribeiro
Kelsey Allen
Sewoong Oh
69
0
0
11 Feb 2025
VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning
Qingyuan Wu
Jianheng Liu
Jianye Hao
J. Wang
Kun Shao
OffRL
100
0
0
11 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
J. Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
56
4
0
09 Feb 2025
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jayden Teoh
Wenjun Li
Pradeep Varakantham
53
1
0
08 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
70
2
0
04 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu-Xiang Wang
OffRL
70
0
0
01 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. DÓro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
36
3
0
28 Jan 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
45
0
0
24 Jan 2025
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Kimi Team
Angang Du
Bofei Gao
Bowei Xing
Changjiu Jiang
...
Zhilin Yang
Zhiqi Huang
Zihao Huang
Ziyao Xu
Z. Yang
VLM
ALM
OffRL
AI4TS
LRM
106
136
0
22 Jan 2025
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning
Ahmed Alagha
Jamal Bentahar
Hadi Otrok
Shakti Singh
R. Mizouni
53
3
0
19 Jan 2025
Explainable Reinforcement Learning for Formula One Race Strategy
Devin Thomas
Junqi Jiang
Avinash Kori
Aaron Russo
Steffen Winkler
Stuart Sale
Joseph McMillan
Francesco Belardinelli
Antonio Rago
LRM
35
0
0
07 Jan 2025
Turn-based Multi-Agent Reinforcement Learning Model Checking
Dennis Gross
39
0
0
06 Jan 2025
1
2
3
4
...
18
19
20
Next