Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06680
Cited By
Dota 2 with Large Scale Deep Reinforcement Learning
13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dota 2 with Large Scale Deep Reinforcement Learning"
50 / 991 papers shown
Title
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
45
0
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
42
16
0
03 Jan 2025
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
Kefan Su
Yusen Huo
Zhilin Zhang
Shuai Dou
Chuan Yu
Jian Xu
Zongqing Lu
Bo Zheng
75
4
0
31 Dec 2024
Symbolic Disentangled Representations for Images
Alexandr Korchemnyi
A. Kovalev
Aleksandr I. Panov
OCL
49
0
0
31 Dec 2024
Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
Joshua Holder
Natasha Jaques
Mehran Mesbahi
69
1
0
20 Dec 2024
Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report
Markus Dablander
75
0
0
18 Dec 2024
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Junjie Lin
Jian Zhao
Lin Liu
Yue Deng
Youpeng Zhao
Lanxiao Huang
Xia Lin
Wengang Zhou
H. Li
74
0
0
16 Dec 2024
Inverse Delayed Reinforcement Learning
S. Zhan
Qingyuan Wu
Zhian Ruan
Frank Yang
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
65
0
0
04 Dec 2024
Segmenting Action-Value Functions Over Time-Scales in SARSA via TD(
Δ
\Delta
Δ
)
Mahammad Humayoo
57
0
0
22 Nov 2024
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation
Ke Zhao
Huayang Huang
Miao Li
Yu Wu
AAML
71
0
0
21 Nov 2024
Personalized Help for Optimizing Low-Skilled Users' Strategy
Feng Gu
Wichayaporn Wongkamjan
Jordan Boyd-Graber
Jonathan K. Kummerfeld
Denis Peskoff
Jonathan May
28
0
0
14 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
20
2
0
08 Nov 2024
Interpreting the Learned Model in MuZero Planning
Hung Guei
Yan-Ru Ju
Wei-Yu Chen
Ti-Rong Wu
28
1
0
07 Nov 2024
Scaling Laws for Pre-training Agents and World Models
Tim Pearce
Tabish Rashid
Dave Bignell
Raluca Georgescu
Sam Devlin
Katja Hofmann
LM&Ro
40
6
0
07 Nov 2024
Opportunities of Reinforcement Learning in South Africa's Just Transition
Claude Formanek
C. Tilbury
Jonathan P. Shock
72
0
0
06 Nov 2024
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min-Bin Lin
36
3
0
03 Nov 2024
Role Play: Learning Adaptive Role-Specific Strategies in Multi-Agent Interactions
Weifan Long
Wen Wen
Peng Zhai
Lihua Zhang
26
0
0
02 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Thanh Nguyen-Tang
Raman Arora
74
1
0
01 Nov 2024
Beyond the Boundaries of Proximal Policy Optimization
Charlie B. Tan
Edan Toledo
Benjamin Ellis
Jakob Foerster
Ferenc Huszár
21
0
0
01 Nov 2024
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Hai Zhong
Xun Wang
Zhuoran Li
Longbo Huang
OffRL
OnRL
29
0
0
25 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Aaron C. Courville
OffRL
79
5
0
23 Oct 2024
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning
Shengjie Sun
Runze Liu
Jiafei Lyu
J. Yang
L. Zhang
Xiu Li
LRM
22
7
0
18 Oct 2024
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
29
0
0
16 Oct 2024
Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning
Bokai Hu
Sai Ashish Somayajula
Xin Pan
Zihan Huang
Pengtao Xie
OffRL
16
1
0
14 Oct 2024
Gradient-Free Neural Network Training on the Edge
Dotan Di Castro
O. Joglekar
Shir Kozlovsky
Vladimir Tchuiev
Michal Moshkovitz
MQ
14
0
0
13 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
37
0
0
11 Oct 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang
Ivana Dusparic
Yucheng Shi
Ke Zhang
V. Cahill
Mamba
131
0
0
11 Oct 2024
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Milad Farjadnasab
Shahin Sirouspour
33
0
0
08 Oct 2024
Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
S. Zhan
Qingyuan Wu
Philip Wang
Yixuan Wang
Ruochen Jiao
Chao Huang
Qi Zhu
31
1
0
04 Oct 2024
GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs
Pu Hua
Minghuan Liu
Annabella Macaluso
Yunfeng Lin
Weinan Zhang
Huazhe Xu
Lirui Wang
LM&Ro
LRM
34
14
0
04 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
SEAL: SEmantic-Augmented Imitation Learning via Language Model
Chengyang Gu
Yuxin Pan
Haotian Bai
Hui Xiong
Yize Chen
27
0
0
03 Oct 2024
Breaking the mold: The challenge of large scale MARL specialization
Stefan Juang
Hugh Cao
Arielle Zhou
Ruochen Liu
Nevin L. Zhang
Elvis Liu
16
1
0
03 Oct 2024
Realizable Continuous-Space Shields for Safe Reinforcement Learning
Kyungmin Kim
Davide Corsi
Andoni Rodríguez
JB Lanier
Benjami Parellada
Pierre Baldi
César Sánchez
Roy Fox
37
1
0
02 Oct 2024
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
40
1
0
02 Oct 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
21
0
0
28 Sep 2024
Esports Training, Periodization, and Software -- a Scoping Review
A. Białecki
Bartłomiej Michalak
Jan Gajewski
13
2
0
27 Sep 2024
Learning to Drive via Asymmetric Self-Play
Chris Zhang
Sourav Biswas
Kelvin Wong
Kion Fallah
Lunjun Zhang
Dian Chen
Sergio Casas
R. Urtasun
44
0
0
26 Sep 2024
Autonomous Network Defence using Reinforcement Learning
Myles Foley
Chris Hicks
Kate Highnam
V. Mavroudis
AAML
19
29
0
26 Sep 2024
CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation
Fuxian Huang
Qi Zhang
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Haoran Zhang
Ming Zhou
Yu Liu
Yu Qiao
CLIP
AI4TS
34
0
0
24 Sep 2024
Can VLMs Play Action Role-Playing Games? Take Black Myth Wukong as a Study Case
Peng Chen
Pi Bu
Jun Song
Yuan Gao
Bo Zheng
LLMAG
27
10
0
19 Sep 2024
Synthesizing Evolving Symbolic Representations for Autonomous Systems
Gabriele Sartor
A. Oddi
R. Rasconi
V. Santucci
Rosa Meo
21
0
0
18 Sep 2024
Robust Reinforcement Learning with Dynamic Distortion Risk Measures
Anthony Coache
S. Jaimungal
25
1
0
16 Sep 2024
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
25
0
0
13 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
27
0
0
05 Sep 2024
Learning to Move Like Professional Counter-Strike Players
David Durst
Feng Xie
Vishnu Sarukkai
Brennan Shacklett
I. Frosio
...
Carly Taylor
Gilbert Bernstein
Sanjiban Choudhury
Pat Hanrahan
Kayvon Fatahalian
31
0
0
25 Aug 2024
Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations
Scotty Black
Christian J. Darken
16
0
0
23 Aug 2024
Growing Deep Neural Network Considering with Similarity between Neurons
Taigo Sakai
Kazuhiro Hotta
30
0
0
23 Aug 2024
Lifelong Reinforcement Learning via Neuromodulation
Sebastian Lee
Samuel Liebana Garcia
Claudia Clopath
Will Dabney
44
0
0
15 Aug 2024
Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators
Mark Towers
Yali Du
Christopher T. Freeman
Timothy J. Norman
29
0
0
15 Aug 2024
Previous
1
2
3
4
5
...
18
19
20
Next