Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1708.04782
Cited By
StarCraft II: A New Challenge for Reinforcement Learning
16 August 2017
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
Michelle Yeo
Alireza Makhzani
Heinrich Küttler
J. Agapiou
Julian Schrittwieser
John Quan
Stephen Gaffney
Stig Petersen
Karen Simonyan
Tom Schaul
H. V. Hasselt
David Silver
Timothy Lillicrap
Kevin Calderone
Paul Keet
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"StarCraft II: A New Challenge for Reinforcement Learning"
50 / 414 papers shown
Adaptive Command: Real-Time Policy Adjustment via Language Models in StarCraft II
Weiyu Ma
Dongyu Xu
Shu Lin
Haifeng Zhang
Jun Wang
LLMAG
376
2
0
24 Dec 2025
Switch-JustDance: Benchmarking Whole Body Motion Tracking Controllers Using a Commercial Console Game
J. Kim
Wontaek Kim
Yidan Lu
Jin Cheng
Fatemeh Zargarbashi
...
Zhiyang Dou
Nitish Sontakke
Donghoon Baek
Sehoon Ha
Tianyu Li
150
1
0
22 Nov 2025
IPR-1: Interactive Physical Reasoner
Mingyu Zhang
Lifeng Zhuo
Tianxi Tan
Guocan Xie
Xian Nie
...
Renjie Zhao
Zizhu He
Z. Wang
Jiting Cai
Yong-Lu Li
PINN
LRM
AI4CE
500
0
0
19 Nov 2025
HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning
Long H Dang
David Rawlinson
LRM
281
0
0
26 Oct 2025
Human-Allied Relational Reinforcement Learning
Fateme Golivand Darvishvand
Hikaru Shindo
Sahil Sidheekh
Kristian Kersting
S. Natarajan
OffRL
143
0
0
17 Oct 2025
Narrowing Action Choices with AI Improves Human Sequential Decisions
Eleni Straitouri
Stratis Tsirtsis
Ander Artola Velasco
Manuel Gomez Rodriguez
173
1
0
17 Oct 2025
RLRF: Competitive Search Agent Design via Reinforcement Learning from Ranker Feedback
Tommy Mordo
Sagie Dekel
Omer Madmon
Moshe Tennenholtz
Oren Kurland
178
1
0
05 Oct 2025
Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge
Charlie Masters
Advaith Vellanki
J. Shangguan
Bart Kultys
Jonathan Gilmore
Alastair Moore
Stefano V. Albrecht
218
5
0
02 Oct 2025
On the Convergence of Policy Mirror Descent with Temporal Difference Evaluation
Jiacai Liu
Wenye Li
Ke Wei
219
1
0
23 Sep 2025
AI Methods for Permutation Circuit Synthesis Across Generic Topologies
Victor Villar
Juan Cruz-Benito
Ismael Faro
David Kremer
192
0
0
19 Sep 2025
Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning
Sijia Cui
Shuai Xu
Aiyao He
Yanna Wang
Bo Xu
LLMAG
311
2
0
16 Sep 2025
Imagined Autocurricula
Ahmet H. Güzel
Matthew Jackson
Jarek Liesen
Tim Rocktaschel
Jakob Foerster
Ilija Bogunovic
Jack Parker-Holder
311
2
0
11 Sep 2025
What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking
Yuan Sui
Yanming Zhang
Yi Liao
Yu Gu
Guohua Tang
Zhongqian Sun
Wei Yang
Xu Cheng
LLMAG
457
0
0
05 Sep 2025
A Comprehensive Review of Multi-Agent Reinforcement Learning in Video Games
IEEE Transactions on Games (IEEE Trans. Games), 2025
Zhengyang Li
Qijin Ji
Xinghong Ling
Quan Liu
210
42
0
03 Sep 2025
Lattice Annotated Temporal (LAT) Logic for Non-Markovian Reasoning
Kaustuv Mukherji
Jaikrishna Manojkumar Patil
Dyuman Aditya
Paulo Shakarian
Devendra Parkar
Lahari Pokala
Clark Dorman
Gerardo Simari
LRM
202
0
0
03 Sep 2025
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Yi Liao
Yu Gu
Yuan Sui
Zining Zhu
Yifan Lu
Guohua Tang
Zhongqian Sun
Wei Yang
OffRL
ReLM
LM&Ro
LRM
230
2
0
29 Aug 2025
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data
Ahmet H. Güzel
Ilija Bogunovic
Jack Parker-Holder
OffRL
OnRL
261
0
0
17 Aug 2025
EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making
Yang Cheng
Zilai Wang
Weiyu Ma
Wenhui Zhu
Yue Deng
Jian Zhao
LRM
425
2
0
13 Aug 2025
ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning
Debamita Ghosh
George Atia
Yue Wang
OffRL
OOD
440
3
0
05 Aug 2025
TacticCraft: Natural Language-Driven Tactical Adaptation for StarCraft II
Weiyu Ma
Jiwen Jiang
Haobo Fu
Haifeng Zhang
216
0
0
21 Jul 2025
Hierarchical Learning-Enhanced MPC for Safe Crowd Navigation with Heterogeneous Constraints
Huajian Liu
Yixuan Feng
W. Dong
Kunpeng Fan
Chao Wang
Yongzhuo Gao
388
1
0
11 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
398
0
0
06 Jun 2025
Leveraging Reward Models for Guiding Code Review Comment Generation
Oussama Ben Sghaier
Rosalia Tufano
Gabriele Bavota
Houari Sahraoui
222
2
0
04 Jun 2025
Strategy-Augmented Planning for Large Language Models via Opponent Exploitation
Shuai Xu
Sijia Cui
Longji Xu
Bo Xu
Qi Wang
RALM
627
1
0
13 May 2025
How to Adapt Control Barrier Functions? A Learning-Based Approach with Applications to a VTOL Quadplane
IEEE Conference on Decision and Control (CDC), 2025
Taekyung Kim
Randal W. Beard
Dimitra Panagou
538
0
0
03 Apr 2025
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Mirko Stappert
Bernhard Lutz
Niklas Goby
Dirk Neumann
OffRL
247
2
0
03 Apr 2025
Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review
Edward Gu
H. Siu
Melanie Platt
Isabelle Hurley
Jaime D. Peña
Rohan R. Paleja
238
2
0
25 Mar 2025
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents
International Conference on Learning Representations (ICLR), 2025
Tristan Tomilin
Meng Fang
Mykola Pechenizkiy
452
4
0
11 Mar 2025
DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments
Wenjie Tang
Yuan Zhou
Erqiang Xu
Keyan Cheng
Minne Li
Liquan Xiao
ELM
407
11
0
08 Mar 2025
Digital Player: Evaluating Large Language Models based Human-like Agent in Games
Jinqiao Wang
Kai Wang
Shaojie Lin
Runze Wu
Bihan Xu
...
Zhipeng Hu
Z. Fan
Le Li
Tangjie Lyu
Changjie Fan
LLMAG
ELM
AI4CE
410
3
0
28 Feb 2025
Physics-Aware Robotic Palletization with Online Masking Inference
IEEE International Conference on Robotics and Automation (ICRA), 2025
Tianqi Zhang
Zheng Wu
Yuxin Chen
Yixiao Wang
Boyuan Liang
Scott Moura
Masayoshi Tomizuka
Mingyu Ding
Weidong Zhan
OffRL
374
5
0
20 Feb 2025
Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity
Eric Zhao
Tatjana Chavdarova
Michael I. Jordan
362
1
0
20 Feb 2025
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Xiaojie Xu
Zongyuan Li
Chang Lu
Runnan Qi
Yanan Ni
...
Yongchun Fang
Kuihua Huang
Xian Guo
Zhanghua Wu
Zhenya Li
272
1
0
19 Feb 2025
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time
Zongyuan Li
Chang Lu
Xiaojie Xu
Runnan Qi
Yanan Ni
...
Xiangbei Liu
Xinsong Zhang
Yongchun Fang
Kuihua Huang
Xian Guo
266
1
0
16 Feb 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
271
0
0
24 Jan 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
542
1
0
10 Jan 2025
CREW: Facilitating Human-AI Teaming Research
Lingyu Zhang
Zhengran Ji
Boyuan Chen
595
9
0
03 Jan 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
IEEE Transactions on Games (IEEE Trans. Games), 2024
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
634
1
0
03 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
IEEE Transactions on Evolutionary Computation (TEVC), 2022
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
660
25
0
03 Jan 2025
GPT for Games: An Updated Scoping Review (2020-2024)
IEEE Transactions on Games (IEEE Trans. Games), 2024
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAG
AI4TS
AI4CE
626
17
0
01 Nov 2024
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Neural Information Processing Systems (NeurIPS), 2024
Bowen Li
Zhaoyu Li
Qiwei Du
Jinqi Luo
Wenshan Wang
...
Katia Sycara
Pradeep Kumar Ravikumar
Alexander G. Gray
X. Si
Sebastian A. Scherer
AI4CE
LRM
552
16
0
01 Nov 2024
Entity-based Reinforcement Learning for Autonomous Cyber Defence
Isaac Symes Thompson
Alberto Caron
Chris Hicks
V. Mavroudis
AAML
689
9
0
23 Oct 2024
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search
Jiamian Li
264
0
0
15 Oct 2024
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
389
7
0
11 Oct 2024
Carefully Structured Compression: Efficiently Managing StarCraft II Data
Bryce Ferenczi
Rhys Newbury
Michael G. Burke
Tom Drummond
230
1
0
11 Oct 2024
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Yangming Li
Chieh-Hsin Lai
Carola-Bibiane Schönlieb
Yuki Mitsufuji
Stefano Ermon
DiffM
306
1
0
02 Oct 2024
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
232
0
0
13 Sep 2024
BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems
Wei Wang
Dan Zhang
Tao Feng
Boyan Wang
Jie Tang
LLMAG
ELM
282
13
0
28 Aug 2024
Vanilla Gradient Descent for Oblique Decision Trees
European Conference on Artificial Intelligence (ECAI), 2024
Subrat Prasad Panda
B. Genest
Arvind Easwaran
Ponnuthurai Nagaratnam Suganthan
OffRL
356
3
0
17 Aug 2024
Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning
Bo Xia
Xianru Tian
Bo Yuan
Zhiheng Li
Bin Liang
Xueqian Wang
241
0
0
10 Aug 2024
1
2
3
4
5
6
7
8
9
Next
Page 1 of 9