StarCraft II: A New Challenge for Reinforcement Learning

16 August 2017

David Silver

Papers citing "StarCraft II: A New Challenge for Reinforcement Learning"

50 / 414 papers shown

Adaptive Command: Real-Time Policy Adjustment via Language Models in StarCraft II

376

24 Dec 2025

Switch-JustDance: Benchmarking Whole Body Motion Tracking Controllers Using a Commercial Console Game

...

150

22 Nov 2025

IPR-1: Interactive Physical Reasoner

...

500

19 Nov 2025

HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning

Long H Dang

David Rawlinson

LRM

281

26 Oct 2025

Human-Allied Relational Reinforcement Learning

Fateme Golivand Darvishvand

143

17 Oct 2025

Narrowing Action Choices with AI Improves Human Sequential Decisions

Eleni Straitouri

Stratis Tsirtsis

Ander Artola Velasco

Manuel Gomez Rodriguez

173

17 Oct 2025

RLRF: Competitive Search Agent Design via Reinforcement Learning from Ranker Feedback

178

05 Oct 2025

Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge

218

02 Oct 2025

On the Convergence of Policy Mirror Descent with Temporal Difference Evaluation

Jiacai Liu

Wenye Li

Ke Wei

219

23 Sep 2025

AI Methods for Permutation Circuit Synthesis Across Generic Topologies

192

19 Sep 2025

Empowering LLMs with Parameterized Skills for Adversarial Long-Horizon Planning

311

16 Sep 2025

Imagined Autocurricula

311

11 Sep 2025

What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking

457

05 Sep 2025

A Comprehensive Review of Multi-Agent Reinforcement Learning in Video Games

IEEE Transactions on Games (IEEE Trans. Games), 2025

210

03 Sep 2025

Lattice Annotated Temporal (LAT) Logic for Non-Markovian Reasoning

Kaustuv Mukherji

Jaikrishna Manojkumar Patil

202

03 Sep 2025

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

230

29 Aug 2025

Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data

261

17 Aug 2025

EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making

425

13 Aug 2025

ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning

440

05 Aug 2025

TacticCraft: Natural Language-Driven Tactical Adaptation for StarCraft II

216

21 Jul 2025

Hierarchical Learning-Enhanced MPC for Safe Crowd Navigation with Heterogeneous Constraints

388

11 Jun 2025

AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification

398

06 Jun 2025

Leveraging Reward Models for Guiding Code Review Comment Generation

222

04 Jun 2025

Strategy-Augmented Planning for Large Language Models via Opponent Exploitation

627

13 May 2025

How to Adapt Control Barrier Functions? A Learning-Based Approach with Applications to a VTOL Quadplane

IEEE Conference on Decision and Control (CDC), 2025

Taekyung Kim

Randal W. Beard

Dimitra Panagou

538

03 Apr 2025

Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research

247

03 Apr 2025

Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review

238

25 Mar 2025

HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents

International Conference on Learning Representations (ICLR), 2025

Tristan Tomilin

Meng Fang

Mykola Pechenizkiy

452

11 Mar 2025

DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments

407

08 Mar 2025

Digital Player: Evaluating Large Language Models based Human-like Agent in Games

...

410

28 Feb 2025

Physics-Aware Robotic Palletization with Online Masking Inference

IEEE International Conference on Robotics and Automation (ICRA), 2025

374

20 Feb 2025

Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity

Eric Zhao

Tatjana Chavdarova

Michael I. Jordan

362

20 Feb 2025

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

...

272

19 Feb 2025

Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time

...

266

16 Feb 2025

Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation

271

24 Jan 2025

Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques

542

10 Jan 2025

CREW: Facilitating Human-AI Teaming Research

Lingyu Zhang

Zhengran Ji

Boyuan Chen

595

03 Jan 2025

Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors

IEEE Transactions on Games (IEEE Trans. Games), 2024

...

Georgios N. Yannakakis

S. Risi

Julian Togelius

634

03 Jan 2025

Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution

IEEE Transactions on Evolutionary Computation (TEVC), 2022

660

03 Jan 2025

GPT for Games: An Updated Scoping Review (2020-2024)

IEEE Transactions on Games (IEEE Trans. Games), 2024

626

01 Nov 2024

LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation

Neural Information Processing Systems (NeurIPS), 2024

...

Pradeep Kumar Ravikumar

552

01 Nov 2024

Entity-based Reinforcement Learning for Autonomous Cyber Defence

689

23 Oct 2024

Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search

Jiamian Li

264

15 Oct 2024

Can we hop in general? A discussion of benchmark selection and design using the Hopper environment

389

11 Oct 2024

Carefully Structured Compression: Efficiently Managing StarCraft II Data

230

11 Oct 2024

Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space

Yangming Li

Chieh-Hsin Lai

Carola-Bibiane Schönlieb

Yuki Mitsufuji

Stefano Ermon

DiffM

306

02 Oct 2024

Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning

Alec Wilson

William Holmes

Ryan Menzies

Kez Smithson Whitehead

232

13 Sep 2024

BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems

Wei Wang

Dan Zhang

Tao Feng

Boyan Wang

Jie Tang

LLMAG ELM

282

28 Aug 2024

Vanilla Gradient Descent for Oblique Decision Trees

European Conference on Artificial Intelligence (ECAI), 2024

Subrat Prasad Panda

B. Genest

Arvind Easwaran

Ponnuthurai Nagaratnam Suganthan

OffRL

356

17 Aug 2024

Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning

Zhiheng Li

241

10 Aug 2024