61
0

Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks

Abstract

Soccer analytics rely on two data sources: the player positions on the pitch and the sequences of events they perform. With around 2000 ball events per game, their precise and exhaustive annotation based on a monocular video stream remains a tedious and costly manual task. While state-of-the-art spatio-temporal action detection methods show promise for automating this task, they lack contextual understanding of the game. Assuming professional players' behaviors are interdependent, we hypothesize that incorporating surrounding players' information such as positions, velocity and team membership can enhance purely visual predictions. We propose a spatio-temporal action detection approach that combines visual and game state information via Graph Neural Networks trained end-to-end with state-of-the-art 3D CNNs, demonstrating improved metrics through game state integration.

View on arXiv
@article{ochin2025_2502.15462,
  title={ Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks },
  author={ Jeremie Ochin and Guillaume Devineau and Bogdan Stanciulescu and Sotiris Manitsaris },
  journal={arXiv preprint arXiv:2502.15462},
  year={ 2025 }
}
Comments on this paper