
Complexity-based code embeddings

International Conference on Computational Collective Intelligence (ICCCI), 2026
Rares Folea
Radu Iacob
Emil Slusanschi
Traian Rebedea
Main: 11 Pages
4 Figures
Bibliography: 2 Pages
2 Tables
Abstract

This paper presents a generic method for transforming the source code of various algorithms into numerical embeddings, by dynamically analysing the behaviour of computer programs against different inputs and by tailoring multiple generic complexity functions to the analysed metrics. The resulting algorithm embeddings are based on r-Complexity. Using the proposed code embeddings, we present an implementation of the XGBoost algorithm that achieves an average F1-score on a multi-label dataset with 11 classes, built using real-world code snippets submitted for programming competitions on the Codeforces platform.
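The idea of fitting generic complexity functions to dynamically measured metrics can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce r-Complexity itself. It assumes a hypothetical family of candidate growth functions (`CANDIDATES`), a single metric (comparison counts from an instrumented bubble sort), and a one-parameter least-squares fit `metric ≈ c * g(n)`. The fitted constants and relative errors are concatenated into a flat vector that could then be passed to a classifier.

```python
import math

# Hypothetical candidate growth functions; the paper's actual family may differ.
CANDIDATES = {
    "log n": lambda n: math.log2(n),
    "n": lambda n: float(n),
    "n log n": lambda n: n * math.log2(n),
    "n^2": lambda n: float(n) ** 2,
}


def fit_constant(sizes, values, g):
    """Least-squares fit of values ~ c * g(n); returns (c, relative RMS error)."""
    gs = [g(n) for n in sizes]
    c = sum(v * x for v, x in zip(values, gs)) / sum(x * x for x in gs)
    sq_err = sum((v - c * x) ** 2 for v, x in zip(values, gs))
    sq_total = sum(v * v for v in values)
    return c, math.sqrt(sq_err / sq_total)


def embed(sizes, values):
    """Concatenate (constant, error) for every candidate into one flat vector."""
    vec = []
    for g in CANDIDATES.values():
        c, err = fit_constant(sizes, values, g)
        vec.extend([c, err])
    return vec


def count_bubble_sort_comparisons(n):
    """Instrumented bubble sort on a reversed list; returns the comparison count."""
    data = list(range(n, 0, -1))
    comparisons = 0
    for i in range(n):
        for j in range(n - 1 - i):
            comparisons += 1
            if data[j] > data[j + 1]:
                data[j], data[j + 1] = data[j + 1], data[j]
    return comparisons


# Run the program against inputs of different sizes and record the metric.
sizes = [16, 32, 64, 128, 256]
metric = [count_bubble_sort_comparisons(n) for n in sizes]

embedding = embed(sizes, metric)
best = min(
    CANDIDATES,
    key=lambda name: fit_constant(sizes, metric, CANDIDATES[name])[1],
)
```

Here `embedding` holds eight numbers (a constant and an error for each of the four candidates), and `best` names the candidate with the smallest relative error. A real pipeline would presumably track several metrics per program and fit a richer function family.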
