Adaptive Semantic Token Communication for Transformer-based Edge Inference

23 May 2025

Abstract

This paper presents an adaptive framework for edge inference based on a dynamically configurable transformer-powered deep joint source channel coding (DJSCC) architecture. Motivated by a practical scenario where a resource constrained edge device engages in goal oriented semantic communication, such as selectively transmitting essential features for object detection to an edge server, our approach enables efficient task aware data transmission under varying bandwidth and channel conditions. To achieve this, input data is tokenized into compact high level semantic representations, refined by a transformer, and transmitted over noisy wireless channels. As part of the DJSCC pipeline, we employ a semantic token selection mechanism that adaptively compresses informative features into a user specified number of tokens per sample. These tokens are then further compressed through the JSCC module, enabling a flexible token communication strategy that adjusts both the number of transmitted tokens and their embedding dimensions. We incorporate a resource allocation algorithm based on Lyapunov stochastic optimization to enhance robustness under dynamic network conditions, effectively balancing compression efficiency and task performance. Experimental results demonstrate that our system consistently outperforms existing baselines, highlighting its potential as a strong foundation for AI native semantic communication in edge intelligence applications.

View on arXiv

@article{devoto2025_2505.17604,
  title={ Adaptive Semantic Token Communication for Transformer-based Edge Inference },
  author={ Alessio Devoto and Jary Pomponi and Mattia Merluzzi and Paolo Di Lorenzo and Simone Scardapane },
  journal={arXiv preprint arXiv:2505.17604},
  year={ 2025 }
}

Comments on this paper