v1v2 (latest)

$\mathbf{S^2LM}$ : Towards Semantic Steganography via Large Language Models

7 November 2025

ArXiv (abs)PDF HTML Github (164★)

Main:8 Pages

25 Figures

Bibliography:3 Pages

16 Tables

Appendix:19 Pages

Abstract

Despite remarkable progress in steganography, embedding semantically rich, sentence-level information into carriers remains a challenging problem. In this work, we present a novel concept of Semantic Steganography, which aims to hide semantically meaningful and structured content, such as sentences or paragraphs, in cover media. Based on this concept, we present Sentence-to-Image Steganography as an instance that enables the hiding of arbitrary sentence-level messages within a cover image. To accomplish this feat, we propose S^2LM: Semantic Steganographic Language Model, which leverages large language models (LLMs) to embed high-level textual information into images. Unlike traditional bit-level approaches, S^2LM redesigns the entire pipeline, involving the LLM throughout the process to enable the hiding and recovery of arbitrary sentences. Furthermore, we establish a benchmark named Invisible Text (IVT), comprising a diverse set of sentence-level texts as secret messages to evaluate semantic steganography methods. Experimental results demonstrate that S^2LM effectively enables direct sentence recovery beyond bit-level steganography. The source code and IVT dataset will be released soon.

View on arXiv

Comments on this paper

S2LM\mathbf{S^2LM}S2LM: Towards Semantic Steganography via Large Language Models

$\mathbf{S^2LM}$ : Towards Semantic Steganography via Large Language Models