Textual-to-Visual Iterative Self-Verification for Slide Generation

Abstract

Generating presentation slides is a time-consuming task that urgently requires automation. Existing autonomous LLM-based agents have limited flexibility and lack automated refinement mechanisms, which constrains their real-world applicability. We decompose the task of generating missing presentation slides into two key components, content generation and layout generation, in line with the typical process of creating academic slides. For content generation, we introduce an approach that enhances coherence and relevance by incorporating context from surrounding slides and leveraging section retrieval strategies. For layout generation, we propose a textual-to-visual self-verification process using an LLM-based Reviewer + Refiner workflow, which transforms complex textual layouts into intuitive visual formats. This modality transformation simplifies the task, enabling accurate and human-like review and refinement. Experiments show that our approach significantly outperforms baseline methods in terms of alignment, logical flow, visual appeal, and readability.
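The Reviewer + Refiner workflow described above can be pictured as a simple loop: review the current layout, stop if it is acceptable, otherwise refine it and review again. The following is a minimal sketch of that control flow only, not the paper's implementation. The names `Box`, `self_verify`, `toy_review`, and `toy_refine`, the 1280x720 pixel slide size, and the round limit are all assumptions made for illustration. In the paper the Reviewer is LLM-based and works on a visual rendering of the layout; here a rule-based overflow check stands in for it so that the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical slide size in pixels (an assumption, not from the paper).
SLIDE_W, SLIDE_H = 1280, 720


@dataclass
class Box:
    """One layout element: a name plus a bounding box in pixels."""
    name: str
    x: int
    y: int
    w: int
    h: int


Layout = List[Box]
# A reviewer returns feedback text, or None when the layout is acceptable.
Reviewer = Callable[[Layout], Optional[str]]
# A refiner returns a revised layout given the reviewer's feedback.
Refiner = Callable[[Layout, str], Layout]


def self_verify(layout: Layout, review: Reviewer, refine: Refiner,
                max_rounds: int = 3) -> Layout:
    """Alternate review and refinement until accepted or out of rounds."""
    for _ in range(max_rounds):
        feedback = review(layout)
        if feedback is None:
            break
        layout = refine(layout, feedback)
    return layout


def toy_review(layout: Layout) -> Optional[str]:
    """Stand-in reviewer: flag the first box that leaves the slide area."""
    for b in layout:
        if b.x < 0 or b.y < 0 or b.x + b.w > SLIDE_W or b.y + b.h > SLIDE_H:
            return f"{b.name} overflows the slide"
    return None


def toy_refine(layout: Layout, feedback: str) -> Layout:
    """Stand-in refiner: shrink and shift every box back onto the slide."""
    fixed = []
    for b in layout:
        w, h = min(b.w, SLIDE_W), min(b.h, SLIDE_H)
        x = max(0, min(b.x, SLIDE_W - w))
        y = max(0, min(b.y, SLIDE_H - h))
        fixed.append(Box(b.name, x, y, w, h))
    return fixed


draft = [Box("title", 100, 40, 1000, 100), Box("body", 800, 200, 700, 400)]
final = self_verify(draft, toy_review, toy_refine)
```

In this toy run the `body` box initially extends past the right edge of the slide, so one refinement round shifts it left before the reviewer accepts the layout. Swapping the two stand-in functions for model calls would leave the loop itself unchanged.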

@article{xu2025_2502.15412,
  title={Textual-to-Visual Iterative Self-Verification for Slide Generation},
  author={Yunqing Xu and Xinbei Ma and Jiyang Qiu and Hai Zhao},
  journal={arXiv preprint arXiv:2502.15412},
  year={2025}
}