ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.24862
161
6
v1v2v3 (latest)

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

30 May 2025
Cailin Zhuang
Ailin Huang
Wei Cheng
J. Wu
Yaoqi Hu
Jiaqi Liao
Zhewei Huang
Hongyuan Wang
Xinyao Liao
Weiwei Cai
Hengyuan Xu
Xuanyang Zhang
Xianfang Zeng
Gang Yu
Fangqiu Yi
    CoGe
ArXiv (abs)PDFHTMLHuggingFace (31 upvotes)
Main:31 Pages
11 Figures
Bibliography:4 Pages
13 Tables
Abstract

Story visualization, which aims to generate a sequence of visually coherent images aligning with a given narrative and reference images, has seen significant progress with recent advancements in generative models. To further enhance the performance of story visualization frameworks in real-world scenarios, we introduce a comprehensive evaluation benchmark, ViStoryBench. We collect a diverse dataset encompassing various story types and artistic styles, ensuring models are evaluated across multiple dimensions such as different plots (e.g., comedy, horror) and visual aesthetics (e.g., anime, 3D renderings). ViStoryBench is carefully curated to balance narrative structures and visual elements, featuring stories with single and multiple protagonists to test models' ability to maintain character consistency. Additionally, it includes complex plots and intricate world-building to challenge models in generating accurate visuals. To ensure comprehensive comparisons, our benchmark incorporates a wide range of evaluation metrics assessing critical aspects. This structured and multifaceted framework enables researchers to thoroughly identify both the strengths and weaknesses of different models, fostering targeted improvements.

View on arXiv
Comments on this paper