T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Main:11 Pages
5 Figures
Bibliography:3 Pages
10 Tables
Appendix:5 Pages
Abstract
We propose T2I-ReasonBench, a benchmark evaluating reasoning capabilities of text-to-image (T2I) models. It consists of four dimensions: Idiom Interpretation, Textual Image Design, Entity-Reasoning and Scientific-Reasoning. We propose a two-stage evaluation protocol to assess the reasoning accuracy and image quality. We benchmark various T2I generation models, and provide comprehensive analysis on their performances.
View on arXivComments on this paper
