14

SciFig: Towards Automating Scientific Figure Generation

Siyuan Huang
Yutong Gao
Juyang Bai
Yifan Zhou
Zi Yin
Xinxin Liu
Rama Chellappa
Chun Pong Lau
Sayan Nag
Cheng Peng
Shraman Pramanick
Main:8 Pages
16 Figures
Bibliography:3 Pages
4 Tables
Appendix:10 Pages
Abstract

Creating high-quality figures and visualizations for scientific papers is a time-consuming task that requires both deep domain knowledge and professional design skills. Despite over 2.5 million scientific papers published annually, the figure generation process remains largely manual. We introduce SciFig\textbf{SciFig}, an end-to-end AI agent system that generates publication-ready pipeline figures directly from research paper texts. SciFig uses a hierarchical layout generation strategy, which parses research descriptions to identify component relationships, groups related elements into functional modules, and generates inter-module connections to establish visual organization. Furthermore, an iterative chain-of-thought (CoT) feedback mechanism progressively improves layouts through multiple rounds of visual analysis and reasoning. We introduce a rubric-based evaluation framework that analyzes 2,219 real scientific figures to extract evaluation rubrics and automatically generates comprehensive evaluation criteria. SciFig demonstrates remarkable performance: achieving 70.1%\% overall quality on dataset-level evaluation and 66.2%\% on paper-specific evaluation, and consistently high scores across metrics such as visual clarity, structural organization, and scientific accuracy. SciFig figure generation pipeline and our evaluation benchmark will be open-sourced.

View on arXiv
Comments on this paper