SciFig: Towards Automating Scientific Figure Generation

7 January 2026

Siyuan Huang

Yutong Gao

Juyang Bai

Yifan Zhou

Zi Yin

Xinxin Liu

Rama Chellappa

Chun Pong Lau

Sayan Nag

Cheng Peng

Shraman Pramanick

ArXiv (abs)PDF HTML Github

Main:8 Pages

16 Figures

Bibliography:3 Pages

4 Tables

Appendix:10 Pages

Abstract

Creating high-quality figures and visualizations for scientific papers is a time-consuming task that requires both deep domain knowledge and professional design skills. Despite over 2.5 million scientific papers published annually, the figure generation process remains largely manual. We introduce $\textbf{SciFig}$ , an end-to-end AI agent system that generates publication-ready pipeline figures directly from research paper texts. SciFig uses a hierarchical layout generation strategy, which parses research descriptions to identify component relationships, groups related elements into functional modules, and generates inter-module connections to establish visual organization. Furthermore, an iterative chain-of-thought (CoT) feedback mechanism progressively improves layouts through multiple rounds of visual analysis and reasoning. We introduce a rubric-based evaluation framework that analyzes 2,219 real scientific figures to extract evaluation rubrics and automatically generates comprehensive evaluation criteria. SciFig demonstrates remarkable performance: achieving 70.1 $\%$ overall quality on dataset-level evaluation and 66.2 $\%$ on paper-specific evaluation, and consistently high scores across metrics such as visual clarity, structural organization, and scientific accuracy. SciFig figure generation pipeline and our evaluation benchmark will be open-sourced.

View on arXiv

Comments on this paper