12

ChartE3^{3}: A Comprehensive Benchmark for End-to-End Chart Editing

Shuo Li
Jiajun Sun
Zhekai Wang
Xiaoran Fan
Hui Li
Dingwen Yang
Zhiheng Xi
Yijun Wang
Zifei Shan
Tao Gui
Qi Zhang
Xuanjing Huang
Main:8 Pages
10 Figures
Bibliography:3 Pages
10 Tables
Appendix:11 Pages
Abstract

Charts are a fundamental visualization format for structured data analysis. Enabling end-to-end chart editing according to user intent is of great practical value, yet remains challenging due to the need for both fine-grained control and global structural consistency. Most existing approaches adopt pipeline-based designs, where natural language or code serves as an intermediate representation, limiting their ability to faithfully execute complex edits. We introduce ChartE3^{3}, an End-to-End Chart Editing benchmark that directly evaluates models without relying on intermediate natural language programs or code-level supervision. ChartE3^{3} focuses on two complementary editing dimensions: local editing, which involves fine-grained appearance changes such as font or color adjustments, and global editing, which requires holistic, data-centric transformations including data filtering and trend line addition. ChartE3^{3} contains over 1,200 high-quality samples constructed via a well-designed data pipeline with human curation. Each sample is provided as a triplet of a chart image, its underlying code, and a multimodal editing instruction, enabling evaluation from both objective and subjective perspectives. Extensive benchmarking of state-of-the-art multimodal large language models reveals substantial performance gaps, particularly on global editing tasks, highlighting critical limitations in current end-to-end chart editing capabilities.

View on arXiv
Comments on this paper