CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

26 June 2024

ArXiv (abs)PDF HTML HuggingFace (30 upvotes)Github (26894★)

Papers citing "CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs"

50 / 112 papers shown

VACoT: Rethinking Visual Data Augmentation with VLMs

117

02 Dec 2025

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

...

232

01 Dec 2025

ChartPoint: Guiding MLLMs with Grounding Reflection for Chart Reasoning

322

29 Nov 2025

Qwen3-VL Technical Report

...

2.2K

446

26 Nov 2025

CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization

188

24 Nov 2025

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Xingxuan Li

304

20 Nov 2025

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

...

729

16 Nov 2025

DeepEyesV2: Toward Agentic Multimodal ModelIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

181

07 Nov 2025

V-Thinker: Interactive Thinking with Images

...

496

06 Nov 2025

NVIDIA Nemotron Nano V2 VL

Nvidia

Amala Sanjay Deshmukh

...

401

06 Nov 2025

ChartM

^3

: A Multi-Stage Code-Driven Pipeline for Constructing Multi-Dimensional and Multi-Step Visual Reasoning Data in Chart ComprehensionConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

200

04 Nov 2025

The Ouroboros of Benchmarking: Reasoning Evaluation in an Era of Saturation

İbrahim Ethem Deveci

Duygu Ataman

ReLM ALM ELM LRM

274

03 Nov 2025

TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning

573

03 Nov 2025

ConnectomeBench: Can LLMs Proofread the Connectome?

Jeff Brown

Andrew Kirjner Annika Vivekananthan

Ed Boyden

MLLM

176

31 Oct 2025

ChartAB: A Benchmark for Chart Grounding & Dense Alignment

237

30 Oct 2025

A Survey of AI Scientists

458

27 Oct 2025

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

228

27 Oct 2025

A Coherence-Based Measure of AGI

Fares Fourati

164

23 Oct 2025

Structured and Abstractive Reasoning on Multi-modal Relational Knowledge Images

170

22 Oct 2025

UNO-Bench: A Unified Benchmark for Exploring the Compositional Law Between Uni-modal and Omni-modal in Omni Models

226

21 Oct 2025

Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input

273

19 Oct 2025

MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models

...

182

18 Oct 2025

Composition-Grounded Data Synthesis for Visual Reasoning

177

16 Oct 2025

RECODE: Reasoning Through Code Generation for Visual Question Answering

198

15 Oct 2025

Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning

Ernesto Gabriel Hernández Montoya

...

365

14 Oct 2025

A Survey on Agentic Multimodal Large Language Models

...

LM&Ro AIFin AI4TS LRM AI4CE

302

13 Oct 2025

Towards Efficient Multimodal Unified Reasoning Model via Model Merging

355

10 Oct 2025

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

202

09 Oct 2025

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

150

07 Oct 2025

Large Language Models Achieve Gold Medal Performance at the International Olympiad on Astronomy & Astrophysics (IOAA)

Lucas Carrit Delgado Pinheiro

173

06 Oct 2025

ContextNav: Towards Agentic Multimodal In-Context Learning

208

06 Oct 2025

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

...

281

06 Oct 2025

RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation

200

02 Oct 2025

What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?

233

02 Oct 2025

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images

192

29 Sep 2025

AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy

230

29 Sep 2025

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

...

444

100

28 Sep 2025

CoFFT: Chain of Foresight-Focus Thought for Visual Language Models

347

26 Sep 2025

Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding

175

26 Sep 2025

CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition

187

24 Sep 2025

OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment

Teng Xiao

Zuchao Li

Lefei Zhang

302

23 Sep 2025

Losing the Plot: How VLM responses degrade on imperfect charts

P. W. Shin

Jack Sampson

Vijaykrishnan Narayanan

Andres Marquez

Mahantesh Halappanavar

136

22 Sep 2025

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

197

11 Sep 2025

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models

240

04 Sep 2025

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

316

28 Aug 2025

Do MLLMs Really Understand the Charts?

222

27 Aug 2025

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

...

361

525

25 Aug 2025

DashboardQA: Benchmarking Multimodal Agents for Question Answering on Interactive Dashboards

Aaryaman Kartha

Ahmed Masry

Mohammed Saidul Islam

...

110

24 Aug 2025

XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

316

20 Aug 2025

Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation

252

18 Aug 2025