Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1907.12861
Cited By
LEAF-QA: Locate, Encode & Attend for Figure Question Answering
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
30 July 2019
Ritwick Chaudhry
Sumit Shekhar
Utkarsh Gupta
Pranav Maneriker
Prann Bansal
Ajay Joshi
LMTD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LEAF-QA: Locate, Encode & Attend for Figure Question Answering"
50 / 54 papers shown
PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies
Lukas Selch
Yufang Hou
Muhammad Jehanzeb Mirza
Sivan Doveh
James Glass
Rogerio Feris
Wei Lin
215
0
0
18 Oct 2025
Benchmark It Yourself (BIY): Preparing a Dataset and Benchmarking AI Models for Scatterplot-Related Tasks
João Palmeiro
Diogo Duarte
Rita Costa
P. Bizarro
120
0
0
07 Oct 2025
ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement
Ali Salamatian
Amirhossein Abaskohi
Wan-Cyuan Fan
Mir Rayat Imtiaz Hossain
Leonid Sigal
Giuseppe Carenini
92
1
0
16 Sep 2025
BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning
Ahmed Masry
Abhay Puri
Masoud Hashemi
Juan A. Rodriguez
Megh Thakkar
...
David Vazquez
Enamul Hoque
Perouz Taslakian
Sai Rajeswar
Spandana Gella
AI4TS
128
5
0
13 Aug 2025
InfoCausalQA:Can Models Perform Non-explicit Causal Reasoning Based on Infographic?
Keummin Ka
J. S. Park
Jaehyun Jeon
Youngjae Yu
ReLM
CML
141
0
0
08 Aug 2025
WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Negar Foroutan
Angelika Romanou
Matin Ansaripour
Julian Martin Eisenschlos
Karl Aberer
R. Lebret
253
2
0
18 Jun 2025
ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering
Caijun Jia
Nan Xu
Jingxuan Wei
Qingli Wang
Lei Wang
Bihui Yu
Junnan Zhu
LRM
164
4
0
11 Jun 2025
ChartQA-X: Generating Explanations for Visual Chart Reasoning
Shamanthak Hegde
Pooyan Fazli
H. Seifi
355
0
0
17 Apr 2025
ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ahmed Masry
Mohammed Saidul Islam
Mahir Ahmed
Aayush Bajaj
Firoz Kabir
...
Mehrad Shahmohammadi
Megh Thakkar
Md. Rizwan Parvez
E. Hoque
Shafiq Joty
ELM
247
40
0
07 Apr 2025
DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts
Ling Zhong
Yujing Lu
Jing Yang
Weiming Li
Peng Wei
Yongheng Wang
Manni Duan
Qing Zhang
478
2
0
25 Mar 2025
Patent Figure Classification using Large Vision-language Models
European Conference on Information Retrieval (ECIR), 2025
Sushil Awale
Eric Müller-Budack
Ralph Ewerth
210
1
0
22 Jan 2025
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Risa Shinoda
Kuniaki Saito
Shohei Tanaka
Tosho Hirasawa
Yoshitaka Ushiku
172
3
0
23 Dec 2024
Understanding Graphical Perception in Data Visualization through Zero-shot Prompting of Vision-Language Models
Grace Guo
Jenna Jiayi Kang
Raj Sanjay Shah
Hanspeter Pfister
Sashank Varma
VLM
221
7
0
31 Oct 2024
SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Sunil Aryal
Imran Razzak
Hakim Hacid
233
0
0
30 Oct 2024
RealCQA-V2 : Visual Premise Proving A Manual COT Dataset for Charts
Saleem Ahmed
Ranga Setlur
Venu Govindaraju
ReLM
LRM
236
0
0
29 Oct 2024
ChartKG: A Knowledge-Graph-Based Representation for Chart Images
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024
Zhiguang Zhou
Haoxuan Wang
Zhengqing Zhao
Fengling Zheng
Yongheng Wang
Wei Chen
Yong Wang
287
4
0
13 Oct 2024
MAPWise: Evaluating Vision-Language Models for Advanced Map Queries
Srija Mukhopadhyay
Abhishek Rajgaria
Prerana Khatiwada
Vivek Gupta
Dan Roth
135
0
0
30 Aug 2024
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Jonathan Roberts
Kai Han
Samuel Albanie
248
3
0
21 Aug 2024
Datasets of Visualization for Machine Learning
Can Liu
Ruike Jiang
Shaocong Tan
Jiacheng Yu
Chaofan Yang
Hanning Shao
Xiaoru Yuan
XAI
362
0
0
23 Jul 2024
Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into Consistency and Robustness
Srija Mukhopadhyay
Adnan Qidwai
Aparna Garimella
Pritika Ramu
Vivek Gupta
Dan Roth
238
12
0
15 Jul 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Shraman Pramanick
Rama Chellappa
Subhashini Venugopalan
454
49
0
12 Jul 2024
mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning
Jingxuan Wei
Nan Xu
Guiyong Chang
Yin Luo
Bihui Yu
Ruifeng Guo
209
8
0
02 Apr 2024
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Zhuowan Li
Bhavan A. Jasani
Peng Tang
Shabnam Ghadar
LRM
314
24
0
25 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024
Kung-Hsiang Huang
Hou Pong Chan
Yi R. Fung
Haoyi Qiu
Mingyang Zhou
Shafiq Joty
Shih-Fu Chang
Chenhui Xu
AI4TS
461
55
0
18 Mar 2024
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Fuxiao Liu
Xiaoyang Wang
Wenlin Yao
Jianshu Chen
Kaiqiang Song
Sangwoo Cho
Yaser Yacoob
Dong Yu
206
162
0
15 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion (Inf. Fusion), 2023
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
397
71
0
01 Nov 2023
DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding
Anran Wu
Luwei Xiao
Xingjiao Wu
Shuwen Yang
Junjie Xu
Zisong Zhuang
Nian Xie
Cheng Jin
Xiaoling Wang
212
0
0
29 Oct 2023
DOMINO: A Dual-System for Multi-step Visual Language Reasoning
Peifang Wang
O. Yu. Golovneva
Armen Aghajanyan
Xiang Ren
Muhao Chen
Asli Celikyilmaz
Maryam Fazel-Zarandi
LRM
172
13
0
04 Oct 2023
Natural Language Dataset Generation Framework for Visualizations Powered by Large Language Models
International Conference on Human Factors in Computing Systems (CHI), 2023
Hyung-Kwon Ko
Hyeon Jeon
Gwanmo Park
Dae Hyun Kim
Nam Wook Kim
Juho Kim
Jinwook Seo
362
31
0
19 Sep 2023
Reviving Static Charts into Live Charts
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Lu Ying
Yun Wang
Haotian Li
Shuguang Dou
Haidong Zhang
Xinyang Jiang
Huamin Qu
Yingcai Wu
227
21
0
06 Sep 2023
SciGraphQA: A Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
Sheng Li
Nima Tajbakhsh
MLLM
207
68
0
07 Aug 2023
RealCQA: Scientific Chart Question Answering as a Test-bed for First-Order Logic
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Saleem Ahmed
Bhavin Jawade
Shubham Pandey
S. Setlur
Venugopal Govindaraju
154
7
0
03 Aug 2023
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh
Ashutosh Singh
Prateek R. Agarwal
Zixuan Huang
Arpita Singh
...
Ryan Rossi
Puneet Mathur
Erik Learned-Miller
Franck Dernoncourt
Ryan Rossi
287
8
0
20 Jul 2023
Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs
Mingyang Zhou
Yi R. Fung
Long Chen
Christopher Thomas
Heng Ji
Shih-Fu Chang
234
17
0
29 May 2023
The State of the Art in Creating Visualization Corpora for Automated Chart Analysis
Chong Chen
Zhicheng Liu
233
15
0
23 May 2023
PDFVQA: A New Dataset for Real-World VQA on PDF Documents
Yihao Ding
Siwen Luo
Hyunsuk Chung
S. Han
400
25
0
13 Apr 2023
Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking
Findings (Findings), 2023
Mubashara Akhtar
O. Cocarascu
Elena Simperl
297
35
0
27 Jan 2023
MapQA: A Dataset for Question Answering on Choropleth Maps
Shuaichen Chang
David Palzer
Jialin Li
Eric Fosler-Lussier
N. Xiao
178
67
0
15 Nov 2022
OpenCQA: Open-ended Question Answering with Charts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shankar Kantharaj
Do Xuan Long
Rixie Tiffany Ko Leong
J. Tan
Enamul Hoque
Shafiq Joty
159
68
0
12 Oct 2022
ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yu-Chung Hsiao
Fedir Zubach
Maria Wang
Jindong Chen
Victor Carbune
Jason Lin
Maria Wang
Yun Zhu
Jindong Chen
RALM
945
46
0
16 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
252
20
0
08 Sep 2022
Business Document Information Extraction: Towards Practical Benchmarks
Conference and Labs of the Evaluation Forum (CLEF), 2022
Matyás Skalický
Stepán Simsa
Michal Uřičář
Milan Šulc
181
12
0
20 Jun 2022
Chart Question Answering: State of the Art and Future Directions
Enamul Hoque
P. Kavehzadeh
Ahmed Masry
146
53
0
08 May 2022
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Findings (Findings), 2022
Ahmed Masry
Do Xuan Long
J. Tan
Shafiq Joty
Enamul Hoque
AIMat
415
1,126
0
19 Mar 2022
VisRecall: Quantifying Information Visualisation Recallability via Question Answering
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021
Yao Wang
Chuhan Jiao
Mihai Bâce
Andreas Bulling
227
7
0
30 Dec 2021
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
255
57
0
15 Dec 2021
Classification-Regression for Chart Comprehension
Matan Levy
Rami Ben-Ari
Dani Lischinski
141
17
0
29 Nov 2021
ICDAR 2021 Competition on Document VisualQuestion Answering
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2021
Rubèn Pérez Tito
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
211
29
0
10 Nov 2021
Towards Natural Language Interfaces for Data Visualization: A Survey
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021
Leixian Shen
Enya Shen
Yuyu Luo
Xiaocong Yang
Xuming Hu
Xiongshuai Zhang
Zhiwei Tai
Jianmin Wang
292
175
0
08 Sep 2021
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Computer Vision and Pattern Recognition (CVPR), 2021
Amanpreet Singh
Guan Pang
Mandy Toh
Jing Huang
Wojciech Galuba
Tal Hassner
257
214
0
12 May 2021
1
2
Next