Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.11624
Cited By
SciCap: Generating Captions for Scientific Figures
22 October 2021
Ting-Yao Hsu
C. Lee Giles
Ting-Hao 'Kenneth' Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SciCap: Generating Captions for Scientific Figures"
47 / 47 papers shown
Title
DomainCQA: Crafting Expert-Level QA from Domain-Specific Charts
Ling Zhong
Yujing Lu
Jing Yang
Weiming Li
Peng Wei
Yongheng Wang
Manni Duan
Qing Zhang
47
0
0
25 Mar 2025
RoboDesign1M: A Large-scale Dataset for Robot Design Understanding
T. H. Le
T. H. Nguyen
Quang-Dieu Tran
Quang Minh Nguyen
Baoru Huang
Hoan Nguyen
M. Vu
Tung D. Ta
A. Nguyen
3DV
81
0
0
09 Mar 2025
Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts
Xiangnan Chen
Yuancheng Fang
Qian Xiao
Juncheng Billy Li
J. Lin
Siliang Tang
Yi Yang
Yueting Zhuang
70
0
0
06 Mar 2025
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
S. Kamath S
Nakul Sharma
Manish Gupta
Anand Mishra
48
1
0
28 Jan 2025
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
Xuanle Zhao
Xianzhen Luo
Qi Shi
C. L. P. Chen
Shuo Wang
Wanxiang Che
Zhiyuan Liu
Maosong Sun
MLLM
54
2
0
11 Jan 2025
Multi-LLM Collaborative Caption Generation in Scientific Documents
Jaeyoung Kim
J. B. Lee
Hong-Jun Choi
Ting-Yao Hsu
Chieh-Yang Huang
...
Ryan Rossi
Tong Yu
C. Lee Giles
Ting-Hao 'Kenneth' Huang
S. Choi
21
2
0
05 Jan 2025
Natural Language Generation for Visualizations: State of the Art, Challenges and Future Directions
Enamul Hoque
Mohammed Saidul Islam
29
2
0
29 Sep 2024
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement
Ishani Mondal
Zongxia Li
Yufang Hou
Anandhavelu Natarajan
Aparna Garimella
Jordan Boyd-Graber
31
3
0
28 Sep 2024
Every Part Matters: Integrity Verification of Scientific Figures Based on Multimodal Large Language Models
Xiang Shi
Jiawei Liu
Yinpeng Liu
Qikai Cheng
Wei Lu
39
0
0
26 Jul 2024
Datasets of Visualization for Machine Learning
Can Liu
Ruike Jiang
Shaocong Tan
Jiacheng Yu
Chaofan Yang
Hanning Shao
Xiaoru Yuan
XAI
29
0
0
23 Jul 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Shraman Pramanick
Rama Chellappa
Subhashini Venugopalan
48
13
0
12 Jul 2024
FlowLearn: Evaluating Large Vision-Language Models on Flowchart Understanding
Huitong Pan
Qi Zhang
Cornelia Caragea
Eduard Constantin Dragut
Longin Jan Latecki
33
4
0
06 Jul 2024
Figuring out Figures: Using Textual References to Caption Scientific Figures
Stanley Cao
Kevin Liu
34
0
0
25 Jun 2024
cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers
Anirudh S. Sundar
Jin Xu
William Gay
Christopher Richardson
Larry Heck
49
0
0
12 Jun 2024
Faithful Chart Summarization with ChaTS-Pi
Syrine Krichene
Francesco Piccinno
Fangyu Liu
Julian Martin Eisenschlos
32
1
0
29 May 2024
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Jonathan Roberts
Kai Han
N. Houlsby
Samuel Albanie
40
12
0
14 May 2024
SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings
Ting-Yao Hsu
Chieh-Yang Huang
Shih-Hong Huang
Ryan A. Rossi
Sungchul Kim
Tong Yu
C. Lee Giles
‘Kenneth’ Huang
19
6
0
26 Mar 2024
RAAMove: A Corpus for Analyzing Moves in Research Article Abstracts
Hongzheng Li
Ruojin Wang
Ge Shi
Xing Lv
Lei Lei
Chong Feng
Fang Liu
Jinkun Lin
Yangguang Mei
Lingnan Xu
19
0
0
23 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang
Hou Pong Chan
Yi Ren Fung
Haoyi Qiu
Mingyang Zhou
Shafiq R. Joty
Shih-Fu Chang
Heng Ji
AI4TS
64
14
0
18 Mar 2024
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Lei Li
Yuqi Wang
Runxin Xu
Peiyi Wang
Xiachong Feng
Lingpeng Kong
Qi Liu
37
51
0
01 Mar 2024
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions
Ryota Tanaka
Taichi Iki
Kyosuke Nishida
Kuniko Saito
Jun Suzuki
VLM
21
23
0
24 Jan 2024
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning
Fanqing Meng
Wenqi Shao
Quanfeng Lu
Peng Gao
Kaipeng Zhang
Yu Qiao
Ping Luo
27
45
0
04 Jan 2024
ChartBench: A Benchmark for Complex Visual Reasoning in Charts
Zhengzhuo Xu
Sinan Du
Yiyan Qi
Chengjin Xu
Chun Yuan
Jian Guo
35
34
0
26 Dec 2023
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model
Anwen Hu
Yaya Shi
Haiyang Xu
Jiabo Ye
Qinghao Ye
Mingshi Yan
Chenliang Li
Qi Qian
Ji Zhang
Fei Huang
MLLM
36
25
0
30 Nov 2023
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
Fuxiao Liu
Xiaoyang Wang
Wenlin Yao
Jianshu Chen
Kaiqiang Song
Sangwoo Cho
Yaser Yacoob
Dong Yu
24
99
0
15 Nov 2023
GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions
Ting-Yao Hsu
Chieh-Yang Huang
Ryan A. Rossi
Sungchul Kim
C. Lee Giles
‘Kenneth’ Huang
21
12
0
23 Oct 2023
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ
Jonas Belouadi
Anne Lauscher
Steffen Eger
21
27
0
30 Sep 2023
PatFig: Generating Short and Long Captions for Patent Figures
Dana Aubakirova
Kim Gerdes
Lufei Liu
17
9
0
15 Sep 2023
SciGraphQA: A Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
Sheng R. Li
Nima Tajbakhsh
MLLM
11
48
0
07 Aug 2023
Tackling Hallucinations in Neural Chart Summarization
Saad Obaid ul Islam
Iza vSkrjanec
Ondrej Dusek
Vera Demberg
HILM
29
7
0
01 Aug 2023
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback
Ashish Singh
Prateek R. Agarwal
Zixuan Huang
Arpita Singh
Tong Yu
Sungchul Kim
Victor S. Bursztyn
N. Vlassis
Ryan A. Rossi
28
6
0
20 Jul 2023
SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions
Sameera Horawalavithana
Sai Munikoti
Ian Stewart
Henry Kvinge
MLLM
19
20
0
03 Jul 2023
VisText: A Benchmark for Semantically Rich Chart Captioning
Benny J. Tang
Angie Boggust
Arvind Satyanarayan
20
76
0
28 Jun 2023
SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning
Zhishen Yang
Raj Dabre
Hideki Tanaka
Naoaki Okazaki
18
18
0
06 Jun 2023
Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs
Mingyang Zhou
Yi Ren Fung
Long Chen
Christopher Thomas
Heng Ji
Shih-Fu Chang
15
11
0
29 May 2023
The ACL OCL Corpus: Advancing Open Science in Computational Linguistics
Shaurya Rohatgi
Yanxia Qin
Benjamin Aw
Niranjana Unnithan
MingSung Kan
LMTD
18
12
0
24 May 2023
The State of the Art in Creating Visualization Corpora for Automated Chart Analysis
C. L. P. Chen
Zhicheng Liu
16
12
0
23 May 2023
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jinwoo Shin
VLM
CLIP
41
21
0
23 May 2023
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries
Raian Rahman
Rizvi Hasan
Abdullah Al Farhad
Md Tahmid Rahman Laskar
Md. Hamjajul Ashmafee
A. Kamal
19
23
0
26 Apr 2023
Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization
Huang Chieh-Yang
Ting-Yao Hsu
Ryan A. Rossi
A. Nenkova
Sungchul Kim
G. Chan
Eunyee Koh
C. Lee Giles
Ting-Hao 'Kenneth' Huang
14
16
0
23 Feb 2023
TaTa: A Multilingual Table-to-Text Dataset for African Languages
Sebastian Gehrmann
Sebastian Ruder
Vitaly Nikolaev
Jan A. Botha
Michael Chavinda
Ankur P. Parikh
Clara E. Rivera
LMTD
19
10
0
31 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
30
18
0
19 Oct 2022
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers
S. Chintalapati
Jonathan Bragg
Lucy Lu Wang
38
20
0
27 Sep 2022
LineCap: Line Charts for Data Visualization Captioning Models
Anita Mahinpei
Zona Kostic
Christy Tanner
VLM
27
17
0
15 Jul 2022
Chart-to-Text: A Large-Scale Benchmark for Chart Summarization
Shankar Kanthara
Rixie Tiffany Ko Leong
Xiang Lin
Ahmed Masry
Megh Thakkar
Enamul Hoque
Shafiq R. Joty
14
135
0
12 Mar 2022
#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
31
10
0
21 Mar 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,923
0
17 Aug 2015
1