FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback


20 July 2023

Prateek R. Agarwal

Zixuan Huang

Sungchul Kim

Victor S. Bursztyn

Papers citing "FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback"


Title
Every Part Matters: Integrity Verification of Scientific Figures Based on Multimodal Large Language Models | Xiang Shi, Jiawei Liu, Yinpeng Liu, Qikai Cheng, Wei Lu | 28 0 0 | 26 Jul 2024
Enhancing Scientific Figure Captioning Through Cross-modal Learning | Mateo Alejandro Rojas, Rafael Carranza | 31 0 0 | 24 Jun 2024
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation | Jonathan Roberts, Kai Han, N. Houlsby, Samuel Albanie | 27 8 0 | 14 May 2024
SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings | Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan A. Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, ‘Kenneth’ Huang | 11 1 0 | 26 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models | Kung-Hsiang Huang, Hou Pong Chan, Yi Ren Fung, Haoyi Qiu, Mingyang Zhou, Shafiq R. Joty, Shih-Fu Chang, Heng Ji | AI4TS | 49 14 0 | 18 Mar 2024
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Jonas Belouadi, Anne Lauscher, Steffen Eger | 6 11 0 | 30 Sep 2023
Training language models to follow instructions with human feedback | Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe | OSLM ALM | 301 11,730 0 | 04 Mar 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Junnan Li, Dongxu Li, Caiming Xiong, S. Hoi | MLLM BDL VLM CLIP | 380 4,010 0 | 28 Jan 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models | Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, D. Florêncio, Cha Zhang, Zhoujun Li, Furu Wei | ViT | 81 214 0 | 21 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning | Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, S. Cascianelli, G. Fiameni, Rita Cucchiara | 3DV VLM MLLM | 51 244 0 | 14 Jul 2021