ResearchTrend.AI
AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks

22 May 2024
Omar Moured
Jiaming Zhang
M. Sarfraz
Rainer Stiefelhagen

Papers citing "AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks"

7 / 7 papers shown
Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion
Omar Moured
Morris Baumgarten-Egemole
Alina Roitberg
Karin Muller
Thorsten Schwarz
Rainer Stiefelhagen
38
7
0
11 Mar 2024
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
Kenton Lee
Mandar Joshi
Iulia Turc
Hexiang Hu
Fangyu Liu
Julian Martin Eisenschlos
Urvashi Khandelwal
Peter Shaw
Ming-Wei Chang
Kristina Toutanova
CLIP
VLM
148
259
0
07 Oct 2022
A Dataset of Alt Texts from HCI Publications: Analyses and Uses Towards Producing More Descriptive Alt Texts of Data Visualizations in Scientific Papers
S. Chintalapati
Jonathan Bragg
Lucy Lu Wang
30
20
0
27 Sep 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
Communicating Visualizations without Visuals: Investigation of Visualization Alternative Text for People with Visual Impairments
C. Jung
Shubham Mehta
Atharva Kulkarni
Yuhang Zhao
Yea-Seul Kim
36
54
0
08 Aug 2021
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,191
0
21 Nov 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
229
74,467
0
18 May 2015