Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.01813
Cited By
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
3 August 2022
Jun Wang
M. Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
J. JáJá
Larry S. Davis
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation"
5 / 5 papers shown
Title
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
86
11
0
03 Mar 2023
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Hao Li
Jinfa Huang
Peng Jin
Guoli Song
Qi Wu
Jie Chen
27
20
0
21 Sep 2022
ESSumm: Extractive Speech Summarization from Untranscribed Meeting
Jun Wang
20
7
0
14 Sep 2022
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
177
515
0
26 Jan 2016
1