ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.01603
  4. Cited By
Locate Then Generate: Bridging Vision and Language with Bounding Box for
  Scene-Text VQA

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

4 April 2023
Yongxin Zhu
Z. Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
ArXivPDFHTML

Papers citing "Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA"

2 / 2 papers shown
Title
Scene-Text Grounding for Text-Based Video Question Answering
Scene-Text Grounding for Text-Based Video Question Answering
Sheng Zhou
Junbin Xiao
Xun Yang
Peipei Song
Dan Guo
Angela Yao
Meng Wang
Tat-Seng Chua
48
1
0
22 Sep 2024
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in
  Natural Images
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
172
515
0
26 Jan 2016
1