ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.09809
  4. Cited By
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image
  Classification and Retrieval

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

21 September 2020
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
ArXivPDFHTML

Papers citing "Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval"

10 / 10 papers shown
Title
Large Language Models for Page Stream Segmentation
Large Language Models for Page Stream Segmentation
H. Heidenreich
Ratish Dalvi
Rohith Mukku
Nikhil Verma
Neven Pičuljan
35
0
0
21 Aug 2024
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter
Yiqun Wang
Zhao Zhou
Xiangcheng Du
Xingjiao Wu
Yingbin Zheng
Cheng Jin
34
0
0
03 Jul 2024
Out-of-Vocabulary Challenge Report
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
9
16
0
14 Sep 2022
Towards Multimodal Vision-Language Models Generating Non-Generic Text
Towards Multimodal Vision-Language Models Generating Non-Generic Text
Wes Robbins
Zanyar Zohourianshahzadi
Jugal Kalita
9
1
0
09 Jul 2022
Knowledge Mining with Scene Text for Fine-Grained Recognition
Knowledge Mining with Scene Text for Fine-Grained Recognition
Hao Wang
Junchao Liao
Tianheng Cheng
Zewen Gao
Hao Liu
Bo Ren
X. Bai
Wenyu Liu
14
14
0
27 Mar 2022
OCR-IDL: OCR Annotations for Industry Document Library Dataset
OCR-IDL: OCR Annotations for Industry Document Library Dataset
Ali Furkan Biten
Rubèn Pérez Tito
Lluís Gómez
Ernest Valveny
Dimosthenis Karatzas
13
26
0
25 Feb 2022
Fine-Grained Image Analysis with Deep Learning: A Survey
Fine-Grained Image Analysis with Deep Learning: A Survey
Xiu-Shen Wei
Yi-Zhe Song
Oisin Mac Aodha
Jianxin Wu
Yuxin Peng
Jinhui Tang
Jian Yang
Serge J. Belongie
66
277
0
11 Nov 2021
StacMR: Scene-Text Aware Cross-Modal Retrieval
StacMR: Scene-Text Aware Cross-Modal Retrieval
Andrés Mafla
Rafael Sampaio de Rezende
Lluís Gómez
Diane Larlus
Dimosthenis Karatzas
3DV
26
14
0
08 Dec 2020
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,458
0
06 Jun 2016
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in
  Natural Images
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
177
515
0
26 Jan 2016
1