ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.04613
  4. Cited By
Integrating Scene Text and Visual Appearance for Fine-Grained Image
  Classification
v1v2 (latest)

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification

15 April 2017
X. Bai
Mingkun Yang
Pengyuan Lyu
Yongchao Xu
Jiebo Luo
ArXiv (abs)PDFHTML

Papers citing "Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification"

23 / 23 papers shown
MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification
MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification
Yang Qiao
Xiaoyu Zhong
Xiaofeng Gu
Zhiguo Yu
272
0
0
29 May 2025
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter
Yiqun Wang
Zhao Zhou
Xiangcheng Du
Xingjiao Wu
Yingbin Zheng
Cheng Jin
319
1
0
03 Jul 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text
  Recognition
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
353
7
0
24 Feb 2024
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
ViSTA: Vision and Scene Text Aggregation for Cross-Modal RetrievalComputer Vision and Pattern Recognition (CVPR), 2022
Mengjun Cheng
Yipeng Sun
Long Wang
Xiongwei Zhu
Kun Yao
...
Guoli Song
Junyu Han
Jingtuo Liu
Errui Ding
Jingdong Wang
392
77
0
31 Mar 2022
Knowledge Mining with Scene Text for Fine-Grained Recognition
Knowledge Mining with Scene Text for Fine-Grained RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Hao Wang
Junchao Liao
Tianheng Cheng
Zewen Gao
Hao Liu
Bo Ren
X. Bai
Wenyu Liu
304
14
0
27 Mar 2022
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text
  Recognition in Resource-Poor Languages
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
Shota Orihashi
Yoshihiro Yamazaki
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Ryo Masumura
249
1
0
24 Nov 2021
Scene Text Retrieval via Joint Text Detection and Similarity Learning
Scene Text Retrieval via Joint Text Detection and Similarity LearningComputer Vision and Pattern Recognition (CVPR), 2021
Hao Wang
X. Bai
Mingkun Yang
Shenggao Zhu
Jing Wang
Wenyu Liu
3DV
167
43
0
04 Apr 2021
StacMR: Scene-Text Aware Cross-Modal Retrieval
StacMR: Scene-Text Aware Cross-Modal Retrieval
Andrés Mafla
Rafael Sampaio de Rezende
Lluís Gómez
Diane Larlus
Dimosthenis Karatzas
3DV
242
19
0
08 Dec 2020
Multi-label classification of promotions in digital leaflets using
  textual and visual information
Multi-label classification of promotions in digital leaflets using textual and visual information
R. Arroyo
David Jiménez-Cabello
Javier Martínez-Cebrián
193
3
0
07 Oct 2020
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image
  Classification and Retrieval
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and RetrievalIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
267
31
0
21 Sep 2020
Label or Message: A Large-Scale Experimental Survey of Texts and Objects
  Co-Occurrence
Label or Message: A Large-Scale Experimental Survey of Texts and Objects Co-OccurrenceInternational Conference on Pattern Recognition (ICPR), 2020
Koki Takeshita
Juntaro Shioyama
S. Uchida
182
1
0
30 Jul 2020
ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill
  Identification
ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification
Naoto Usuyama
N. Larios
Amanda K. Hall
Jessica Lundin
254
9
0
28 May 2020
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos
RoadText-1K: Text Detection & Recognition Dataset for Driving Videos
S. Reddy
Minesh Mathew
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
C. V. Jawahar
198
54
0
19 May 2020
Text Recognition in the Wild: A Survey
Text Recognition in the Wild: A Survey
Xiaoxue Chen
Lianwen Jin
Yuanzhi Zhu
Canjie Luo
Tianwei Wang
3DV
417
131
0
07 May 2020
Fine-grained Image Classification and Retrieval by Combining Visual and
  Locally Pooled Textual Features
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual FeaturesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
190
28
0
14 Jan 2020
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
All You Need Is Boundary: Toward Arbitrary-Shaped Text SpottingAAAI Conference on Artificial Intelligence (AAAI), 2019
Hao Wang
Pu Lu
Hui Zhang
Mingkun Yang
X. Bai
Yongchao Xu
Mengchao He
Yongpan Wang
Wenyu Liu
356
144
0
21 Nov 2019
Integration of Text-maps in Convolutional Neural Networks for Region
  Detection among Different Textual Categories
Integration of Text-maps in Convolutional Neural Networks for Region Detection among Different Textual Categories
R. Arroyo
J. Tovar
Francisco J. Delgado
Emilio J. Almazán
Diego G. Serrador
Antonio Hurtado
113
1
0
26 May 2019
Beyond Visual Semantics: Exploring the Role of Scene Text in Image
  Understanding
Beyond Visual Semantics: Exploring the Role of Scene Text in Image UnderstandingPattern Recognition Letters (PR), 2019
Arka Ujjal Dey
Suman K. Ghosh
Ernest Valveny
Gaurav Harit
267
28
0
25 May 2019
A Holistic Representation Guided Attention Network for Scene Text
  Recognition
A Holistic Representation Guided Attention Network for Scene Text Recognition
L. Yang
Yuyang Deng
Peng Wang
Hui Li
Zhen Li
Yanning Zhang
466
37
0
02 Apr 2019
Single Shot Scene Text Retrieval
Single Shot Scene Text Retrieval
Lluís Gómez
Andrés Mafla
Marçal Rusiñol
Dimosthenis Karatzas
210
54
0
27 Aug 2018
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting
  Text with Arbitrary Shapes
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018
Pengyuan Lyu
Minghui Liao
Cong Yao
Wenhao Wu
X. Bai
616
639
0
06 Jul 2018
Multi-Oriented Scene Text Detection via Corner Localization and Region
  Segmentation
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
Pengyuan Lyu
Cong Yao
Wenhao Wu
Shuicheng Yan
X. Bai
278
339
0
25 Feb 2018
Arbitrary-Oriented Scene Text Detection via Rotation Proposals
Arbitrary-Oriented Scene Text Detection via Rotation Proposals
Jianqi Ma
Weiyuan Shao
Hao Ye
Li Wang
Hong Wang
Yingbin Zheng
Xiangyang Xue
579
1,292
0
03 Mar 2017
1
Page 1 of 1