Rosetta: Large scale system for text detection and recognition in images

11 October 2019
Fedor Borisyuk
Albert Gordo
V. Sivakumar

Papers citing "Rosetta: Large scale system for text detection and recognition in images"

22 / 22 papers shown
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
33
6
0
29 Dec 2023
Making the V in Text-VQA Matter
Shamanthak Hegde
Soumya Jahagirdar
Shankar Gangisetty
CoGe
29
4
0
01 Aug 2023
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonçalves de Aguiar Neto
C. F. G. Santos
30
22
0
18 May 2023
Weakly-supervised Fingerspelling Recognition in British Sign Language Videos
Prajwal K R
Hannah Bull
Liliane Momeni
Samuel Albanie
Gül Varol
Andrew Zisserman
21
14
0
16 Nov 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
Yushi Hu
Hang Hua
Zhengyuan Yang
Weijia Shi
Noah A. Smith
Jiebo Luo
40
101
0
15 Nov 2022
DM$^2$S$^2$: Deep Multi-Modal Sequence Sets with Hierarchical Modality Attention
Shunsuke Kitada
Yuki Iwazaki
Riku Togashi
Hitoshi Iyatomi
21
1
0
07 Sep 2022
SVTR: Scene Text Recognition with a Single Visual Model
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
11
170
0
30 Apr 2022
On the Cross-dataset Generalization in License Plate Recognition
Rayson Laroca
Everton Vilhena Cardoso
D. Lucio
Valter Estevam
David Menotti
19
42
0
02 Jan 2022
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
DiffM
19
46
0
13 Dec 2021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
Shota Orihashi
Yoshihiro Yamazaki
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Ryo Masumura
25
1
0
24 Nov 2021
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models
J. Yoon
H. Kim
Hyeon Seung Lee
Sunghwan Ahn
N. Kim
28
1
0
05 Nov 2021
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu
Zhenhua Fan
Yansen Wang
Jean Oh
Carolyn Rose
21
27
0
20 Aug 2021
Data Augmentation for Scene Text Recognition
Rowel Atienza
16
19
0
16 Aug 2021
Vision Transformer for Fast and Efficient Scene Text Recognition
Rowel Atienza
ViT
11
144
0
18 May 2021
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
Joakim Bruslund Haurum
T. Moeslund
18
60
0
19 Mar 2021
Revisiting Classification Perspective on Scene Text Recognition
Hongxiang Cai
Jun Sun
Yichao Xiong
16
10
0
22 Feb 2021
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
34
22
0
16 Nov 2020
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Andrés Mafla
S. Dey
Ali Furkan Biten
Lluís Gómez
Dimosthenis Karatzas
19
25
0
21 Sep 2020
SCATTER: Selective Context Attentional Scene Text Recognizer
Ron Litman
Oron Anschel
Shahar Tsiper
R. Litman
Shai Mazor
R. Manmatha
16
132
0
25 Mar 2020
TextCaps: a Dataset for Image Captioning with Reading Comprehension
Oleksii Sidorov
Ronghang Hu
Marcus Rohrbach
Amanpreet Singh
20
386
0
24 Mar 2020
Scene Text Recognition with Sliding Convolutional Character Models
Fei Yin
Yi-Chao Wu
Xu-Yao Zhang
Cheng-Lin Liu
VLM
3DV
56
77
0
06 Sep 2017
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
185
515
0
26 Jan 2016