ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.04400
  4. Cited By
What If We Only Use Real Datasets for Scene Text Recognition? Toward
  Scene Text Recognition With Fewer Labels
v1v2 (latest)

What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels

Computer Vision and Pattern Recognition (CVPR), 2021
7 March 2021
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
ArXiv (abs)PDFHTMLGithub (180★)

Papers citing "What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels"

50 / 57 papers shown
LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition
LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition
Rixin Zhou
Peiqiang Qiu
Qian Zhang
Chuntao Li
Xi Yang
171
0
0
02 Oct 2025
GraDeT-HTR: A Resource-Efficient Bengali Handwritten Text Recognition System utilizing Grapheme-based Tokenizer and Decoder-only Transformer
GraDeT-HTR: A Resource-Efficient Bengali Handwritten Text Recognition System utilizing Grapheme-based Tokenizer and Decoder-only Transformer
Md. Mahmudul Hasan
Ahmed Nesar Tahsin Choudhury
Mahmudul Hasan
Md. Mosaddek Khan
159
1
0
22 Sep 2025
TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
Xiahan Yang
Hui Zheng
VLM
143
1
0
02 Aug 2025
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text SpottingComputer Vision and Pattern Recognition (CVPR), 2025
Dongliang Luo
Hanshen Zhu
Ziyang Zhang
Dingkang Liang
Xudong Xie
Yunxing Liu
Xiang Bai
VLM
381
2
0
14 Apr 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text RecognitionComputer Vision and Pattern Recognition (CVPR), 2025
Yifei Zhang
Yu Xie
Jin Wei
Xiaomeng Yang
Can Ma
Can Ma
Xiangyang Ji
372
10
0
24 Mar 2025
Disentanglement and Compositionality of Letter Identity and Letter
  Position in Variational Auto-Encoder Vision Models
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRLCoGe
423
0
0
11 Dec 2024
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
519
7
0
02 Dec 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and
  Summarizing
Boosting Semi-Supervised Scene Text Recognition via Viewing and SummarizingNeural Information Processing Systems (NeurIPS), 2024
Yadong Qu
Yuxin Wang
Bangbang Zhou
Zihan Wang
Hongtao Xie
Yongdong Zhang
296
5
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Chong Chen
394
1
0
18 Nov 2024
Text Image Generation for Low-Resource Languages with Dual Translation
  Learning
Text Image Generation for Low-Resource Languages with Dual Translation Learning
Chihiro Noguchi
Shun Fukuda
Shoichiro Mihara
Masao Yamanaka
DiffM
259
0
0
26 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text RecognizerACM Multimedia (MM), 2024
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
294
5
0
18 Sep 2024
Rethinking HTG Evaluation: Bridging Generation and Recognition
Rethinking HTG Evaluation: Bridging Generation and Recognition
Konstantina Nikolaidou
George Retsinas
Giorgos Sfikas
Marcus Liwicki
251
7
0
04 Sep 2024
Decoder Pre-Training with only Text for Scene Text Recognition
Decoder Pre-Training with only Text for Scene Text RecognitionACM Multimedia (MM), 2024
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
192
6
0
11 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Self-Supervised Learning for Text Recognition: A Critical SurveyInternational Journal of Computer Vision (IJCV), 2024
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
534
9
0
29 Jul 2024
Visual Text Generation in the Wild
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
279
18
0
19 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for
  Scene Text Recognition
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
339
3
0
08 Jul 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
249
0
0
15 May 2024
JSTR: Judgment Improves Scene Text Recognition
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
264
2
0
09 Apr 2024
Global License Plate Dataset
Global License Plate Dataset
Siddharth Agrawal
202
1
0
22 Mar 2024
IndicSTR12: A Dataset for Indic Scene Text Recognition
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia
Ajoy Mondal
C. V. Jawahar
209
4
0
12 Mar 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text
  Recognition
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
347
6
0
24 Feb 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
565
12
0
29 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
273
4
0
17 Dec 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion
  Models
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
272
25
0
16 Nov 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text RecognitionACM Multimedia (ACM MM), 2023
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
351
30
0
08 Oct 2023
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards
  Enhancing Text Spotting Performance
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting PerformanceIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
372
4
0
02 Oct 2023
SCOB: Universal Text Understanding via Character-wise Supervised
  Contrastive Learning with Online Text Rendering for Bridging Domain Gap
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain GapIEEE International Conference on Computer Vision (ICCV), 2023
Daehee Kim
Yoon Kim
Donghyun Kim
Yumin Lim
Geewook Kim
Taeho Kil
429
4
0
21 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Masato Fujitake
489
65
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text RecognitionIEEE International Conference on Computer Vision (ICCV), 2023
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
277
24
0
24 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss
  for Text Recognition: A Simple Yet Effective Approach
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective ApproachAAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
446
20
0
17 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
Relational Contrastive Learning for Scene Text RecognitionACM Multimedia (ACM MM), 2023
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
292
15
0
01 Aug 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text
  Recognition
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
347
9
0
25 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Revisiting Scene Text Recognition: A Data PerspectiveIEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
460
67
0
17 Jul 2023
DiffusionSTR: Diffusion Model for Scene Text Recognition
DiffusionSTR: Diffusion Model for Scene Text RecognitionInternational Conference on Information Photonics (ICIP), 2023
Masato Fujitake
DiffM
171
8
0
29 Jun 2023
Conditional Text Image Generation with Diffusion Models
Conditional Text Image Generation with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLMDiffM
450
89
0
19 Jun 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained
  Vision-Language Model
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language ModelIEEE Transactions on Image Processing (IEEE TIP), 2023
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIPVLM
429
50
0
23 May 2023
Improving Scene Text Recognition for Character-Level Long-Tailed
  Distribution
Improving Scene Text Recognition for Character-Level Long-Tailed Distribution
S. Park
Sunghyo Chung
Jungsoo Lee
Jaegul Choo
158
3
0
31 Mar 2023
Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition
Diffusion in the Dark: A Diffusion Model for Low-Light Text RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Cindy M. Nguyen
E. R. Chan
Alexander W. Bergman
Gordon Wetzstein
DiffM
448
38
0
07 Mar 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
CLIPTER: Looking at the Bigger Picture in Scene Text RecognitionIEEE International Conference on Computer Vision (ICCV), 2023
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLMCLIP
374
29
0
18 Jan 2023
Indian Commercial Truck License Plate Detection and Recognition for
  Weighbridge Automation
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge AutomationInternational Conference on Mechatronics and Machine Vision in Practice (M2VIP), 2022
Siddharth Agrawal
Keyur D. Joshi
213
5
0
23 Nov 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
Pure Transformer with Integrated Experts for Scene Text RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
287
20
0
09 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Masked Vision-Language Transformers for Scene Text RecognitionBritish Machine Vision Conference (BMVC), 2022
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Zhang
295
5
0
09 Nov 2022
Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for
  Semi-Supervised Text Recognition
Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Gaurav Patel
J. Allebach
Qiang Qiu
UQLM
344
22
0
31 Aug 2022
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily
  Oriented Scene Text Recognition
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Dajian Zhong
Shujing Lyu
P. Shivakumara
Bing Yin
Jiajia Wu
Umapada Pal
Yue Lu
304
24
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence ModelsEuropean Conference on Computer Vision (ECCV), 2022
Darwin Bautista
Rowel Atienza
291
254
0
14 Jul 2022
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated
  Texts
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated TextsEuropean Conference on Computer Vision (ECCV), 2022
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
276
19
0
11 Jul 2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR
  System
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li
Weiwei Liu
Ruoyu Guo
Xiaoyue Yin
Kaitao Jiang
...
Lingfeng Zhu
Baohua Lai
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
469
192
0
07 Jun 2022
Multimodal Semi-Supervised Learning for Text Recognition
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
291
22
0
08 May 2022
Pushing the Performance Limit of Scene Text Recognizer without Human
  Annotation
Pushing the Performance Limit of Scene Text Recognizer without Human AnnotationComputer Vision and Pattern Recognition (CVPR), 2022
Caiyuan Zheng
Hui Li
Seon-Min Rhee
Seungju Han
Jae-Joon Han
Peng Wang
270
14
0
16 Apr 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text
  via Similarity-Aware Normalization
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware NormalizationComputer Vision and Pattern Recognition (CVPR), 2022
Canjie Luo
Lianwen Jin
Jingdong Chen
SSLAI4TS
290
33
0
20 Mar 2022
12
Next
Page 1 of 2