2103.04400
Cited By
v1
v2 (latest)
What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels
Computer Vision and Pattern Recognition (CVPR), 2021
7 March 2021
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
Github (180★)
Papers citing
"What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels"
50 / 57 papers shown
LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition
Rixin Zhou
Peiqiang Qiu
Qian Zhang
Chuntao Li
Xi Yang
171
0
0
02 Oct 2025
GraDeT-HTR: A Resource-Efficient Bengali Handwritten Text Recognition System utilizing Grapheme-based Tokenizer and Decoder-only Transformer
Md. Mahmudul Hasan
Ahmed Nesar Tahsin Choudhury
Mahmudul Hasan
Md. Mosaddek Khan
159
1
0
22 Sep 2025
TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
Xiahan Yang
Hui Zheng
VLM
143
1
0
02 Aug 2025
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
Computer Vision and Pattern Recognition (CVPR), 2025
Dongliang Luo
Hanshen Zhu
Ziyang Zhang
Dingkang Liang
Xudong Xie
Yunxing Liu
Xiang Bai
VLM
381
2
0
14 Apr 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Computer Vision and Pattern Recognition (CVPR), 2025
Yifei Zhang
Yu Xie
Jin Wei
Xiaomeng Yang
Can Ma
Xiangyang Ji
372
10
0
24 Mar 2025
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
423
0
0
11 Dec 2024
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
519
7
0
02 Dec 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Neural Information Processing Systems (NeurIPS), 2024
Yadong Qu
Yuxin Wang
Bangbang Zhou
Zihan Wang
Hongtao Xie
Yongdong Zhang
296
5
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Chong Chen
394
1
0
18 Nov 2024
Text Image Generation for Low-Resource Languages with Dual Translation Learning
Chihiro Noguchi
Shun Fukuda
Shoichiro Mihara
Masao Yamanaka
DiffM
259
0
0
26 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
ACM Multimedia (MM), 2024
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
294
5
0
18 Sep 2024
Rethinking HTG Evaluation: Bridging Generation and Recognition
Konstantina Nikolaidou
George Retsinas
Giorgos Sfikas
Marcus Liwicki
251
7
0
04 Sep 2024
Decoder Pre-Training with only Text for Scene Text Recognition
ACM Multimedia (MM), 2024
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
192
6
0
11 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
International Journal of Computer Vision (IJCV), 2024
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
534
9
0
29 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
279
18
0
19 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
339
3
0
08 Jul 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
249
0
0
15 May 2024
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
264
2
0
09 Apr 2024
Global License Plate Dataset
Siddharth Agrawal
202
1
0
22 Mar 2024
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia
Ajoy Mondal
C. V. Jawahar
209
4
0
12 Mar 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
347
6
0
24 Feb 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
565
12
0
29 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
273
4
0
17 Dec 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
272
25
0
16 Nov 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
351
30
0
08 Oct 2023
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
372
4
0
02 Oct 2023
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
IEEE International Conference on Computer Vision (ICCV), 2023
Daehee Kim
Yoon Kim
Donghyun Kim
Yumin Lim
Geewook Kim
Taeho Kil
429
4
0
21 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Masato Fujitake
489
65
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
277
24
0
24 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
446
20
0
17 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
292
15
0
01 Aug 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
347
9
0
25 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
IEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
460
67
0
17 Jul 2023
DiffusionSTR: Diffusion Model for Scene Text Recognition
IEEE International Conference on Image Processing (ICIP), 2023
Masato Fujitake
DiffM
171
8
0
29 Jun 2023
Conditional Text Image Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLM
DiffM
450
89
0
19 Jun 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
IEEE Transactions on Image Processing (IEEE TIP), 2023
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
429
50
0
23 May 2023
Improving Scene Text Recognition for Character-Level Long-Tailed Distribution
S. Park
Sunghyo Chung
Jungsoo Lee
Jaegul Choo
158
3
0
31 Mar 2023
Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Cindy M. Nguyen
E. R. Chan
Alexander W. Bergman
Gordon Wetzstein
DiffM
448
38
0
07 Mar 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLM
CLIP
374
29
0
18 Jan 2023
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge Automation
International Conference on Mechatronics and Machine Vision in Practice (M2VIP), 2022
Siddharth Agrawal
Keyur D. Joshi
213
5
0
23 Nov 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
European Conference on Computer Vision (ECCV), 2022
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
287
20
0
09 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
British Machine Vision Conference (BMVC), 2022
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Zhang
295
5
0
09 Nov 2022
Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Gaurav Patel
J. Allebach
Qiang Qiu
UQLM
344
22
0
31 Aug 2022
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
European Conference on Computer Vision (ECCV), 2022
Dajian Zhong
Shujing Lyu
P. Shivakumara
Bing Yin
Jiajia Wu
Umapada Pal
Yue Lu
304
24
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
European Conference on Computer Vision (ECCV), 2022
Darwin Bautista
Rowel Atienza
291
254
0
14 Jul 2022
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts
European Conference on Computer Vision (ECCV), 2022
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
276
19
0
11 Jul 2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li
Weiwei Liu
Ruoyu Guo
Xiaoyue Yin
Kaitao Jiang
...
Lingfeng Zhu
Baohua Lai
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
469
192
0
07 Jun 2022
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
291
22
0
08 May 2022
Pushing the Performance Limit of Scene Text Recognizer without Human Annotation
Computer Vision and Pattern Recognition (CVPR), 2022
Caiyuan Zheng
Hui Li
Seon-Min Rhee
Seungju Han
Jae-Joon Han
Peng Wang
270
14
0
16 Apr 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
Computer Vision and Pattern Recognition (CVPR), 2022
Canjie Luo
Lianwen Jin
Jingdong Chen
SSL
AI4TS
290
33
0
20 Mar 2022
Page 1 of 2