Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2112.15093
Cited By
v1
v2 (latest)
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
30 December 2021
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
Re-assign community
ArXiv (abs)
PDF
HTML
Github (477â )
Papers citing
"Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study"
39 / 39 papers shown
Title
Robot-Powered Data Flywheels: Deploying Robots in the Wild for Continual Data Collection and Foundation Model Adaptation
J. Grannen
Michelle Pan
Kenneth Llontop
Cherie Ho
Mark Zolotas
Jeannette Bohg
Dorsa Sadigh
LM&Ro
198
0
0
24 Nov 2025
Restore Text First, Enhance Image Later: Two-Stage Scene Text Image Super-Resolution with Glyph Structure Guidance
Minxing Luo
Linlong Fan
Wang Qiushi
Ge Wu
Yiyan Luo
Yuhang Yu
Jinwei Chen
Y. Wang
Qingnan Fan
Jian Yang
128
1
0
24 Oct 2025
Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement
Xiaogang Xu
Jian Wang
Yunfan Lu
Ruihang Chu
Ruixing Wang
Jiafei Wu
Bei Yu
Liang Lin
DiffM
148
0
0
20 Oct 2025
Enhanced Generative Structure Prior for Chinese Text Image Super-resolution
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Xiaoming Li
Wangmeng Zuo
Chen Change Loy
139
0
0
11 Aug 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Computer Vision and Pattern Recognition (CVPR), 2025
Yifei Zhang
Yu Xie
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
240
7
0
24 Mar 2025
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Chenglu Pan
Xiaohan Li
Ganggui Ding
Yunke Zhang
Wenbo Li
Jiarong Xu
Qingbiao Wu
358
2
0
10 Mar 2025
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription
Benjamin Gutteridge
Matthew Thomas Jackson
Toni Kukurin
Xiaowen Dong
127
0
0
27 Feb 2025
Instruction-Guided Scene Text Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
377
16
0
03 Jan 2025
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu-Gang Jiang
318
16
0
24 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Chong Chen
295
0
0
18 Nov 2024
SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Jing Zhang
Chang-rui Liu
Chun Yang
139
3
0
10 Nov 2024
One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance
Minyi Zhao
Yang Wang
Jihong Guan
Shuigeng Zhou
153
0
0
22 Sep 2024
Extremely Fine-Grained Visual Classification over Resembling Glyphs in the Wild
F. Bougourzi
Fadi Dornaika
Chongsheng Zhang
186
0
0
25 Aug 2024
Decoder Pre-Training with only Text for Scene Text Recognition
ACM Multimedia (MM), 2024
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
134
5
0
11 Aug 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
421
4
0
17 Jul 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Yang Li
Xiaoshuai Sun
Rongrong Ji
VLM
192
3
0
17 Jun 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
182
19
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Zhiqiang Wang
Tongliang Li
Zhoujun Li
274
1
0
18 Jan 2024
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
337
7
0
19 Dec 2023
Diffusion-based Blind Text Image Super-Resolution
Computer Vision and Pattern Recognition (CVPR), 2023
Yuzhe Zhang
Jiawei Zhang
Hao Li
Zhouxia Wang
Luwei Hou
Dongqing Zou
Liheng Bian
244
25
0
13 Dec 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
European Conference on Computer Vision (ECCV), 2023
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
240
93
0
28 Nov 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
ACM Multimedia (ACM MM), 2023
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOT
ViT
196
16
0
09 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning
IEEE International Conference on Computer Vision (ICCV), 2023
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
227
33
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
191
7
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Masato Fujitake
344
52
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
189
20
0
24 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
347
19
0
17 Aug 2023
A Benchmark for Chinese-English Scene Text Image Super-resolution
IEEE International Conference on Computer Vision (ICCV), 2023
Jianqi Ma
Zhetong Liang
Wangmeng Xiang
Xi Yang
Lei Zhang
131
19
0
07 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
211
14
0
01 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
213
2
0
31 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
200
16
0
23 Jul 2023
Perception and Semantic Aware Regularization for Sequential Confidence Calibration
Computer Vision and Pattern Recognition (CVPR), 2023
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
209
3
0
31 May 2023
Collaborative Chinese Text Recognition with Personalized Federated Learning
Shangchao Su
Haiyang Yu
Bin Li
Xiangyang Xue
FedML
188
0
0
09 May 2023
Weakly-Supervised Text Instance Segmentation
ACM Multimedia (ACM MM), 2023
Xinyan Zu
Haiyang Yu
Bin Li
Xiangyang Xue
ISeg
225
6
0
20 Mar 2023
Transferring General Multimodal Pretrained Models to Text Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Junyang Lin
Xuancheng Ren
Yichang Zhang
Gao Liu
Peng Wang
An Yang
Chang Zhou
112
5
0
19 Dec 2022
Chinese Character Recognition with Radical-Structured Stroke Trees
Machine-mediated learning (ML), 2022
Haiyang Yu
Jingye Chen
Bin Li
Xiangyang Xue
159
21
0
24 Nov 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
421
46
0
01 Jun 2022
SVTR: Scene Text Recognition with a Single Visual Model
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
243
215
0
30 Apr 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. FlorĂȘncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
403
470
0
21 Sep 2021
1