SVTR: Scene Text Recognition with a Single Visual Model


30 April 2022
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
arXiv: 2205.00159

Papers citing "SVTR: Scene Text Recognition with a Single Visual Model"

50 / 58 papers shown
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
39
0
0
09 May 2025
Towards Visual Text Grounding of Multimodal Large Language Model
Ming Li
Ruiyi Zhang
Jian Chen
Jiuxiang Gu
Yufan Zhou
Franck Dernoncourt
Wanrong Zhu
Dinesh Manocha
Tong Sun
41
2
0
07 Apr 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Yifei Zhang
Chang-Shu Liu
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
68
2
0
24 Mar 2025
MX-Font++: Mixture of Heterogeneous Aggregation Experts for Few-shot Font Generation
Weihang Wang
Duolin Sun
Jielei Zhang
Longwen Gao
68
0
0
04 Mar 2025
Billet Number Recognition Based on Test-Time Adaptation
Yuan Wei
Xiuzhuang Zhou
82
0
0
13 Feb 2025
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
75
3
0
03 Jan 2025
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
73
0
0
11 Dec 2024
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
110
0
0
02 Dec 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu-Gang Jiang
85
1
0
24 Nov 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Yadong Qu
Yuxin Wang
Bangbang Zhou
Z. Wang
Hongtao Xie
Yongdong Zhang
87
0
0
23 Nov 2024
Learning based Geéz character handwritten recognition
Hailemicael Lulseged Yimer
Hailegabriel Dereje Degefa
Marco Cristani
Federico Cunico
64
0
0
20 Nov 2024
Decoder Pre-Training with only Text for Scene Text Recognition
Shuai Zhao
Yongkun Du
Zhineng Chen
Yu-Gang Jiang
33
0
0
11 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Chaolei Tan
Zihang Lin
Junfu Pu
Zhongang Qi
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
43
0
0
03 Aug 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
Gangyan Zeng
Yuan Zhang
Jin Wei
Dongbao Yang
Peng Zhang
Yiwen Gao
Xugong Qin
Yu Zhou
VLM
CLIP
30
0
0
01 Aug 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
54
2
0
17 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
47
1
0
08 Jul 2024
Large Language Models Lack Understanding of Character Composition of Words
Andrew Shin
Kunitake Kaneko
29
7
0
18 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
42
9
0
09 May 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
34
0
0
01 Apr 2024
Efficient scene text image super-resolution with semantic guidance
LeoWu TomyEnrique
Xiangcheng Du
Kangliang Liu
Han Yuan
Zhao Zhou
Cheng Jin
VLM
31
2
0
20 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
29
3
0
15 Mar 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
32
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
78
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
37
2
0
18 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
38
6
0
29 Dec 2023
Progressive Evolution from Single-Point to Polygon for Scene Text
Linger Deng
Mingxin Huang
Xudong Xie
Yuliang Liu
Lianwen Jin
Xiang Bai
31
1
0
21 Dec 2023
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
Zhihang Liu
Jun Li
Hongtao Xie
Pandeng Li
Jiannan Ge
Sun-Ao Liu
Guoqing Jin
42
18
0
19 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
Cross-Lingual Learning in Multilingual Scene Text Recognition
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
21
0
0
17 Dec 2023
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition
Fatemeh Asadi-zeydabadi
Ali Afkari-Fahandari
Amin Faraji
Elham Shabaninia
Hossein Nezamabadi-pour
21
2
0
02 Dec 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
29
8
0
16 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
47
15
0
08 Oct 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
27
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
46
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
34
15
0
24 Aug 2023
bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents
Imam Mohammad Zulkarnain
Shayekh Bin Islam
Md. Zami Al Zunaed Farabe
Md. Mehedi Hasan Shawon
Jawaril Munshad Abedin
...
Istiak Shihab
Syed Mobassir
Md. Nazmuddoha Ansary
Asif Sushmit
Farig Sadeque
29
2
0
21 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
28
11
0
17 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Yu Zhou
Kun Yao
Peng-Zhen Zhang
Hailun Lin
Weiping Wang
39
12
0
14 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
44
1
0
31 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
31
7
0
23 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
28
39
0
17 Jul 2023
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network
Yuchen Su
Zhineng Chen
Zhiwen Shao
Yuning Du
Zhilong Ji
Jinfeng Bai
Yong Zhou
Yuxi Jiang
28
6
0
27 Jun 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
29
7
0
25 May 2023
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition
Tianlun Zheng
Zhineng Chen
Bin Huang
Wei Zhang
Yuran Jiang
18
11
0
24 May 2023
Quantifying Character Similarity with Vision Transformers
Xinmei Yang
Abhishek Arora
Shao-Yu Jheng
Melissa Dell
24
3
0
24 May 2023
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
Tianlun Zheng
Zhineng Chen
Jinfeng Bai
Hongtao Xie
Yu-Gang Jiang
21
18
0
09 May 2023
Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition
Boqiang Zhang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Yongdong Zhang
32
20
0
09 May 2023
DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents
M. Dhouib
G. Bettaieb
A. Shabou
17
20
0
24 Apr 2023
Linking Representations with Multimodal Contrastive Learning
Abhishek Arora
Xinmei Yang
Shao-Yu Jheng
Melissa Dell
25
1
0
07 Apr 2023
Efficient OCR for Building a Diverse Digital History
Jacob Carlson
Tom Bryan
Melissa Dell
25
11
0
05 Apr 2023