ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.09661
  4. Cited By
From Two to One: A New Scene Text Recognizer with Visual Language
  Modeling Network

From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network

22 August 2021
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
    VLM
ArXivPDFHTML

Papers citing "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network"

50 / 86 papers shown
Title
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
14
0
0
11 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
17
0
0
07 May 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Yifei Zhang
Chang-Shu Liu
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
57
1
0
24 Mar 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Andrea Maracani
Savas Ozkan
Sijun Cho
Hyowon Kim
Eunchung Noh
Jeongwon Min
Cho Jung Min
Dookun Park
Mete Ozay
33
0
0
20 Mar 2025
Instruction-Guided Scene Text Recognition
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
64
3
0
03 Jan 2025
Disentanglement and Compositionality of Letter Identity and Letter
  Position in Variational Auto-Encoder Vision Models
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
65
0
0
11 Dec 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu Jiang
80
1
0
24 Nov 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and
  Summarizing
Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Yadong Qu
Yuxin Wang
Bangbang Zhou
Z. Wang
Hongtao Xie
Yongdong Zhang
75
0
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
C. L. P. Chen
31
0
0
18 Nov 2024
Integrating Canonical Neural Units and Multi-Scale Training for
  Handwritten Text Recognition
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition
Zi-Rui Wang
19
0
0
24 Oct 2024
General Detection-based Text Line Recognition
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
49
0
0
25 Sep 2024
One Model for Two Tasks: Cooperatively Recognizing and Recovering
  Low-Resolution Scene Text Images by Iterative Mutual Guidance
One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance
Minyi Zhao
Yang Wang
Jihong Guan
Shuigeng Zhou
22
0
0
22 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
16
1
0
18 Sep 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
21
0
0
04 Aug 2024
Out of Length Text Recognition with Sub-String Matching
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
44
2
0
17 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for
  Scene Text Recognition
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
35
1
0
08 Jul 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive
  Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
27
0
0
15 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for
  Scene Text Recognition
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
40
9
0
09 May 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
51
2
0
19 Apr 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
27
3
0
15 Mar 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and
  Margin Loss
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
20
0
0
12 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
45
1
0
27 Feb 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text
  Recognition
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
16
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
48
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text
  Recognition
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
22
2
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
24
4
0
18 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing
  Fast&Focused-Net with Volume-wise Dot Product Layer
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
17
2
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
63
1
0
15 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
20
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
45
1
0
19 Dec 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text
  Image Super-Resolution
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
11
4
0
29 Nov 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text
  Recognizer
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
16
18
0
22 Nov 2023
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video
  Retrieval
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
P. Li
Hongtao Xie
Jiannan Ge
Lei Zhang
Shaobo Min
Yongdong Zhang
23
17
0
12 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
18
15
0
08 Oct 2023
ImageBind-LLM: Multi-modality Instruction Tuning
ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han
Renrui Zhang
Wenqi Shao
Peng Gao
Peng-Tao Xu
...
Yafei Wen
Xiaoxin Chen
Xiangyu Yue
Hongsheng Li
Yu Qiao
MLLM
19
115
0
07 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
17
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
33
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
P. Wang
Cheng Da
Qi Zheng
Cong Yao
20
15
0
24 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy
  in Transformer
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
19
24
0
20 Aug 2023
UniDoc: A Universal Large Multimodal Model for Simultaneous Text
  Detection, Recognition, Spotting and Understanding
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding
Hao Feng
Zijian Wang
Jingqun Tang
Jinghui Lu
Wen-gang Zhou
Houqiang Li
Can Huang
MLLM
VLM
21
46
0
19 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss
  for Text Recognition: A Simple Yet Effective Approach
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
19
11
0
17 Aug 2023
Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Wenqi Shao
Yutao Hu
Peng Gao
Meng Lei
Kaipeng Zhang
...
Peng-Tao Xu
Siyuan Huang
Hongsheng Li
Yuning Qiao
Ping Luo
VLM
MLLM
22
2
0
07 Aug 2023
One-stage Low-resolution Text Recognition with High-resolution Knowledge
  Transfer
One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer
Han Guo
Tao Dai
Mingyan Zhu
G. MEng
Bin Chen
Zhi Wang
Shutao Xia
19
1
0
05 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
Relational Contrastive Learning for Scene Text Recognition
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
9
4
0
01 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image
  Super-Resolution
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
25
1
0
31 Jul 2023
Count, Decode and Fetch: A New Approach to Handwritten Chinese Character
  Error Correction
Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Pengfei Hu
Jie Ma
Zhenrong Zhang
Jun Du
Jianshu Zhang
8
3
0
30 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text
  Recognition
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
P. Wang
Cong Yao
10
8
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
27
7
0
23 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
13
25
0
17 Jul 2023
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
Abdur Rahman
Arjun Ghosh
Chetan Arora
11
6
0
27 Jun 2023
12
Next