Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.09661
Cited By
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
22 August 2021
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network"
50 / 86 papers shown
Title
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
14
0
0
11 May 2025
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
12
0
0
07 May 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Yifei Zhang
Chang-Shu Liu
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
57
1
0
24 Mar 2025
Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
Andrea Maracani
Savas Ozkan
Sijun Cho
Hyowon Kim
Eunchung Noh
Jeongwon Min
Cho Jung Min
Dookun Park
Mete Ozay
33
0
0
20 Mar 2025
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
62
3
0
03 Jan 2025
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
65
0
0
11 Dec 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu Jiang
80
1
0
24 Nov 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Yadong Qu
Yuxin Wang
Bangbang Zhou
Z. Wang
Hongtao Xie
Yongdong Zhang
75
0
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
C. L. P. Chen
31
0
0
18 Nov 2024
Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition
Zi-Rui Wang
19
0
0
24 Oct 2024
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
47
0
0
25 Sep 2024
One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance
Minyi Zhao
Yang Wang
Jihong Guan
Shuigeng Zhou
20
0
0
22 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
16
1
0
18 Sep 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
21
0
0
04 Aug 2024
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
44
2
0
17 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
32
1
0
08 Jul 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
27
0
0
15 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
40
9
0
09 May 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
49
2
0
19 Apr 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
27
3
0
15 Mar 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
20
0
0
12 Mar 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
45
1
0
27 Feb 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
16
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
43
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
22
2
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
22
4
0
18 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
17
2
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
63
1
0
15 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
17
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
43
1
0
19 Dec 2023
PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao
Hui Xue
Pengfei Fang
Shipeng Zhu
DiffM
11
4
0
29 Nov 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
16
18
0
22 Nov 2023
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval
P. Li
Hongtao Xie
Jiannan Ge
Lei Zhang
Shaobo Min
Yongdong Zhang
17
17
0
12 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
18
15
0
08 Oct 2023
ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han
Renrui Zhang
Wenqi Shao
Peng Gao
Peng-Tao Xu
...
Yafei Wen
Xiaoxin Chen
Xiangyu Yue
Hongsheng Li
Yu Qiao
MLLM
16
115
0
07 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
17
4
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
30
35
0
30 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
P. Wang
Cheng Da
Qi Zheng
Cong Yao
18
15
0
24 Aug 2023
ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Mingxin Huang
Jiaxin Zhang
Dezhi Peng
Hao Lu
Can Huang
Yuliang Liu
Xiang Bai
Lianwen Jin
19
24
0
20 Aug 2023
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding
Hao Feng
Zijian Wang
Jingqun Tang
Jinghui Lu
Wen-gang Zhou
Houqiang Li
Can Huang
MLLM
VLM
13
46
0
19 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
16
11
0
17 Aug 2023
Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Wenqi Shao
Yutao Hu
Peng Gao
Meng Lei
Kaipeng Zhang
...
Peng-Tao Xu
Siyuan Huang
Hongsheng Li
Yuning Qiao
Ping Luo
VLM
MLLM
22
2
0
07 Aug 2023
One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer
Han Guo
Tao Dai
Mingyan Zhu
G. MEng
Bin Chen
Zhi Wang
Shutao Xia
19
1
0
05 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
9
4
0
01 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
23
1
0
31 Jul 2023
Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Pengfei Hu
Jie Ma
Zhenrong Zhang
Jun Du
Jianshu Zhang
8
3
0
30 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
P. Wang
Cong Yao
8
8
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
27
7
0
23 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
13
25
0
17 Jul 2023
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
Abdur Rahman
Arjun Ghosh
Chetan Arora
11
6
0
27 Jun 2023
1
2
Next