Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.09661
Cited By
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
22 August 2021
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network"
36 / 86 papers shown
Title
Conditional Text Image Generation with Diffusion Models
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLM
DiffM
62
46
0
19 Jun 2023
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Peng-Tao Xu
Wenqi Shao
Kaipeng Zhang
Peng Gao
Shuo Liu
Meng Lei
Fanqing Meng
Siyuan Huang
Yu Qiao
Ping Luo
ELM
MLLM
23
158
0
15 Jun 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
19
1
0
06 Jun 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Yu Zhou
16
7
0
25 May 2023
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition
Tianlun Zheng
Zhineng Chen
Bin Huang
Wei Zhang
Yuran Jiang
16
11
0
24 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
11
25
0
23 May 2023
On the Hidden Mystery of OCR in Large Multimodal Models
Yuliang Liu
Zhang Li
Mingxin Huang
Chunyuan Li
Dezhi Peng
Mingyu Liu
Lianwen Jin
Xiang Bai
VLM
MLLM
16
47
0
13 May 2023
Collaborative Chinese Text Recognition with Personalized Federated Learning
Shangchao Su
Haiyang Yu
Bin Li
Xiangyang Xue
FedML
11
0
0
09 May 2023
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
Tianlun Zheng
Zhineng Chen
Jinfeng Bai
Hongtao Xie
Yu-Gang Jiang
11
17
0
09 May 2023
Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition
Boqiang Zhang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Yongdong Zhang
24
20
0
09 May 2023
Scene Text Recognition with Image-Text Matching-guided Dictionary
Jiajun Wei
Hongjian Zhan
X. Tu
Yue Lu
Umapada Pal
VLM
17
0
0
08 May 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupR
DiffM
26
24
0
21 Feb 2023
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Aviad Aberdam
David Bensaid
Alona Golts
Roy Ganz
Oren Nuriel
Royee Tichauer
Shai Mazor
Ron Litman
VLM
CLIP
19
11
0
18 Jan 2023
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
Shancheng Fang
Zhendong Mao
Hongtao Xie
Yuxin Wang
C. Yan
Yongdong Zhang
24
52
0
19 Nov 2022
Pure Transformer with Integrated Experts for Scene Text Recognition
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
15
14
0
09 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Andrew Zhang
24
3
0
09 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
17
11
0
01 Nov 2022
Out-of-Vocabulary Challenge Report
Sergi Garcia-Bordils
Andrés Mafla
Ali Furkan Biten
Oren Nuriel
Aviad Aberdam
Shai Mazor
Ron Litman
Dimosthenis Karatzas
9
16
0
14 Sep 2022
Levenshtein OCR
Cheng Da
P. Wang
Cong Yao
ViT
71
32
0
08 Sep 2022
Multi-Granularity Prediction for Scene Text Recognition
P. Wang
Cheng Da
Cong Yao
66
48
0
08 Sep 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
6
167
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
27
55
0
01 Jul 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
22
42
0
01 Jun 2022
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
19
19
0
08 May 2022
SVTR: Scene Text Recognition with a Single Visual Model
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
9
165
0
30 Apr 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
Xiaojie Chu
Yongtao Wang
22
2
0
06 Apr 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
9
97
0
19 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
9
25
0
07 Mar 2022
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
12
55
0
30 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
19
8
0
02 Dec 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
24
53
0
30 Nov 2021
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
Tianlun Zheng
Zhineng Chen
Shancheng Fang
Hongtao Xie
Yu-Gang Jiang
24
51
0
22 Nov 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
90
340
0
21 Sep 2021
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
Oren Nuriel
Sharon Fogel
Ron Litman
14
9
0
09 May 2021
Scene Text Recognition with Sliding Convolutional Character Models
Fei Yin
Yi-Chao Wu
Xu-Yao Zhang
Cheng-Lin Liu
VLM
3DV
48
77
0
06 Sep 2017
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
214
7,687
0
17 Aug 2015
Previous
1
2