Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.15263
Cited By
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
30 November 2021
Byeonghu Na
Yoonsik Kim
Sungrae Park
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features"
7 / 7 papers shown
Title
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
68
3
0
03 Jan 2025
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
37
1
0
08 Jul 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
23
6
0
29 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
52
1
0
19 Dec 2023
Scene Text Recognition Models Explainability Using Local Features
M. Ty
Rowel Atienza
26
1
0
14 Oct 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Han Guo
Tao Dai
G. MEng
Shutao Xia
21
11
0
19 Jul 2023
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
404
594
0
21 Jul 2020
1