Primitive Representation Learning for Scene Text Recognition


10 May 2021
Ruijie Yan
Liangrui Peng
Shanyu Xiao
Gang Yao

Papers citing "Primitive Representation Learning for Scene Text Recognition"

29 / 29 papers shown
DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation
Naphat Nithisopa
Teerapong Panboonyuen
ViT
22
0
0
07 May 2025
Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models
Bruno Bianchi
Aakash Agrawal
S. Dehaene
Emmanuel Chemla
Yair Lakretz
DRL
CoGe
68
0
0
11 Dec 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
33
0
0
12 Mar 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
24
5
0
24 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
63
10
0
21 Feb 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
32
2
0
18 Jan 2024
On Manipulating Scene Text in the Wild with Diffusion Models
Joshua Santoso
Christian Simon
Williem Pao
DiffM
24
6
0
01 Nov 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
35
35
0
30 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
21
11
0
17 Aug 2023
Collaborative Chinese Text Recognition with Personalized Federated Learning
Shangchao Su
Haiyang Yu
Bin Li
Xiangyang Xue
FedML
11
0
0
09 May 2023
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
Tianlun Zheng
Zhineng Chen
Jinfeng Bai
Hongtao Xie
Yu-Gang Jiang
19
18
0
09 May 2023
Text2shape Deep Retrieval Model: Generating Initial Cases for Mechanical Part Redesign under the Context of Case-Based Reasoning
Tianshuo Zang
Maolin Yang
Wentao Yong
Pingyu Jiang
3DV
8
4
0
13 Feb 2023
Pure Transformer with Integrated Experts for Scene Text Recognition
Yew Lee Tan
A. Kong
Jung-jae Kim
ViT
20
14
0
09 Nov 2022
Masked Vision-Language Transformers for Scene Text Recognition
Jie Wu
Ying Peng
Shenmin Zhang
Weigang Qi
Jian Andrew Zhang
27
3
0
09 Nov 2022
Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan
Wei Shen
Xuehang Yang
Qi Feng
Zekun Jiang
Xiaokang Yang
17
26
0
01 Nov 2022
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
18
7
0
27 Sep 2022
SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
Dajian Zhong
Shujing Lyu
P. Shivakumara
Bing Yin
Jiajia Wu
Umapada Pal
Yue Lu
24
20
0
21 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
19
168
0
14 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
27
55
0
01 Jul 2022
SVTR: Scene Text Recognition with a Single Visual Model
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
11
170
0
30 Apr 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
Xiaojie Chu
Yongtao Wang
25
2
0
06 Apr 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
17
25
0
07 Mar 2022
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
Y. He
Chen Chen
Jing Zhang
Juhua Liu
Fengxiang He
Chaoyue Wang
Bo Du
26
55
0
24 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
24
8
0
02 Dec 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
32
54
0
30 Nov 2021
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
Tianlun Zheng
Zhineng Chen
Shancheng Fang
Hongtao Xie
Yu-Gang Jiang
26
51
0
22 Nov 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
93
340
0
21 Sep 2021
I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition
Chuhui Xue
Jiaxing Huang
Wenqing Zhang
Shijian Lu
Changhu Wang
S. Bai
21
16
0
18 May 2021
Graph-Based Global Reasoning Networks
Yunpeng Chen
Marcus Rohrbach
Zhicheng Yan
Shuicheng Yan
Jiashi Feng
Yannis Kalantidis
GNN
NAI
255
456
0
30 Nov 2018