ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.05717
  4. Cited By
An End-to-End Trainable Neural Network for Image-based Sequence
  Recognition and Its Application to Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
    VLM
ArXiv (abs)PDFHTML

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 681 papers shown
Contrastive Learning of Semantic and Visual Representations for Text Tracking
Zhuang Li
Weijia Wu
Mike Zheng Shou
Jiahong Li
Size Li
Zhongyuan Wang
Hong Zhou
179
12
0
30 Dec 2021
Visual Semantics Allow for Textual Reasoning Better in Scene Text
  Recognition
Visual Semantics Allow for Textual Reasoning Better in Scene Text RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2021
Y. He
Chen Chen
Jing Zhang
Juhua Liu
Fengxiang He
Chaoyue Wang
Bo Du
231
59
0
24 Dec 2021
Image-free multi-character recognition
Image-free multi-character recognition
Huayi Wang
Chunli Zhu
Liheng Bian
89
8
0
20 Dec 2021
Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution
Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution
Jingye Chen
Haiyang Yu
Jianqi Ma
Bin Li
Xiangyang Xue
DiffM
183
56
0
13 Dec 2021
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text
  Spotter with Transformer
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
177
38
0
09 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
221
12
0
02 Dec 2021
OCR-free Document Understanding Transformer
OCR-free Document Understanding Transformer
Geewook Kim
Teakgyu Hong
Moonbin Yim
Jeongyeon Nam
Jinyoung Park
Jinyeong Yim
Wonseok Hwang
Sangdoo Yun
Dongyoon Han
Seunghyun Park
ViT
570
387
0
30 Nov 2021
Multi-modal Text Recognition Networks: Interactive Enhancements between
  Visual and Semantic Features
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Byeonghu Na
Yoonsik Kim
Sungrae Park
318
67
0
30 Nov 2021
Decoupling Visual-Semantic Feature Learning for Robust Scene Text
  Recognition
Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition
Changxu Cheng
Bohan Li
Qi Zheng
Yongpan Wang
Wenyu Liu
92
2
0
24 Nov 2021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text
  Recognition in Resource-Poor Languages
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages
Shota Orihashi
Yoshihiro Yamazaki
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Ryo Masumura
156
1
0
24 Nov 2021
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text
  Recognition
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text RecognitionInternational Journal of Computer Vision (IJCV), 2021
Tianlun Zheng
Zhineng Chen
Shancheng Fang
Hongtao Xie
Yu-Gang Jiang
428
80
0
22 Nov 2021
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Yuefeng Tao
Zhiwei Jia
Runze Ma
Shugong Xu
ViT
195
7
0
16 Nov 2021
Improving Structured Text Recognition with Regular Expression Biasing
Improving Structured Text Recognition with Regular Expression Biasing
Baoguang Shi
W. Cheng
Yijuan Lu
Cha Zhang
D. Florêncio
127
2
0
10 Nov 2021
Video Text Tracking With a Spatio-Temporal Complementary Model
Video Text Tracking With a Spatio-Temporal Complementary Model
Yuzhe Gao
Xing Li
Jiajian Zhang
Yu Zhou
Dian Jin
Jing Wang
Shenggao Zhu
Xiang Bai
288
21
0
09 Nov 2021
Oracle Teacher: Leveraging Target Information for Better Knowledge
  Distillation of CTC Models
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models
J. Yoon
H. Kim
Hyeon Seung Lee
Sunghwan Ahn
N. Kim
492
1
0
05 Nov 2021
Distantly Supervised Semantic Text Detection and Recognition for
  Broadcast Sports Videos Understanding
Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos UnderstandingACM Multimedia (ACM MM), 2021
Avijit Shah
Topojoy Biswas
Sathish Ramadoss
Deven Santosh Shah
188
4
0
31 Oct 2021
TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene
  Text Representation
TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text RepresentationACM Multimedia (ACM MM), 2021
Wei Wang
Can Ma
Jiahao Lv
Dayan Wu
Guoqing Zhao
Ning Jiang
Weiping Wang
223
42
0
25 Oct 2021
Ultra Light OCR Competition Technical Report
Ultra Light OCR Competition Technical Report
Shuhan Zhang
S. Moussa
Ziad El-Khatib
A. B. Mnaouer
3DV
158
0
0
25 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with
  Recurrent Layer Aggregation
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer AggregationNeural Information Processing Systems (NeurIPS), 2021
Jingyu Zhao
Yanwen Fang
Guodong Li
142
29
0
22 Oct 2021
Accurate Fine-grained Layout Analysis for the Historical Tibetan
  Document Based on the Instance Segmentation
Accurate Fine-grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation
Penghai Zhao
Weilan Wang
Zhengqi Cai
Guowei Zhang
Yuqi Lu
144
10
0
15 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech
  Recognition
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
408
290
0
07 Oct 2021
Asking questions on handwritten document collections
Asking questions on handwritten document collections
Minesh Mathew
Lluís Gómez
Dimosthenis Karatzas
C. V. Jawahar
RALM
264
17
0
02 Oct 2021
TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Minghao Li
Tengchao Lv
Jingye Chen
Lei Cui
Yijuan Lu
D. Florêncio
Cha Zhang
Zhoujun Li
Furu Wei
ViT
542
515
0
21 Sep 2021
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text
  Recognition
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text RecognitionACM Multimedia (ACM MM), 2021
Zhi Qiao
Can Ma
Jin Wei
Wei Wang
Yuanqing Zhang
Ning Jiang
Hongbin Wang
Weiping Wang
249
80
0
09 Sep 2021
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
Yuning Du
Chenxia Li
Ruoyu Guo
Cheng Cui
Weiwei Liu
...
Yehua Yang
Qiwen Liu
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
232
80
0
07 Sep 2021
Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark
Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark
Shuhao Qiu
Chuang Zhu
Wenli Zhou
VLMOOD
129
11
0
24 Aug 2021
EKTVQA: Generalized use of External Knowledge to empower Scene Text in
  Text-VQA
EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQAIEEE Access (IEEE Access), 2021
Arka Ujjal Dey
Ernest Valveny
Gaurav Harit
353
4
0
22 Aug 2021
From Two to One: A New Scene Text Recognizer with Visual Language
  Modeling Network
From Two to One: A New Scene Text Recognizer with Visual Language Modeling NetworkIEEE International Conference on Computer Vision (ICCV), 2021
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
247
175
0
22 Aug 2021
Data Augmentation for Scene Text Recognition
Data Augmentation for Scene Text Recognition
Rowel Atienza
187
22
0
16 Aug 2021
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and
  Understanding
MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and UnderstandingACM Multimedia (ACM MM), 2021
Zhanghui Kuang
Hongbin Sun
Zhizhong Li
Xiaoyu Yue
T. Lin
...
Tong Gao
Wenwei Zhang
Kai-xiang Chen
Wayne Zhang
Dahua Lin
VLM
182
84
0
14 Aug 2021
IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text
  Recognition
IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text RecognitionChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2021
Zhiwei Jia
Shugong Xu
Shiyi Mu
Y. Tao
Shan Cao
Zhiyong Chen
142
5
0
13 Aug 2021
VTLayout: Fusion of Visual and Text Features for Document Layout
  Analysis
VTLayout: Fusion of Visual and Text Features for Document Layout AnalysisPacific Rim International Conference on Artificial Intelligence (PRICAI), 2021
Shoubin Li
Xuyan Ma
Shuaiqun Pan
Jun Hu
Lin Shi
Qing Wang
110
10
0
12 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
StrucTexT: Structured Text Understanding with Multi-Modal TransformersACM Multimedia (ACM MM), 2021
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
306
139
0
06 Aug 2021
Why You Should Try the Real Data for the Scene Text Recognition
Why You Should Try the Real Data for the Scene Text Recognition
V. Loginov
139
11
0
29 Jul 2021
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text
  Recognition
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
A. Bhunia
Aneeshan Sain
Amandeep Kumar
S. Ghose
Pinaki Nath Chowdhury
Yi-Zhe Song
239
58
0
26 Jul 2021
Text is Text, No Matter What: Unifying Text Recognition using Knowledge
  Distillation
Text is Text, No Matter What: Unifying Text Recognition using Knowledge DistillationIEEE International Conference on Computer Vision (ICCV), 2021
A. Bhunia
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
204
32
0
26 Jul 2021
Towards the Unseen: Iterative Text Recognition by Distilling from Errors
Towards the Unseen: Iterative Text Recognition by Distilling from ErrorsIEEE International Conference on Computer Vision (ICCV), 2021
A. Bhunia
Pinaki Nath Chowdhury
Aneeshan Sain
Yi-Zhe Song
183
19
0
26 Jul 2021
RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of
  Text Contents and Styles
RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles
Junyeop Lee
Yoonsik Kim
Seonghyeon Kim
Moonbin Yim
Seung Shin
Gayoung Lee
Sungrae Park
DiffM
146
10
0
23 Jul 2021
Scene Text recognition with Full Normalization
Scene Text recognition with Full Normalization
Nathan Zachary
Gerald Carl
Russell Elijah
Hessi Roma
R. Leer
James Amelia
139
0
0
13 Jul 2021
Plot2Spectra: an Automatic Spectra Extraction Tool
Plot2Spectra: an Automatic Spectra Extraction Tool
Weixin Jiang
Eric S. Schwenker
Trevor Spreadbury
Kai Li
Maria K. Y. Chan
O. Cossairt
203
4
0
06 Jul 2021
Text Prior Guided Scene Text Image Super-resolution
Text Prior Guided Scene Text Image Super-resolutionIEEE Transactions on Image Processing (TIP), 2021
Jianqi Ma
Shihao Guo
Lei Zhang
160
87
0
29 Jun 2021
PERT: A Progressively Region-based Network for Scene Text Removal
PERT: A Progressively Region-based Network for Scene Text Removal
Yuxin Wang
Hongtao Xie
Shancheng Fang
Yadong Qu
Yongdong Zhang
188
20
0
24 Jun 2021
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Ilya Krylov
S. Nosov
V. Sovrasov
VLM
188
64
0
23 Jun 2021
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for
  Visual Information Extraction using Sequences
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using SequencesInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Jiapeng Wang
Tianwei Wang
Guozhi Tang
Lianwen Jin
Weihong Ma
Kai Ding
Yichao Huang
186
13
0
20 Jun 2021
Representation and Correlation Enhanced Encoder-Decoder Framework for
  Scene Text Recognition
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2021
Meng Cui
Wei Wang
Jinjin Zhang
Liang Wang
3DV
247
16
0
13 Jun 2021
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text
  Spotter
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text SpotterComputer Vision and Pattern Recognition (CVPR), 2021
Tianwei Wang
Yuanzhi Zhu
Lianwen Jin
Dezhi Peng
Zhe Li
Mengchao He
Yongpan Wang
Canjie Luo
OOD
133
12
0
10 Jun 2021
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text
  Detection and Recognition
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2021
Ryota Yoshihashi
Tomohiro Tanaka
Kenji Doi
Takumi Fujino
Naoaki Yamashita
110
2
0
10 Jun 2021
Convolutional Neural Networks with Gated Recurrent Connections
Convolutional Neural Networks with Gated Recurrent ConnectionsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Jianfeng Wang
Xiaolin Hu
ObjD
177
50
0
05 Jun 2021
Pho(SC)-CTC -- A Hybrid Approach Towards Zero-shot Word Image
  Recognition
Pho(SC)-CTC -- A Hybrid Approach Towards Zero-shot Word Image RecognitionInternational Journal on Document Analysis and Recognition (IJDAR), 2021
Ravindra K Bhatt
Anuj Rai
N. C. Krishnan
S. Chanda
189
3
0
31 May 2021
Spatio-Temporal Dual Graph Neural Networks for Travel Time Estimation
Spatio-Temporal Dual Graph Neural Networks for Travel Time Estimation
G. Jin
Huan Yan
Fuxian Li
Jincai Huang
Yong Li
AI4TS
230
23
0
28 May 2021
Previous
123...789...121314
Next
Page 8 of 14
Pageof 14