ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.05717
  4. Cited By
An End-to-End Trainable Neural Network for Image-based Sequence
  Recognition and Its Application to Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
    VLM
ArXiv (abs)PDFHTML

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 681 papers shown
TRIE++: Towards End-to-End Information Extraction from Visually Rich
  Documents
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
131
14
0
14 Jul 2022
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding
DavarOCR: A Toolbox for OCR and Multi-Modal Document UnderstandingACM Multimedia (ACM MM), 2022
Liang Qiao
Hui Jiang
Ying-Cong Chen
Can Li
Pengfei Li
...
Dashan Guo
Yi Xu
Yunlu Xu
Zhanzhan Cheng
Yi Niu
187
5
0
14 Jul 2022
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text
  Spotting
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text SpottingEuropean Conference on Computer Vision (ECCV), 2022
Ying-Cong Chen
Liang Qiao1
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Xi Li
319
4
0
14 Jul 2022
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated
  Texts
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated TextsEuropean Conference on Computer Vision (ECCV), 2022
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
199
19
0
11 Jul 2022
A Lexicon and Depth-wise Separable Convolution Based Handwritten Text
  Recognition System
A Lexicon and Depth-wise Separable Convolution Based Handwritten Text Recognition SystemImage and Vision Computing New Zealand (IVCNZ), 2022
Lalita Kumari
Sukhdeep Singh
Vvs Rathore
Anuj Sharma
159
5
0
11 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for
  Self-Supervised Text Recognition
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text RecognitionACM Multimedia (ACM MM), 2022
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
364
68
0
01 Jul 2022
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task
  Learning Framework
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning FrameworkInterspeech (Interspeech), 2022
Rahil Parikh
Harshavardhan Sundar
Ming Sun
Chao Wang
Spyros Matsoukas
85
3
0
27 Jun 2022
An Evaluation of OCR on Egocentric Data
An Evaluation of OCR on Egocentric Data
Valentin Popescu
Dima Damen
Toby Perrett
EgoV
161
0
0
11 Jun 2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR
  System
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li
Weiwei Liu
Ruoyu Guo
Xiaoyue Yin
Kaitao Jiang
...
Lingfeng Zhu
Baohua Lai
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
421
163
0
07 Jun 2022
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial
  Vehicles
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Zhenyu Hu
Zhenyu Wu
Pengcheng Pi
Yunhe Xue
Jiayi Shen
Jianchao Tan
Xiangru Lian
Zinan Lin
Ji Liu
146
3
0
05 Jun 2022
HYCEDIS: HYbrid Confidence Engine for Deep Document Intelligence System
HYCEDIS: HYbrid Confidence Engine for Deep Document Intelligence SystemInternational Conference on Neural Information Processing (ICONIP), 2022
Bao-Sinh Nguyen
Q. Tran
Tuan-Anh Dang Nguyen
D. Nguyen
H. Le
154
1
0
01 Jun 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
549
46
0
01 Jun 2022
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher
  Insights for Bangla Handwriting Recognition
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting RecognitionInternational Conference on Pattern Recognition (ICPR), 2022
Md. Ismail Hossain
Mohammed Rakib
Sabbir Mollah
Fuad Rahman
Nabeel Mohammed
191
8
0
23 May 2022
A Comprehensive Handwritten Paragraph Text Recognition System:
  LexiconNet
A Comprehensive Handwritten Paragraph Text Recognition System: LexiconNet
Lalita Kumari
Sukhdeep Singh
V. Rathore
Anuj Sharma
3DV
103
7
0
23 May 2022
MolMiner: You only look once for chemical structure recognition
MolMiner: You only look once for chemical structure recognitionJournal of Chemical Information and Modeling (JCIM), 2022
Youjun Xu
Jinchuan Xiao
Chia-Han Chou
Jianhang Zhang
Jintao Zhu
...
Zhen Zhang
Shuhao Zhang
Weilin Zhang
L. Lai
Jianfeng Pei
159
24
0
23 May 2022
Automated Audio Captioning: An Overview of Recent Progress and New
  Challenges
Automated Audio Captioning: An Overview of Recent Progress and New ChallengesEURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process.), 2022
Xinhao Mei
Xubo Liu
Mark D. Plumbley
Wenwu Wang
299
54
0
12 May 2022
Multimodal Semi-Supervised Learning for Text Recognition
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
265
22
0
08 May 2022
Unified Chinese License Plate Detection and Recognition with High
  Efficiency
Unified Chinese License Plate Detection and Recognition with High EfficiencyJournal of Visual Communication and Image Representation (JVCIR), 2022
Yanxiang Gong
Linjie Deng
Shuai Tao
Xinchen Lu
Peicheng Wu
Zhiwei Xie
Zheng Ma
M. Xie
222
45
0
07 May 2022
SVTR: Scene Text Recognition with a Single Visual Model
SVTR: Scene Text Recognition with a Single Visual ModelInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
298
234
0
30 Apr 2022
Towards Automatic Parsing of Structured Visual Content through the Use
  of Synthetic Data
Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic DataInternational Conference on Pattern Recognition (ICPR), 2022
Lukas Scholch
Jonas Steinhauser
Maximilian Beichter
C. Seibold
Kailun Yang
Merlin Knable
Thorsten Schwarz
Alexander Madche
Rainer Stiefelhagen
58
1
0
29 Apr 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
C3-STISR: Scene Text Image Super-resolution with Triple CluesInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Minyi Zhao
Miaosen Wang
Fan Bai
Bingjia Li
Jie Wang
Shuigeng Zhou
205
44
0
29 Apr 2022
Pushing the Performance Limit of Scene Text Recognizer without Human
  Annotation
Pushing the Performance Limit of Scene Text Recognizer without Human AnnotationComputer Vision and Pattern Recognition (CVPR), 2022
Caiyuan Zheng
Hui Li
Seon-Min Rhee
Seungju Han
Jae-Joon Han
Peng Wang
240
14
0
16 Apr 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
IterVM: Iterative Vision Modeling Module for Scene Text RecognitionInternational Conference on Pattern Recognition (ICPR), 2022
Xiaojie Chu
Yongtao Wang
181
4
0
06 Apr 2022
Text Spotting Transformers
Text Spotting TransformersComputer Vision and Pattern Recognition (CVPR), 2022
Xiang Zhang
Yongwen Su
Subarna Tripathi
Zhuowen Tu
ViT
226
123
0
05 Apr 2022
Unitail: Detecting, Reading, and Matching in Retail Scene
Unitail: Detecting, Reading, and Matching in Retail SceneEuropean Conference on Computer Vision (ECCV), 2022
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
323
12
0
01 Apr 2022
Robust Onboard Localization in Changing Environments Exploiting Text
  Spotting
Robust Onboard Localization in Changing Environments Exploiting Text SpottingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Nicky Zimmerman
Louis Wiesmann
Tiziano Guadagnino
Thomas Labe
Jens Behley
C. Stachniss
191
30
0
23 Mar 2022
DAN: a Segmentation-free Document Attention Network for Handwritten
  Document Recognition
DAN: a Segmentation-free Document Attention Network for Handwritten Document RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Denis Coquenet
Clément Chatelain
Thierry Paquet
366
76
0
23 Mar 2022
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and
  Recognition
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and RecognitionScience China Information Sciences (Sci. China Inf. Sci.), 2022
Wondimu Dikubab
Dingkang Liang
Minghui Liao
Xiang Bai
107
4
0
23 Mar 2022
End-to-End Video Text Spotting with Transformer
End-to-End Video Text Spotting with TransformerInternational Journal of Computer Vision (IJCV), 2022
Weijia Wu
Yuanqiang Cai
Chunhua Shen
Debing Zhang
Ying Fu
Hong Zhou
Ping Luo
ViT
253
32
0
20 Mar 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text
  via Similarity-Aware Normalization
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware NormalizationComputer Vision and Pattern Recognition (CVPR), 2022
Canjie Luo
Lianwen Jin
Jingdong Chen
SSLAI4TS
254
33
0
20 Mar 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text
  Detection and Text Recognition
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
212
138
0
19 Mar 2022
A Text Attention Network for Spatial Deformation Robust Scene Text Image
  Super-resolution
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolutionComputer Vision and Pattern Recognition (CVPR), 2022
Jianqi Ma
Zhetong Liang
Lei Zhang
206
67
0
17 Mar 2022
A Survey of Historical Document Image Datasets
A Survey of Historical Document Image DatasetsInternational Journal on Document Analysis and Recognition (IJDAR), 2022
Konstantina Nikolaidou
Mathias Seuret
Hamam Mokayed
Marcus Liwicki
288
43
0
16 Mar 2022
Training Protocol Matters: Towards Accurate Scene Text Recognition via
  Training Protocol Searching
Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching
Xiaojie Chu
Yongtao Wang
Chunhua Shen
Jingdong Chen
Wei Chu
120
1
0
13 Mar 2022
Towards Open-Set Text Recognition via Label-to-Prototype Learning
Towards Open-Set Text Recognition via Label-to-Prototype LearningPattern Recognition (Pattern Recogn.), 2021
Chang-rui Liu
Chun Yang
Haibo Qin
Xiaobin Zhu
Cheng-Lin Liu
Xu-Cheng Yin
VLM
193
39
0
10 Mar 2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
178
9
0
10 Mar 2022
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text
  Recognition and Document Enhancement
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document EnhancementAAAI Conference on Artificial Intelligence (AAAI), 2022
Mohamed Ali Souibgui
Sanket Biswas
Andrés Mafla
Ali Furkan Biten
Alicia Fornés
Yousri Kessentini
Josep Lladós
Lluís Gómez
Dimosthenis Karatzas
241
28
0
09 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Self-supervised Implicit Glyph Attention for Text RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
489
30
0
07 Mar 2022
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Syntax-Aware Network for Handwritten Mathematical Expression RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Ye Yuan
Xiao-Chang Liu
Wondimu Dikubab
Hui Liu
Zhilong Ji
Zhongqin Wu
X. Bai
422
86
0
03 Mar 2022
SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent
  Entropy for Text Image Recognition
SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image RecognitionInternational Conference on Information Photonics (ICIP), 2022
Yen-Cheng Chang
Yi-Chang Chen
Yu-Chuan Chang
Yi-Ren Yeh
149
7
0
24 Feb 2022
Auxiliary Cross-Modal Representation Learning with Triplet Loss
  Functions for Online Handwriting Recognition
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting RecognitionIEEE Access (IEEE Access), 2022
Felix Ott
David Rügamer
Lucas Heublein
B. Bischl
Christopher Mutschler
445
12
0
16 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Towards Weakly-Supervised Text Spotting using a Multi-Task TransformerComputer Vision and Pattern Recognition (CVPR), 2022
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
238
61
0
11 Feb 2022
AttentionHTR: Handwritten Text Recognition Based on Attention
  Encoder-Decoder Networks
AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder NetworksInternational Workshop on Document Analysis Systems (DAS), 2022
Dmitrijs Kass
Ekta Vats
HAI
275
39
0
23 Jan 2022
Region-based Layout Analysis of Music Score Images
Region-based Layout Analysis of Music Score ImagesExpert systems with applications (ESWA), 2022
Francisco J. Castellanos
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
124
11
0
11 Jan 2022
Towards Boosting the Accuracy of Non-Latin Scene Text Recognition
Towards Boosting the Accuracy of Non-Latin Scene Text Recognition
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
142
6
0
10 Jan 2022
Transfer Learning for Scene Text Recognition in Indian Languages
Transfer Learning for Scene Text Recognition in Indian Languages
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
VLM
207
15
0
10 Jan 2022
Image-based Automatic Dial Meter Reading in Unconstrained Scenarios
Image-based Automatic Dial Meter Reading in Unconstrained Scenarios
Gabriel Salomon
Rayson Laroca
David Menotti
278
22
0
08 Jan 2022
On the Cross-dataset Generalization in License Plate Recognition
On the Cross-dataset Generalization in License Plate RecognitionVISIGRAPP (VISIGRAPP), 2022
Rayson Laroca
Everton VIlhena Cardoso
D. Lucio
Valter Estevam
David Menotti
337
54
0
02 Jan 2022
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
SAFL: A Self-Attention Scene Text Recognizer with Focal LossInternational Conference on Machine Learning and Applications (ICMLA), 2020
Bao Hieu Tran
Le Thanh
Huu Manh Nguyen
Duc Anh Le
T. Nguyen
Phi Le Nguyen
80
3
0
01 Jan 2022
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an
  Empirical Study
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
251
69
0
30 Dec 2021
Previous
123...678...121314
Next
Page 7 of 14
Pageof 14