Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1507.05717
Cited By
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"
50 / 681 papers shown
TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents
Zhanzhan Cheng
Peng Zhang
Can Li
Qiao Liang
Yunlu Xu
Pengfei Li
Shiliang Pu
Yi Niu
Fei Wu
161
15
0
14 Jul 2022
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding
ACM Multimedia (ACM MM), 2022
Liang Qiao
Hui Jiang
Ying-Cong Chen
Can Li
Pengfei Li
...
Dashan Guo
Yi Xu
Yunlu Xu
Zhanzhan Cheng
Yi Niu
249
5
0
14 Jul 2022
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting
European Conference on Computer Vision (ECCV), 2022
Ying-Cong Chen
Liang Qiao1
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Xi Li
347
4
0
14 Jul 2022
COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts
European Conference on Computer Vision (ECCV), 2022
Jeonghun Baek
Yusuke Matsui
Kiyoharu Aizawa
250
19
0
11 Jul 2022
A Lexicon and Depth-wise Separable Convolution Based Handwritten Text Recognition System
Image and Vision Computing New Zealand (IVCNZ), 2022
Lalita Kumari
Sukhdeep Singh
Vvs Rathore
Anuj Sharma
191
6
0
11 Jul 2022
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
ACM Multimedia (ACM MM), 2022
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
429
71
0
01 Jul 2022
Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework
Interspeech (Interspeech), 2022
Rahil Parikh
Harshavardhan Sundar
Ming Sun
Chao Wang
Spyros Matsoukas
109
3
0
27 Jun 2022
An Evaluation of OCR on Egocentric Data
Valentin Popescu
Dima Damen
Toby Perrett
EgoV
174
0
0
11 Jun 2022
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Chenxia Li
Weiwei Liu
Ruoyu Guo
Xiaoyue Yin
Kaitao Jiang
...
Lingfeng Zhu
Baohua Lai
Xiaoguang Hu
Dianhai Yu
Yanjun Ma
465
179
0
07 Jun 2022
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Zhenyu Hu
Zhenyu Wu
Pengcheng Pi
Yunhe Xue
Jiayi Shen
Jianchao Tan
Xiangru Lian
Zinan Lin
Ji Liu
180
3
0
05 Jun 2022
HYCEDIS: HYbrid Confidence Engine for Deep Document Intelligence System
International Conference on Neural Information Processing (ICONIP), 2022
Bao-Sinh Nguyen
Q. Tran
Tuan-Anh Dang Nguyen
D. Nguyen
H. Le
222
1
0
01 Jun 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
583
47
0
01 Jun 2022
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition
International Conference on Pattern Recognition (ICPR), 2022
Md. Ismail Hossain
Mohammed Rakib
Sabbir Mollah
Fuad Rahman
Nabeel Mohammed
216
9
0
23 May 2022
A Comprehensive Handwritten Paragraph Text Recognition System: LexiconNet
Lalita Kumari
Sukhdeep Singh
V. Rathore
Anuj Sharma
3DV
150
7
0
23 May 2022
MolMiner: You only look once for chemical structure recognition
Journal of Chemical Information and Modeling (JCIM), 2022
Youjun Xu
Jinchuan Xiao
Chia-Han Chou
Jianhang Zhang
Jintao Zhu
...
Zhen Zhang
Shuhao Zhang
Weilin Zhang
L. Lai
Jianfeng Pei
207
25
0
23 May 2022
Automated Audio Captioning: An Overview of Recent Progress and New Challenges
EURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process.), 2022
Xinhao Mei
Xubo Liu
Mark D. Plumbley
Wenwu Wang
399
55
0
12 May 2022
Multimodal Semi-Supervised Learning for Text Recognition
Aviad Aberdam
Roy Ganz
Shai Mazor
Ron Litman
VLM
287
22
0
08 May 2022
Unified Chinese License Plate Detection and Recognition with High Efficiency
Journal of Visual Communication and Image Representation (JVCIR), 2022
Yanxiang Gong
Linjie Deng
Shuai Tao
Xinchen Lu
Peicheng Wu
Zhiwei Xie
Zheng Ma
M. Xie
238
45
0
07 May 2022
SVTR: Scene Text Recognition with a Single Visual Model
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Tianlun Zheng
Chenxia Li
Yuning Du
Yu-Gang Jiang
335
259
0
30 Apr 2022
Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data
International Conference on Pattern Recognition (ICPR), 2022
Lukas Scholch
Jonas Steinhauser
Maximilian Beichter
C. Seibold
Kailun Yang
Merlin Knable
Thorsten Schwarz
Alexander Madche
Rainer Stiefelhagen
123
2
0
29 Apr 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Minyi Zhao
Miaosen Wang
Fan Bai
Bingjia Li
Jie Wang
Shuigeng Zhou
234
47
0
29 Apr 2022
Pushing the Performance Limit of Scene Text Recognizer without Human Annotation
Computer Vision and Pattern Recognition (CVPR), 2022
Caiyuan Zheng
Hui Li
Seon-Min Rhee
Seungju Han
Jae-Joon Han
Peng Wang
264
14
0
16 Apr 2022
IterVM: Iterative Vision Modeling Module for Scene Text Recognition
International Conference on Pattern Recognition (ICPR), 2022
Xiaojie Chu
Yongtao Wang
220
5
0
06 Apr 2022
Text Spotting Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
Xiang Zhang
Yongwen Su
Subarna Tripathi
Zhuowen Tu
ViT
260
125
0
05 Apr 2022
Unitail: Detecting, Reading, and Matching in Retail Scene
European Conference on Computer Vision (ECCV), 2022
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
353
12
0
01 Apr 2022
Robust Onboard Localization in Changing Environments Exploiting Text Spotting
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Nicky Zimmerman
Louis Wiesmann
Tiziano Guadagnino
Thomas Labe
Jens Behley
C. Stachniss
265
32
0
23 Mar 2022
DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Denis Coquenet
Clément Chatelain
Thierry Paquet
405
81
0
23 Mar 2022
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Wondimu Dikubab
Dingkang Liang
Minghui Liao
Xiang Bai
125
5
0
23 Mar 2022
End-to-End Video Text Spotting with Transformer
International Journal of Computer Vision (IJCV), 2022
Weijia Wu
Yuanqiang Cai
Chunhua Shen
Debing Zhang
Ying Fu
Hong Zhou
Ping Luo
ViT
282
34
0
20 Mar 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
Computer Vision and Pattern Recognition (CVPR), 2022
Canjie Luo
Lianwen Jin
Jingdong Chen
SSL
AI4TS
285
33
0
20 Mar 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
247
142
0
19 Mar 2022
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
Computer Vision and Pattern Recognition (CVPR), 2022
Jianqi Ma
Zhetong Liang
Lei Zhang
228
72
0
17 Mar 2022
A Survey of Historical Document Image Datasets
International Journal on Document Analysis and Recognition (IJDAR), 2022
Konstantina Nikolaidou
Mathias Seuret
Hamam Mokayed
Marcus Liwicki
401
46
0
16 Mar 2022
Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching
Xiaojie Chu
Yongtao Wang
Chunhua Shen
Jingdong Chen
Wei Chu
140
1
0
13 Mar 2022
Towards Open-Set Text Recognition via Label-to-Prototype Learning
Pattern Recognition (Pattern Recogn.), 2021
Chang-rui Liu
Chun Yang
Haibo Qin
Xiaobin Zhu
Cheng-Lin Liu
Xu-Cheng Yin
VLM
256
42
0
10 Mar 2022
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting
Seonghyeon Kim
Seung Shin
Yoonsik Kim
Han-Cheol Cho
Taeho Kil
Jaeheung Surh
Seunghyun Park
Bado Lee
Youngmin Baek
212
9
0
10 Mar 2022
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement
AAAI Conference on Artificial Intelligence (AAAI), 2022
Mohamed Ali Souibgui
Sanket Biswas
Andrés Mafla
Ali Furkan Biten
Alicia Fornés
Yousri Kessentini
Josep Lladós
Lluís Gómez
Dimosthenis Karatzas
285
30
0
09 Mar 2022
Self-supervised Implicit Glyph Attention for Text Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Tongkun Guan
Chaochen Gu
Jingzheng Tu
Xuehang Yang
Qi Feng
Yudi Zhao
Xiaokang Yang
Wei Shen
539
34
0
07 Mar 2022
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Ye Yuan
Xiao-Chang Liu
Wondimu Dikubab
Hui Liu
Zhilong Ji
Zhongqin Wu
X. Bai
497
96
0
03 Mar 2022
SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
International Conference on Information Photonics (ICIP), 2022
Yen-Cheng Chang
Yi-Chang Chen
Yu-Chuan Chang
Yi-Ren Yeh
166
7
0
24 Feb 2022
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition
IEEE Access (IEEE Access), 2022
Felix Ott
David Rügamer
Lucas Heublein
B. Bischl
Christopher Mutschler
485
13
0
16 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Computer Vision and Pattern Recognition (CVPR), 2022
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
265
61
0
11 Feb 2022
AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
International Workshop on Document Analysis Systems (DAS), 2022
Dmitrijs Kass
Ekta Vats
HAI
298
43
0
23 Jan 2022
Region-based Layout Analysis of Music Score Images
Expert systems with applications (ESWA), 2022
Francisco J. Castellanos
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
233
11
0
11 Jan 2022
Towards Boosting the Accuracy of Non-Latin Scene Text Recognition
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
178
6
0
10 Jan 2022
Transfer Learning for Scene Text Recognition in Indian Languages
Sanjana Gunna
Rohit Saluja
C. V. Jawahar
VLM
268
16
0
10 Jan 2022
Image-based Automatic Dial Meter Reading in Unconstrained Scenarios
Gabriel Salomon
Rayson Laroca
David Menotti
309
25
0
08 Jan 2022
On the Cross-dataset Generalization in License Plate Recognition
VISIGRAPP (VISIGRAPP), 2022
Rayson Laroca
Everton VIlhena Cardoso
D. Lucio
Valter Estevam
David Menotti
376
57
0
02 Jan 2022
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
International Conference on Machine Learning and Applications (ICMLA), 2020
Bao Hieu Tran
Le Thanh
Huu Manh Nguyen
Duc Anh Le
T. Nguyen
Phi Le Nguyen
102
3
0
01 Jan 2022
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
269
74
0
30 Dec 2021
Previous
1
2
3
...
6
7
8
...
12
13
14
Next
Page 7 of 14
Page
of 14
Go