ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.05717
  4. Cited By
An End-to-End Trainable Neural Network for Image-based Sequence
  Recognition and Its Application to Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015
21 July 2015
Baoguang Shi
X. Bai
Cong Yao
    VLM
ArXiv (abs)PDFHTML

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 680 papers shown
Phonological Level wav2vec2-based Mispronunciation Detection and
  Diagnosis Method
Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method
M. Shahin
Julien Epps
Beena Ahmed
116
3
0
13 Nov 2023
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and
  In-depth Evaluation
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation
Yongxin Shi
Dezhi Peng
Wenhui Liao
Zening Lin
Xinhong Chen
Chongyu Liu
Yuyi Zhang
Lianwen Jin
MLLM
395
54
0
25 Oct 2023
Adversarial sample generation and training using geometric masks for
  accurate and resilient license plate character recognition
Adversarial sample generation and training using geometric masks for accurate and resilient license plate character recognition
Bishal Shrestha
Griwan Khakurel
Kritika Simkhada
Badri Adhikari
AAML
186
1
0
25 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain
  Translation of Dotted Arabic Expiration
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
120
0
0
21 Oct 2023
EfficientOCR: An Extensible, Open-Source Package for Efficiently
  Digitizing World Knowledge
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge
Tom Bryan
Jacob Carlson
Abhishek Arora
Melissa Dell
197
8
0
16 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text RecognitionACM Multimedia (ACM MM), 2023
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
322
26
0
08 Oct 2023
A Holistic Evaluation of Piano Sound Quality
A Holistic Evaluation of Piano Sound Quality
Monan Zhou
Shangda Wu
Shaohua Ji
Zijin Li
Wei Li
286
0
0
07 Oct 2023
1D-CapsNet-LSTM: A Deep Learning-Based Model for Multi-Step Stock Index
  Forecasting
1D-CapsNet-LSTM: A Deep Learning-Based Model for Multi-Step Stock Index ForecastingJournal of King Saud University: Computer and Information Sciences (JSUCIS), 2023
Cheng Zhang
N. N. Sjarif
Roslina Ibrahim
AIFinAI4TS
263
14
0
03 Oct 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text
  Image Super-Resolution
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-ResolutionACM Multimedia (ACM MM), 2023
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
255
14
0
16 Sep 2023
DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
DeNoising-MOT: Towards Multiple Object Tracking with Severe OcclusionsACM Multimedia (ACM MM), 2023
Teng Fu
Xiaocong Wang
Haiyang Yu
Ke Niu
Bin Li
Xiangyang Xue
VOTViT
237
17
0
09 Sep 2023
Leveraging Model Fusion for Improved License Plate Recognition
Leveraging Model Fusion for Improved License Plate RecognitionIberoamerican Congress on Pattern Recognition (CIARP), 2023
Rayson Laroca
L. A. Zanlorensi
Valter Estevam
Rodrigo Minetto
David Menotti
MoMe
232
11
0
08 Sep 2023
STEP -- Towards Structured Scene-Text Spotting
STEP -- Towards Structured Scene-Text SpottingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Sergi Garcia-Bordils
Dimosthenis Karatzas
Marccal Rusinol
283
2
0
05 Sep 2023
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through
  Image-IDS Aligning
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS AligningIEEE International Conference on Computer Vision (ICCV), 2023
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
VLM
251
35
0
03 Sep 2023
Orientation-Independent Chinese Text Recognition in Scene Images
Orientation-Independent Chinese Text Recognition in Scene ImagesInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Haiyang Yu
Xiaocong Wang
Bin Li
Xiangyang Xue
195
7
0
03 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Masato Fujitake
429
57
0
30 Aug 2023
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph
  Embedding for Improved Correction
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction
Yung-Hsin Chen
Yuli Zhou
162
4
0
29 Aug 2023
Vision Grid Transformer for Document Layout Analysis
Vision Grid Transformer for Document Layout AnalysisIEEE International Conference on Computer Vision (ICCV), 2023
Cheng Da
Chuwei Luo
Qi Zheng
Cong Yao
ViT
234
52
0
29 Aug 2023
High-Resolution Document Shadow Removal via A Large-Scale Real-World
  Dataset and A Frequency-Aware Shadow Erasing Net
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing NetIEEE International Conference on Computer Vision (ICCV), 2023
Zinuo Li
Xuhang Chen
Chi-Man Pun
Xiaodong Cun
501
65
0
27 Aug 2023
Self-supervised Scene Text Segmentation with Object-centric Layered
  Representations Augmented by Text Regions
Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text RegionsACM Multimedia (ACM MM), 2022
Yibo Wang
Yunhu Ye
Yuanpeng Mao
Yanwei Yu
Yuanping Song
269
2
0
25 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text RecognitionIEEE International Conference on Computer Vision (ICCV), 2023
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
232
21
0
24 Aug 2023
Semantic Graph Representation Learning for Handwritten Mathematical
  Expression Recognition
Semantic Graph Representation Learning for Handwritten Mathematical Expression RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Zhuang Liu
Ye Yuan
Zhilong Ji
Jingfeng Bai
X. Bai
164
7
0
21 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss
  for Text Recognition: A Simple Yet Effective Approach
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective ApproachAAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
385
19
0
17 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance
  Representation Learning
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation LearningACM Multimedia (ACM MM), 2023
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Can Ma
Kun Yao
Peng Zhang
Hailun Lin
Weiping Wang
195
20
0
14 Aug 2023
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-ResolutionPattern Recognition (Pattern Recogn.), 2023
Baolin Liu
Zongyuan Yang
Pengfei Wang
Yueze Wang
Ziqi Liu
Ziyi Song
Yan Liu
Yongping Xiong
273
19
0
13 Aug 2023
A Benchmark for Chinese-English Scene Text Image Super-resolution
A Benchmark for Chinese-English Scene Text Image Super-resolutionIEEE International Conference on Computer Vision (ICCV), 2023
Jianqi Ma
Zhetong Liang
Wangmeng Xiang
Xi Yang
Lei Zhang
146
20
0
07 Aug 2023
One-stage Low-resolution Text Recognition with High-resolution Knowledge
  Transfer
One-stage Low-resolution Text Recognition with High-resolution Knowledge TransferACM Multimedia (ACM MM), 2023
Han Guo
Tao Dai
Mingyan Zhu
G. MEng
Bin Chen
Zhi Wang
Shutao Xia
145
5
0
05 Aug 2023
CTP-Net: Character Texture Perception Network for Document Image Forgery
  Localization
CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Xin Liao
Si-ping Chen
Jiaxin Chen
Tianyi Wang
Xiehua Li
89
5
0
04 Aug 2023
HiREN: Towards Higher Supervision Quality for Better Scene Text Image
  Super-Resolution
HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution
Minyi Zhao
Yi Xu
Bingjia Li
Jie Wang
Jihong Guan
Shuigeng Zhou
261
2
0
31 Jul 2023
A Transformer-based Approach for Arabic Offline Handwritten Text
  Recognition
A Transformer-based Approach for Arabic Offline Handwritten Text RecognitionSignal, Image and Video Processing (SIVP), 2023
Saleh Momeni
B. BabaAli
244
24
0
27 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text
  Recognition
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
259
9
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
Context Perception Parallel Decoder for Scene Text RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
284
19
0
23 Jul 2023
Physics-Driven Turbulence Image Restoration with Stochastic Refinement
Physics-Driven Turbulence Image Restoration with Stochastic RefinementIEEE International Conference on Computer Vision (ICCV), 2023
Ajay Jaiswal
Xingguang Zhang
Stanley H. Chan
Zinan Lin
178
32
0
20 Jul 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location
  Enhancement
Towards Robust Scene Text Image Super-resolution via Explicit Location EnhancementInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Han Guo
Tao Dai
G. MEng
Shutao Xia
212
18
0
19 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
Revisiting Scene Text Recognition: A Data PerspectiveIEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
349
60
0
17 Jul 2023
Writer adaptation for offline text recognition: An exploration of neural
  network-based methods
Writer adaptation for offline text recognition: An exploration of neural network-based methods
Tobias van der Werff
Maruf A. Dhali
Lambert Schomaker
194
1
0
11 Jul 2023
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep
  Learning-Based Electrocardiogram Digitization
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram DigitizationPhysiological Measurement (PM), 2023
Kshama Kodthalu Shivashankara
Deepanshi
Afagh Mehri Shervedani
Gari D. Clifford
Matthew A. Reyna
Reza Sameni
MedIm
358
50
0
04 Jul 2023
CNN-BiLSTM model for English Handwriting Recognition: Comprehensive
  Evaluation on the IAM Dataset
CNN-BiLSTM model for English Handwriting Recognition: Comprehensive Evaluation on the IAM Dataset
Firat Kizilirmak
Berrin Yanikoglu
200
10
0
02 Jul 2023
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to
  Estimate the Check-Worthiness of Multi-Modal Tweets
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets
R. Frick
Inna Vogel
49
1
0
02 Jul 2023
DiffusionSTR: Diffusion Model for Scene Text Recognition
DiffusionSTR: Diffusion Model for Scene Text RecognitionInternational Conference on Information Photonics (ICIP), 2023
Masato Fujitake
DiffM
126
7
0
29 Jun 2023
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
UTRNet: High-Resolution Urdu Text Recognition In Printed DocumentsIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Abdur Rahman
Arjun Ghosh
Chetan Arora
214
9
0
27 Jun 2023
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep
  Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos
  Theory
The Deep Arbitrary Polynomial Chaos Neural Network or how Deep Artificial Neural Networks could benefit from Data-Driven Homogeneous Chaos TheoryNeural Networks (Neural Netw.), 2023
S. Oladyshkin
T. Praditia
Ilja Kroker
F. Mohammadi
Wolfgang Nowak
S. Otte
AI4CE
159
6
0
26 Jun 2023
Resume Information Extraction via Post-OCR Text Processing
Resume Information Extraction via Post-OCR Text Processing
Selahattin Serdar Helli
Senem Tanberk
Sena Nur Cavsak
78
2
0
23 Jun 2023
Document Image Cleaning using Budget-Aware Black-Box Approximation
Document Image Cleaning using Budget-Aware Black-Box Approximation
Ganesh Tata
Katyani Singh
E. V. Oeveren
Nilanjan Ray
AAML
120
0
0
22 Jun 2023
Conditional Text Image Generation with Diffusion Models
Conditional Text Image Generation with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLMDiffM
288
83
0
19 Jun 2023
Looking and Listening: Audio Guided Text Recognition
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
155
1
0
06 Jun 2023
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
  Document Images
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document ImagesIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Wenwen Yu
Chengquan Zhang
H. Cao
Wei Hua
Bohan Li
...
Hao Fei
Dimosthenis Karatzas
Xingchao Sun
Jingdong Wang
Xiang Bai
194
18
0
05 Jun 2023
ESTISR: Adapting Efficient Scene Text Image Super-resolution for
  Real-Scenes
ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes
Minghao Fu
Xin Man
Yihan Xu
Jie Shao
185
2
0
04 Jun 2023
Perception and Semantic Aware Regularization for Sequential Confidence
  Calibration
Perception and Semantic Aware Regularization for Sequential Confidence CalibrationComputer Vision and Pattern Recognition (CVPR), 2023
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
285
3
0
31 May 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
Masked and Permuted Implicit Context Learning for Scene Text RecognitionIEEE Signal Processing Letters (IEEE SPL), 2023
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Can Ma
218
8
0
25 May 2023
MRN: Multiplexed Routing Network for Incremental Multilingual Text
  Recognition
MRN: Multiplexed Routing Network for Incremental Multilingual Text RecognitionIEEE International Conference on Computer Vision (ICCV), 2023
Tianlun Zheng
Zhineng Chen
Bin Huang
Wei Zhang
Yuran Jiang
354
15
0
24 May 2023
Previous
12345...121314
Next