An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2015

21 July 2015

Papers citing "An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition"

50 / 680 papers shown

OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models

335

22 Feb 2025

Handwritten Text Recognition: A Survey

Carlos Garrido-Munoz

Antonio Ríos-Vila

Jorge Calvo-Zaragoza

315

12 Feb 2025

PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts

Badri Vishal Kasuba

Dhruv Kudale

Venkatapathy Subramanian

P. Chaudhuri

Ganesh Ramakrishnan

294

10 Feb 2025

SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

354

08 Jan 2025

First-place Solution for Streetscape Shop Sign Recognition Competition

Bin Wang

Li Jing

979

06 Jan 2025

Efficient Video-Based ALPR System Using YOLO and Visual Rhythm

Victor Nascimento Ribeiro

Nina S. T. Hirata

217

04 Jan 2025

Instruction-Guided Scene Text RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

490

03 Jan 2025

Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models

350

11 Dec 2024

TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition

433

02 Dec 2024

DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness

Ahmad Mohammadshirazi

Pinaki Prasad Guha Neogi

Ser-Nam Lim

R. Ramnath

433

29 Nov 2024

SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition

384

24 Nov 2024

Boosting Semi-Supervised Scene Text Recognition via Viewing and SummarizingNeural Information Processing Systems (NeurIPS), 2024

236

23 Nov 2024

Learning based Geéz character handwritten recognition

Hailemicael Lulseged Yimer

Hailegabriel Dereje Degefa

Marco Cristani

Federico Cunico

193

20 Nov 2024

Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition

349

18 Nov 2024

SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2024

Jing Zhang

Chang-rui Liu

Chun Yang

167

10 Nov 2024

HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction

251

02 Nov 2024

Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal AssistantConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

A. S. Penamakuri

Anand Mishra

328

24 Oct 2024

Human-Inspired Long-Term Indoor Localization in Human-Oriented Environment

Nicky Zimmerman

Matteo Sodano

232

16 Oct 2024

ChartKG: A Knowledge-Graph-Based Representation for Chart ImagesIEEE Transactions on Visualization and Computer Graphics (TVCG), 2024

287

13 Oct 2024

Grounding Partially-Defined Events in Multimodal DataConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

171

07 Oct 2024

HATFormer: Historic Handwritten Arabic Text Recognition with Transformers

648

03 Oct 2024

AI-Powered Augmented Reality for Satellite Assembly, Integration and Test

132

26 Sep 2024

Text Image Generation for Low-Resource Languages with Dual Translation Learning

204

26 Sep 2024

General Detection-based Text Line RecognitionNeural Information Processing Systems (NeurIPS), 2024

Raphael Baena

Syrine Kalleli

Mathieu Aubry

981

25 Sep 2024

One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance

Minyi Zhao

Yang Wang

Jihong Guan

Shuigeng Zhou

182

22 Sep 2024

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text RecognizerACM Multimedia (MM), 2024

Humen Zhong

Zhibo Yang

Zhaohai Li

Peng Wang

Jun Tang

Wenqing Cheng

Cong Yao

252

18 Sep 2024

HTR-VT: Handwritten Text Recognition with Vision TransformerPattern Recognition (Pattern Recogn.), 2024

Yuting Li

156

13 Sep 2024

Boosting CNN-based Handwriting Recognition Systems with Learnable Relaxation Labeling

S. Ferro

Alessandro Torcinovich

Arianna Traviglia

Marcello Pelillo

126

09 Sep 2024

PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction

Lei Sheng

Shuai-Shuai Xu

LMTD

200

08 Sep 2024

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

Zhaowei Wang

Yue Yang

102

05 Sep 2024

Platypus: A Generalized Specialist Model for Reading Text in Various FormsEuropean Conference on Computer Vision (ECCV), 2024

Peng Wang

Zhaohai Li

Jun Tang

203

27 Aug 2024

Decoder Pre-Training with only Text for Scene Text RecognitionACM Multimedia (MM), 2024

Shuai Zhao

Yongkun Du

Zhineng Chen

Yu-Gang Jiang

154

11 Aug 2024

Image-to-LaTeX Converter for Mathematical Formulas and Text

Daniil Gurgurov

Aleksey Morshnev

ViT VLM

188

07 Aug 2024

LEGO: Self-Supervised Representation Learning for Scene Text Images

Yujin Ren

Jiaxin Zhang

Lianwen Jin

SSL

252

04 Aug 2024

Self-Supervised Learning for Text Recognition: A Critical SurveyInternational Journal of Computer Vision (IJCV), 2024

Carlos Peñarrubia

J. J. Valero-Mas

Jorge Calvo-Zaragoza

424

29 Jul 2024

Visual Text Generation in the Wild

Fei Huang

240

19 Jul 2024

Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Gagan Bhatia

El Moatez Billah Nagoudi

Fakhraddin Alwajih

Muhammad Abdul-Mageed

179

18 Jul 2024

Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics

328

15 Jul 2024

Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

Xueyao Xiao

189

11 Jul 2024

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

Tongkun Guan

Chengyu Lin

Wei Shen

Xiaokang Yang

267

10 Jul 2024

Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation

Filipe Lauar

Valentin Laurent

142

09 Jul 2024

Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition

266

08 Jul 2024

MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data

Renqing Luo

Yuhan Xu

224

24 Jun 2024

Fusion of Movement and Naive Predictions for Point Forecasting in Univariate Random Walks

Cheng Zhang

148

20 Jun 2024

AnyTrans: Translate AnyText in the Image with Large Scale Models

Xiaoshuai Sun

Rongrong Ji

VLM

240

17 Jun 2024

VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded TextInternational Conference on Learning Representations (ICLR), 2024

Tianyu Zhang

Ge Zhang

261

10 Jun 2024

Classification of Non-native Handwritten Characters Using Convolutional Neural Network

264

06 Jun 2024

Improving Text Generation on Images with Synthetic Captions

339

01 Jun 2024

LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model

270

29 May 2024

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering

Yuliang Liu

175

21 May 2024