ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.03630
87
11

A Binary Convolutional Encoder-decoder Network for Real-time Natural Scene Text Processing

12 December 2016
Zichuan Liu
Yixing Li
Fengbo Ren
Hao Yu
ArXiv (abs)PDFHTML
Abstract

In this paper, we develop a binary convolutional encoder-decoder network (B-CEDNet) for natural scene text processing (NSTP). It converts a text image to a class-distinguished salience map that reveals the categorical, spatial and morphological information of characters. The existing solutions are either memory consuming or run-time consuming that cannot be applied to real-time applications on resource-constrained devices such as advanced driver assistance systems. The developed network can process multiple regions containing characters by one-off forward operation, and is trained to have binary weights and binary feature maps, which lead to both remarkable inference run-time speedup and memory usage reduction. By training with over 200, 000 synthesis scene text images (size of 32×12832\times12832×128), it can achieve 90%90\%90% and 91%91\%91% pixel-wise accuracy on ICDAR-03 and ICDAR-13 datasets. It only consumes 4.59 ms4.59\ ms4.59 ms inference run-time realized on GPU with a small network size of 2.14 MB, which is up to 8×8\times8× faster and 96%96\%96% smaller than it full-precision version.

View on arXiv
Comments on this paper