ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06646
  4. Cited By
Synthetic Data for Text Localisation in Natural Images

Synthetic Data for Text Localisation in Natural Images

22 April 2016
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
ArXivPDFHTML

Papers citing "Synthetic Data for Text Localisation in Natural Images"

50 / 580 papers shown
Title
Seeing Text in the Dark: Algorithm and Benchmark
Seeing Text in the Dark: Algorithm and Benchmark
Chengpei Xu
Hao Fu
Long Ma
Wenjing Jia
Chengqi Zhang
Feng Xia
Xiaoyu Ai
Binghao Li
Wenjie Zhang
32
13
0
13 Apr 2024
HRVDA: High-Resolution Visual Document Assistant
HRVDA: High-Resolution Visual Document Assistant
Chaohu Liu
Kun Yin
Haoyu Cao
Xinghua Jiang
Xin Li
Yinsong Liu
Deqiang Jiang
Xing Sun
Linli Xu
VLM
35
23
0
10 Apr 2024
JSTR: Judgment Improves Scene Text Recognition
JSTR: Judgment Improves Scene Text Recognition
Masato Fujitake
36
1
0
09 Apr 2024
Ensemble Learning for Vietnamese Scene Text Spotting in Urban
  Environments
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen
Cong-Hoang Ta
Phuong-Thuy Le-Nguyen
Minh-Triet Tran
Trung-Truc Huynh-Le
32
0
0
01 Apr 2024
Refining Text-to-Image Generation: Towards Accurate Training-Free
  Glyph-Enhanced Image Generation
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Sanyam Lakhanpal
Shivang Chopra
Vinija Jain
Aman Chadha
Man Luo
32
9
0
25 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with
  Pre-trained Language Model
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Yu Zhou
VLM
27
3
0
15 Mar 2024
IndicSTR12: A Dataset for Indic Scene Text Recognition
IndicSTR12: A Dataset for Indic Scene Text Recognition
Harsh Lunia
Ajoy Mondal
C. V. Jawahar
20
2
0
12 Mar 2024
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and
  Margin Loss
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Xuhua Ren
Hengcan Shi
Jin Li
VLM
33
0
0
12 Mar 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text
  Detection and Spotting
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Chen Duan
Pei Fu
Shan Guo
Qianyi Jiang
Xiaoming Wei
VLM
46
5
0
01 Mar 2024
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser
Hao Feng
Wendi Wang
Shaokai Liu
Jiajun Deng
Wen-gang Zhou
Houqiang Li
29
2
0
29 Feb 2024
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Efficiently Leveraging Linguistic Priors for Scene Text Spotting
Nguyen Nguyen
Yapeng Tian
Chenliang Xu
47
1
0
27 Feb 2024
Sequential Visual and Semantic Consistency for Semi-supervised Text
  Recognition
Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
Xiang Bai
30
5
0
24 Feb 2024
Typographic Text Generation with Off-the-Shelf Diffusion Model
Typographic Text Generation with Off-the-Shelf Diffusion Model
KhayTze Peong
Seiichi Uchida
Daichi Haraguchi
DiffM
33
4
0
22 Feb 2024
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Mingkun Yang
Biao Yang
Minghui Liao
Yingying Zhu
X. Bai
VLM
70
10
0
21 Feb 2024
CPN: Complementary Proposal Network for Unconstrained Text Detection
CPN: Complementary Proposal Network for Unconstrained Text Detection
Longhuang Wu
Shangxuan Tian
Youxin Wang
Pengfei Xiong
37
0
0
18 Feb 2024
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual
  Text Processing
Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu
Weichao Zeng
Zhenhang Li
Fangmin Zhao
Yu Zhou
30
3
0
05 Feb 2024
Text Image Inpainting via Global Structure-Guided Diffusion Models
Text Image Inpainting via Global Structure-Guided Diffusion Models
Shipeng Zhu
Pengfei Fang
Chenjie Zhu
Zuoyan Zhao
Qiang Xu
Hui Xue
DiffM
28
4
0
26 Jan 2024
Supervised Fine-tuning in turn Improves Visual Foundation Models
Supervised Fine-tuning in turn Improves Visual Foundation Models
Xiaohu Jiang
Yixiao Ge
Yuying Ge
Dachuan Shi
Chun Yuan
Ying Shan
VLM
CLIP
38
8
0
18 Jan 2024
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text
  Recognition
VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition
Xianfu Cheng
Weixiao Zhou
Xiang Li
Xiaoming Chen
Jian Yang
Tongliang Li
Zhoujun Li
32
2
0
18 Jan 2024
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition
Jinzhi Zheng
Ruyi Ji
Libo Zhang
Yanjun Wu
Chen Zhao
30
4
0
18 Jan 2024
Text Region Multiple Information Perception Network for Scene Text
  Detection
Text Region Multiple Information Perception Network for Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
31
0
0
18 Jan 2024
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text
  Detection
BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection
Jinzhi Zheng
Libo Zhang
Yanjun Wu
Chen Zhao
27
1
0
18 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing
  Fast&Focused-Net with Volume-wise Dot Product Layer
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
28
2
0
18 Jan 2024
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
Mingxin Huang
Dezhi Peng
Hongliang Li
Zhenghao Peng
Chongyu Liu
Dahua Lin
Yuliang Liu
Xiang Bai
Lianwen Jin
72
1
0
15 Jan 2024
An Empirical Study of Scaling Law for OCR
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
33
6
0
29 Dec 2023
Progressive Evolution from Single-Point to Polygon for Scene Text
Progressive Evolution from Single-Point to Polygon for Scene Text
Linger Deng
Mingxin Huang
Xudong Xie
Yuliang Liu
Lianwen Jin
Xiang Bai
29
1
0
21 Dec 2023
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Lingjun Zhang
Xinyuan Chen
Yaohui Wang
Yue Lu
Yu Qiao
DiffM
11
32
0
19 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
59
1
0
19 Dec 2023
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Xiaokang Yang
34
7
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in
  Arbitrary Images via Character-aware Diffusion Models
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
71
27
0
08 Dec 2023
Compression of end-to-end non-autoregressive image-to-speech system for
  low-resourced devices
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices
Gokul Srinivasagan
Michael Deisher
Munir Georges
VLM
19
0
0
30 Nov 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and
  Small Text
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
35
5
0
29 Nov 2023
STR-Cert: Robustness Certification for Deep Text Recognition on Deep
  Learning Pipelines and Vision Transformers
STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers
Daqian Shao
Lukas Fesser
Marta Z. Kwiatkowska
26
0
0
28 Nov 2023
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using
  Diffusion Models
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models
Ling Fu
Zijie Wu
Yingying Zhu
Yuliang Liu
Xiang Bai
26
0
0
28 Nov 2023
Towards Detecting, Recognizing, and Parsing the Address Information from
  Bangla Signboard: A Deep Learning-based Approach
Towards Detecting, Recognizing, and Parsing the Address Information from Bangla Signboard: A Deep Learning-based Approach
Hasan Murad
Mohammed Eunus Ali
11
0
0
22 Nov 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion
  Models
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
27
10
0
16 Nov 2023
Image Generation and Learning Strategy for Deep Document Forgery
  Detection
Image Generation and Learning Strategy for Deep Document Forgery Detection
Yamato Okamoto
Osada Genki
Iu Yahiro
Rintaro Hasegawa
Peifei Zhu
Hirokatsu Kataoka
AAML
31
0
0
07 Nov 2023
On Manipulating Scene Text in the Wild with Diffusion Models
On Manipulating Scene Text in the Wild with Diffusion Models
Joshua Santoso
Christian Simon
Williem Pao
DiffM
24
6
0
01 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
22
5
0
25 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain
  Translation of Dotted Arabic Expiration
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
16
0
0
21 Oct 2023
Deep Aramaic: Towards a Synthetic Data Paradigm Enabling Machine
  Learning in Epigraphy
Deep Aramaic: Towards a Synthetic Data Paradigm Enabling Machine Learning in Epigraphy
Andrei C. Aioanei
R. Hunziker-Rodewald
Konstantin Klein
Dominik L. Michels
25
2
0
11 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text
  Recognition
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
37
15
0
08 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
27
18
0
03 Oct 2023
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards
  Enhancing Text Spotting Performance
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
25
3
0
02 Oct 2023
SCOB: Universal Text Understanding via Character-wise Supervised
  Contrastive Learning with Online Text Rendering for Bridging Domain Gap
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Daehee Kim
Yoon Kim
Donghyun Kim
Yumin Lim
Geewook Kim
Taeho Kil
23
3
0
21 Sep 2023
Kosmos-2.5: A Multimodal Literate Model
Kosmos-2.5: A Multimodal Literate Model
Tengchao Lv
Yupan Huang
Jingye Chen
Lei Cui
Shuming Ma
...
Weiyao Luo
Shaoxiang Wu
Guoxin Wang
Cha Zhang
Furu Wei
VLM
MLLM
23
63
0
20 Sep 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text
  Image Super-Resolution
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
19
11
0
16 Sep 2023
Attention Where It Matters: Rethinking Visual Document Understanding
  with Selective Region Concentration
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
H. Cao
Changcun Bao
Chaohu Liu
Huang-wei Chen
Kun Yin
Hao Liu
Yinsong Liu
Deqiang Jiang
Xing Sun
14
13
0
03 Sep 2023
Selective Scene Text Removal
Selective Scene Text Removal
Hayato Mitani
Akisato Kimura
Seiichi Uchida
21
1
0
01 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
DTrOCR: Decoder-only Transformer for Optical Character Recognition
Masato Fujitake
41
35
0
30 Aug 2023
Previous
12345...101112
Next