Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1604.06646
Cited By
Synthetic Data for Text Localisation in Natural Images
22 April 2016
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Synthetic Data for Text Localisation in Natural Images"
50 / 607 papers shown
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
440
12
0
29 Dec 2023
Progressive Evolution from Single-Point to Polygon for Scene Text
Linger Deng
Mingxin Huang
Xudong Xie
Yuliang Liu
Lianwen Jin
Xiang Bai
191
1
0
21 Dec 2023
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Lingjun Zhang
Xinyuan Chen
Yaohui Wang
Yue Lu
Yu Qiao
DiffM
249
50
0
19 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Can Ma
DiffM
397
7
0
19 Dec 2023
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
European Conference on Computer Vision (ECCV), 2023
Tongkun Guan
Wei Shen
Xuehang Yang
Xuehui Wang
Yunbo Wang
315
8
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
260
48
0
08 Dec 2023
Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices
Gokul Srinivasagan
Michael Deisher
Munir Georges
VLM
232
0
0
30 Nov 2023
DSText V2: A Comprehensive Video Text Spotting Dataset for Dense and Small Text
Pattern Recognition (Pattern Recogn.), 2023
Weijia Wu
Yiming Zhang
Yefei He
Luoming Zhang
Zhenyu Lou
Hong Zhou
Xiang Bai
240
9
0
29 Nov 2023
STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers
Daqian Shao
Lukas Fesser
Marta Z. Kwiatkowska
192
0
0
28 Nov 2023
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models
Ling Fu
Zijie Wu
Yingying Zhu
Yuliang Liu
Xiang Bai
206
0
0
28 Nov 2023
Towards Detecting, Recognizing, and Parsing the Address Information from Bangla Signboard: A Deep Learning-based Approach
Hasan Murad
Mohammed Eunus Ali
186
0
0
22 Nov 2023
Scene Text Image Super-resolution based on Text-conditional Diffusion Models
Chihiro Noguchi
Shun Fukuda
Masao Yamanaka
DiffM
238
23
0
16 Nov 2023
Image Generation and Learning Strategy for Deep Document Forgery Detection
Yamato Okamoto
Osada Genki
Iu Yahiro
Rintaro Hasegawa
Peifei Zhu
Hirokatsu Kataoka
AAML
242
4
0
07 Nov 2023
On Manipulating Scene Text in the Wild with Diffusion Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Joshua Santoso
Christian Simon
Williem Pao
DiffM
224
8
0
01 Nov 2023
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shangbang Long
Siyang Qin
Yasuhisa Fujii
Alessandro Bissacco
Michalis Raptis
181
9
0
25 Oct 2023
Convolutional Bidirectional Variational Autoencoder for Image Domain Translation of Dotted Arabic Expiration
Ahmed Zidane
Ghada Soliman
124
0
0
21 Oct 2023
Deep Aramaic: Towards a Synthetic Data Paradigm Enabling Machine Learning in Epigraphy
PLoS ONE (PLoS ONE), 2023
Andrei C. Aioanei
R. Hunziker-Rodewald
Konstantin Klein
Dominik L. Michels
274
3
0
11 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
331
27
0
08 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
443
28
0
03 Oct 2023
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Alloy Das
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
Saumik Bhattacharya
324
4
0
02 Oct 2023
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
IEEE International Conference on Computer Vision (ICCV), 2023
Daehee Kim
Yoon Kim
Donghyun Kim
Yumin Lim
Geewook Kim
Taeho Kil
283
4
0
21 Sep 2023
Kosmos-2.5: A Multimodal Literate Model
Tengchao Lv
Yupan Huang
Jingye Chen
Lei Cui
Shuming Ma
...
Weiyao Luo
Shaoxiang Wu
Guoxin Wang
Cha Zhang
Furu Wei
VLM
MLLM
267
91
0
20 Sep 2023
Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
ACM Multimedia (ACM MM), 2023
Wenyu Zhang
Xin Deng
Baojun Jia
Xingtong Yu
Yifan Chen
Jin Ma
Qing Ding
Xinming Zhang
260
14
0
16 Sep 2023
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
IEEE International Conference on Computer Vision (ICCV), 2023
H. Cao
Changcun Bao
Chaohu Liu
Huang-wei Chen
Kun Yin
Hao Liu
Yinsong Liu
Deqiang Jiang
Xing Sun
202
17
0
03 Sep 2023
Selective Scene Text Removal
British Machine Vision Conference (BMVC), 2023
Hayato Mitani
Akisato Kimura
Seiichi Uchida
251
3
0
01 Sep 2023
DTrOCR: Decoder-only Transformer for Optical Character Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Masato Fujitake
442
58
0
30 Aug 2023
Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions
ACM Multimedia (ACM MM), 2022
Yibo Wang
Yunhu Ye
Yuanpeng Mao
Yanwei Yu
Yuanping Song
284
2
0
25 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
238
22
0
24 Aug 2023
MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild
Yu Zeng
J. Hsieh
Xuzhao Li
Ming-Ching Chang
298
20
0
23 Aug 2023
Turning a CLIP Model into a Scene Text Spotter
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Wenwen Yu
Yuliang Liu
Xingkui Zhu
H. Cao
Xing Sun
Xiang Bai
VLM
CLIP
183
20
0
21 Aug 2023
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyin Zhang
Ning Lu
Minghui Liao
Yongshuai Huang
Cheng Li
Min Wang
Wei Peng
393
20
0
17 Aug 2023
Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
ACM Multimedia (ACM MM), 2023
Xugong Qin
Pengyuan Lyu
Chengquan Zhang
Can Ma
Kun Yao
Peng Zhang
Hailun Lin
Weiping Wang
203
21
0
14 Aug 2023
Rapid Training Data Creation by Synthesizing Medical Images for Classification and Localization
A. Kushwaha
Sarthak Gupta
Anish Bhanushali
T. R. Dastidar
MedIm
173
5
0
09 Aug 2023
Relational Contrastive Learning for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Jinglei Zhang
Tiancheng Lin
Yi Xu
Kaibo Chen
Rui Zhang
256
14
0
01 Aug 2023
CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
Zhiwen Shao
Yuchen Su
Yong Zhou
Fanrong Meng
Hancheng Zhu
Bing-Quan Liu
Rui Yao
107
27
0
25 Jul 2023
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Cheng Da
Peng Wang
Cong Yao
264
9
0
25 Jul 2023
Context Perception Parallel Decoder for Scene Text Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yongkun Du
Zhineng Chen
Caiyan Jia
Xiaoyue Yin
Chenxia Li
Yuning Du
Yu-Gang Jiang
301
19
0
23 Jul 2023
Revisiting Scene Text Recognition: A Data Perspective
IEEE International Conference on Computer Vision (ICCV), 2023
Qing-Yuan Jiang
Jiapeng Wang
Dezhi Peng
Chongyu Liu
Lianwen Jin
354
61
0
17 Jul 2023
The mapKurator System: A Complete Pipeline for Extracting and Linking Text from Historical Maps
Jina Kim
Zekun Li
Yijun Lin
Min Namgung
Leeje Jang
Yao-Yi Chiang
211
16
0
29 Jun 2023
DiffusionSTR: Diffusion Model for Scene Text Recognition
International Conference on Information Photonics (ICIP), 2023
Masato Fujitake
DiffM
141
8
0
29 Jun 2023
Weakly Supervised Scene Text Generation for Low-resource Languages
Expert systems with applications (ESWA), 2023
Yangchen Xie
Xinyuan Chen
Hongjian Zhan
Palaiahankote Shivakumara
Bing Yin
Cong Liu
Yue Lu
182
9
0
25 Jun 2023
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
AAAI Conference on Artificial Intelligence (AAAI), 2023
Dezhi Peng
Chongyu Liu
Yuliang Liu
Lianwen Jin
DiffM
210
18
0
21 Jun 2023
Conditional Text Image Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
Yuanzhi Zhu
Zhaohai Li
Tianwei Wang
Mengchao He
Cong Yao
VLM
DiffM
301
84
0
19 Jun 2023
FETNet: Feature Erasing and Transferring Network for Scene Text Removal
Pattern Recognition (Pattern Recogn.), 2023
Guangtao Lyu
Kun Liu
Anna Zhu
S. Uchida
Brian Kenji Iwana
245
20
0
16 Jun 2023
PSSTRNet: Progressive Segmentation-guided Scene Text Removal Network
IEEE International Conference on Multimedia and Expo (ICME), 2022
Guangtao Lyu
Anna Zhu
167
15
0
13 Jun 2023
Looking and Listening: Audio Guided Text Recognition
Wenwen Yu
Mingyu Liu
Biao Yang
Enming Zhang
Deqiang Jiang
Xing Sun
Yuliang Liu
Xiang Bai
DiffM
163
1
0
06 Jun 2023
Bridging the Domain Gap between Synthetic and Real-World Data for Autonomous Driving
Xiangyu Bai
Yedi Luo
Le Jiang
Aniket Gupta
Pushyami Kaveti
H. Singh
Sarah Ostadabbas
291
13
0
05 Jun 2023
Perception and Semantic Aware Regularization for Sequential Confidence Calibration
Computer Vision and Pattern Recognition (CVPR), 2023
Zhenghua Peng
Yuanmao Luo
Tianshui Chen
Keke Xu
Shuangping Huang
AI4TS
294
4
0
31 May 2023
Masked and Permuted Implicit Context Learning for Scene Text Recognition
IEEE Signal Processing Letters (IEEE SPL), 2023
Xiaomeng Yang
Zhi Qiao
Jin Wei
Dongbao Yang
Can Ma
234
8
0
25 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
IEEE Transactions on Image Processing (IEEE TIP), 2023
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
381
45
0
23 May 2023
Previous
1
2
3
4
5
6
...
11
12
13
Next
Page 3 of 13
Page
of 13
Go