ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06646
  4. Cited By
Synthetic Data for Text Localisation in Natural Images

Synthetic Data for Text Localisation in Natural Images

22 April 2016
Ankush Gupta
Andrea Vedaldi
Andrew Zisserman
ArXivPDFHTML

Papers citing "Synthetic Data for Text Localisation in Natural Images"

50 / 580 papers shown
Title
PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language
PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language
Ijazul Haq
Yingjie Zhang
Irfan Ali Khan
17
0
0
15 May 2025
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Joint Low-level and High-level Textual Representation Learning with Multiple Masking Strategies
Zhengmi Tang
Yuto Mitsui
Tomo Miyazaki
S. Omachi
31
0
0
11 May 2025
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Yan Shu
Weichao Zeng
Fangmin Zhao
Zeyu Chen
Z. Li
...
Paolo Rota
Xiang Bai
Lianwen Jin
Xu-Cheng Yin
N. Sebe
CoGe
52
0
0
30 Apr 2025
Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping
Skip-Vision: Efficient and Scalable Acceleration of Vision-Language Models via Adaptive Token Skipping
Weili Zeng
Ziyuan Huang
Kaixiang Ji
Yichao Yan
VLM
42
1
0
26 Mar 2025
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
Yifei Zhang
Chang-Shu Liu
Jin Wei
Xiaomeng Yang
Yu Zhou
Can Ma
Xiangyang Ji
60
2
0
24 Mar 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
54
0
0
24 Mar 2025
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition
Ritabrata Chakraborty
Shivakumara Palaiahnakote
Umapada Pal
Cheng-Lin Liu
VLM
47
0
0
19 Mar 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
Zining Wang
Tongkun Guan
Pei Fu
Chen Duan
Qianyi Jiang
Zhentao Guo
Shan Guo
Junfeng Luo
Wei-Ming Shen
Xiaokang Yang
MLLM
VLM
69
0
0
18 Mar 2025
Scale Efficient Training for Large Datasets
Scale Efficient Training for Large Datasets
Qing Zhou
Junyu Gao
Qi Wang
DD
67
0
0
17 Mar 2025
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance
Qingpei Guo
Kaiyou Song
Zipeng Feng
Ziping Ma
Qinglong Zhang
...
Yunxiao Sun
Tai-WeiChang
Jingdong Chen
Ming Yang
Jun Zhou
MLLM
VLM
82
3
0
26 Feb 2025
Megrez-Omni Technical Report
Boxun Li
Yadong Li
Z. Li
Congyi Liu
Weilin Liu
...
Dong Zhou
Yueqing Zhuang
Shengen Yan
Guohao Dai
Y. Wang
44
0
0
19 Feb 2025
PLATTER: A Page-Level Handwritten Text Recognition System for Indic Scripts
Badri Vishal Kasuba
Dhruv Kudale
Venkatapathy Subramanian
P. Chaudhuri
Ganesh Ramakrishnan
41
0
0
10 Feb 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
J. T. Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
44
1
0
10 Jan 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Z. Yang
P. Wang
Junyang Lin
X. Wang
Wenyu Liu
DiffM
43
0
0
08 Jan 2025
Instruction-Guided Scene Text Recognition
Instruction-Guided Scene Text Recognition
Yongkun Du
Z. Chen
Yuchen Su
Caiyan Jia
Yu-Gang Jiang
71
3
0
03 Jan 2025
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
103
0
0
02 Dec 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu Jiang
83
1
0
24 Nov 2024
Boosting Semi-Supervised Scene Text Recognition via Viewing and
  Summarizing
Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Yadong Qu
Yuxin Wang
Bangbang Zhou
Z. Wang
Hongtao Xie
Yongdong Zhang
85
0
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
C. L. P. Chen
38
0
0
18 Nov 2024
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and
  Natural Scenes
Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
38
0
0
05 Nov 2024
High-Fidelity Document Stain Removal via A Large-Scale Real-World
  Dataset and A Memory-Augmented Transformer
High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
Mingxian Li
Hao Sun
Yingtie Lei
Xiaofeng Zhang
Yihang Dong
Yilin Zhou
Zimeng Li
Xuhang Chen
35
9
0
30 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
62
24
0
21 Oct 2024
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Yu Zhou
DiffM
22
7
0
14 Oct 2024
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials
Alexander Naumann
Felix Hertlein
Jacqueline Höllig
Lucas Cazzonelli
Steffen Thoma
16
0
0
27 Sep 2024
AI-Powered Augmented Reality for Satellite Assembly, Integration and
  Test
AI-Powered Augmented Reality for Satellite Assembly, Integration and Test
Alvaro Patricio
Joao Valente
Atabak Dehban
Ines Cadilha
Daniel Reis
Rodrigo Ventura
22
1
0
26 Sep 2024
Text Image Generation for Low-Resource Languages with Dual Translation
  Learning
Text Image Generation for Low-Resource Languages with Dual Translation Learning
Chihiro Noguchi
Shun Fukuda
Shoichiro Mihara
Masao Yamanaka
DiffM
26
0
0
26 Sep 2024
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text
  Detection
Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
25
0
0
25 Sep 2024
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Xu Han
Junyu Gao
Chuang Yang
Yuan Yuan
Qi Wang
29
2
0
25 Sep 2024
One Model for Two Tasks: Cooperatively Recognizing and Recovering
  Low-Resolution Scene Text Images by Iterative Mutual Guidance
One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance
Minyi Zhao
Yang Wang
Jihong Guan
Shuigeng Zhou
25
0
0
22 Sep 2024
Leveraging Text Localization for Scene Text Removal via Text-aware
  Masked Image Modeling
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
Zixiao Wang
Hongtao Xie
Yuxin Wang
Yadong Qu
Fengjun Guo
Pengwei Liu
DiffM
31
0
0
20 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text
  Recognizer
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
21
1
0
18 Sep 2024
Platypus: A Generalized Specialist Model for Reading Text in Various
  Forms
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Peng Wang
Zhaohai Li
Jun Tang
Humen Zhong
Fei Huang
Zhibo Yang
Cong Yao
VLM
ObjD
38
2
0
27 Aug 2024
LEGO: Self-Supervised Representation Learning for Scene Text Images
LEGO: Self-Supervised Representation Learning for Scene Text Images
Yujin Ren
Jiaxin Zhang
Lianwen Jin
SSL
29
0
0
04 Aug 2024
Self-Supervised Learning for Text Recognition: A Critical Survey
Self-Supervised Learning for Text Recognition: A Critical Survey
Carlos Peñarrubia
J. J. Valero-Mas
Jorge Calvo-Zaragoza
69
1
0
29 Jul 2024
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
50
2
0
28 Jul 2024
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
Liang Zhao
Qing-Wu Guo
Xiaoguang Li
Song Wang
DiffM
36
0
0
23 Jul 2024
Visual Text Generation in the Wild
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
37
10
0
19 Jul 2024
Out of Length Text Recognition with Sub-String Matching
Out of Length Text Recognition with Sub-String Matching
Yongkun Du
Zhineng Chen
Caiyan Jia
Xieping Gao
Yu-Gang Jiang
49
2
0
17 Jul 2024
Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation
Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation
Filipe Lauar
Valentin Laurent
29
0
0
09 Jul 2024
Focus on the Whole Character: Discriminative Character Modeling for
  Scene Text Recognition
Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition
Bangbang Zhou
Yadong Qu
Zixiao Wang
Zicheng Li
Boqiang Zhang
Hongtao Xie
37
1
0
08 Jul 2024
LOGO: Video Text Spotting with Language Collaboration and Glyph
  Perception Model
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model
Hongen Liu
Di Sun
Jiahao Wang
Yi Liu
Gang Pan
37
0
0
29 May 2024
A General Framework for Jersey Number Recognition in Sports Video
A General Framework for Jersey Number Recognition in Sports Video
Maria Koshkina
James H. Elder
14
4
0
22 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive
  Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
35
0
0
15 May 2024
Self-Supervised Pre-training with Symmetric Superimposition Modeling for
  Scene Text Recognition
Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
Zuan Gao
Yuxin Wang
Yadong Qu
Boqiang Zhang
Zixiao Wang
Jianjun Xu
Hongtao Xie
ViT
40
9
0
09 May 2024
Exploring the Capabilities of Large Multimodal Models on Dense Text
Exploring the Capabilities of Large Multimodal Models on Dense Text
Shuo Zhang
Biao Yang
Zhang Li
Zhiyin Ma
Yuliang Liu
Xiang Bai
VLM
29
7
0
09 May 2024
Synthetic Datasets for Program Similarity Research
Synthetic Datasets for Program Similarity Research
Alex Interrante-Grant
Michael Wang
Lisa Baer
Ryan Whelan
Tim Leek
SyDa
31
1
0
06 May 2024
VimTS: A Unified Video and Image Text Spotter for Enhancing the
  Cross-domain Generalization
VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Yuliang Liu
Mingxin Huang
Hao Yan
Linger Deng
Weijia Wu
Hao Lu
Chunhua Shen
Lianwen Jin
Xiang Bai
27
0
0
30 Apr 2024
MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text
  Detection
MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection
Chengpei Xu
Wenjing Jia
Ruomei Wang
Xiaonan Luo
Xiangjian He
23
0
0
26 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive
  Cross-attention Fusions
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
31
1
0
21 Apr 2024
Simple In-place Data Augmentation for Surveillance Object Detection
Simple In-place Data Augmentation for Surveillance Object Detection
Munkh-Erdene Otgonbold
Ganzorig Batnasan
Munkhjargal Gochoo
31
0
0
17 Apr 2024
1234...101112
Next