Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.12106
Cited By
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
21 June 2023
Dezhi Peng
Chongyu Liu
Yuliang Liu
Lianwen Jin
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining"
7 / 7 papers shown
Title
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
Zixiao Wang
Hongtao Xie
Yuxin Wang
Yadong Qu
Fengjun Guo
Pengwei Liu
DiffM
26
0
0
20 Sep 2024
DeepEraser: Deep Iterative Context Mining for Generic Text Eraser
Hao Feng
Wendi Wang
Shaokai Liu
Jiajun Deng
Wen-gang Zhou
Houqiang Li
29
2
0
29 Feb 2024
Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Mingkun Yang
Minghui Liao
Pu Lu
Jing Wang
Shenggao Zhu
Hualin Luo
Qingzhen Tian
X. Bai
SSL
27
55
0
01 Jul 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,191
0
21 Nov 2016
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
175
515
0
26 Jan 2016
1