Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.14338
Cited By
Turning a CLIP Model into a Scene Text Detector
28 February 2023
Wenwen Yu
Yuliang Liu
Wei Hua
Deqiang Jiang
Bo Ren
Xiang Bai
VLM
CLIP
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Turning a CLIP Model into a Scene Text Detector"
8 / 8 papers shown
Title
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
39
2
0
28 Jul 2024
Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation
Zhekai Du
Xinyao Li
Fengling Li
Ke Lu
Lei Zhu
Jingjing Li
25
15
0
05 Mar 2024
Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
15
39
0
12 May 2023
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
319
2,108
0
02 Sep 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM
ObjD
206
698
0
28 Apr 2021
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
Shi-Xue Zhang
Xiaobin Zhu
Jie-Bo Hou
Chang-rui Liu
Chun Yang
Hongfa Wang
Xu-Cheng Yin
GNN
30
181
0
17 Mar 2020
Convolutional Character Networks
Linjie Xing
Zhi Tian
Weilin Huang
Matthew R. Scott
35
155
0
17 Oct 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
391
2,216
0
03 Sep 2019
1