ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.01552
  4. Cited By
Scene Text Retrieval via Joint Text Detection and Similarity Learning

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Computer Vision and Pattern Recognition (CVPR), 2021
4 April 2021
Hao Wang
X. Bai
Mingkun Yang
Shenggao Zhu
Jing Wang
Wenyu Liu
    3DV
ArXiv (abs)PDFHTMLGithub (82★)

Papers citing "Scene Text Retrieval via Joint Text Detection and Similarity Learning"

14 / 14 papers shown
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
Liang Yin
Xudong Xie
Zhang Li
Xiang Bai
Yuliang Liu
LRM
372
0
0
12 Jun 2025
A Token-level Text Image Foundation Model for Document Understanding
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan
Zining Wang
Pei Fu
Zhengtao Guo
Wei Shen
...
Chen Duan
Hao Sun
Qianyi Jiang
Junfeng Luo
Yunbo Wang
VLM
702
8
0
04 Mar 2025
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and
  Flexible Scene Text Retrieval
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalACM Multimedia (MM), 2024
Gangyan Zeng
Yuan Zhang
Jin Wei
Dongbao Yang
Peng Zhang
Yiwen Gao
Xugong Qin
Can Ma
VLMCLIP
276
9
0
01 Aug 2024
A Language-based solution to enable Metaverse Retrieval
A Language-based solution to enable Metaverse Retrieval
Ali Abdari
Alex Falcon
Giuseppe Serra
DiffM
360
9
0
22 Dec 2023
FArMARe: a Furniture-Aware Multi-task methodology for Recommending
  Apartments based on the user interests
FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari
Alex Falcon
Giuseppe Serra
295
7
0
06 Sep 2023
Visual Information Extraction in the Wild: Practical Dataset and
  End-to-end Solution
Visual Information Extraction in the Wild: Practical Dataset and End-to-end SolutionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2023
Jianfeng Kuang
Wei Hua
Dingkang Liang
Mingkun Yang
Deqiang Jiang
Bo Ren
Xiang Bai
379
60
0
12 May 2023
A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
A Large Cross-Modal Video Retrieval Dataset with Reading ComprehensionPattern Recognition (Pattern Recogn.), 2023
Weijia Wu
Yuzhong Zhao
Zhuangzi Li
Jiahong Li
Hong Zhou
Mike Zheng Shou
Xiang Bai
285
36
0
05 May 2023
Improving Scene Text Image Super-resolution via Dual Prior Modulation
  Network
Improving Scene Text Image Super-resolution via Dual Prior Modulation NetworkAAAI Conference on Artificial Intelligence (AAAI), 2023
Shipeng Zhu
Zuoyan Zhao
Pengfei Fang
H. Xue
SupRDiffM
284
38
0
21 Feb 2023
Domain Adaptive Scene Text Detection via Subcategorization
Domain Adaptive Scene Text Detection via Subcategorization
Zichen Tian
Chuhui Xue
Jingyi Zhang
Shijian Lu
324
5
0
01 Dec 2022
MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text
  Removal
MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal
Guangtao Lyu
172
2
0
12 Nov 2022
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
ViSTA: Vision and Scene Text Aggregation for Cross-Modal RetrievalComputer Vision and Pattern Recognition (CVPR), 2022
Mengjun Cheng
Yipeng Sun
Long Wang
Xiongwei Zhu
Kun Yao
...
Guoli Song
Junyu Han
Jingtuo Liu
Errui Ding
Jingdong Wang
377
77
0
31 Mar 2022
Knowledge Mining with Scene Text for Fine-Grained Recognition
Knowledge Mining with Scene Text for Fine-Grained RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Hao Wang
Junchao Liao
Tianheng Cheng
Zewen Gao
Hao Liu
Bo Ren
X. Bai
Wenyu Liu
304
14
0
27 Mar 2022
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text
  Spotter with Transformer
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
Weijia Wu
Yuanqiang Cai
Debing Zhang
Sibo Wang
Zhuang Li
Jiahong Li
Yejun Tang
Hong Zhou
214
38
0
09 Dec 2021
YOLO9000: Better, Faster, Stronger
YOLO9000: Better, Faster, StrongerComputer Vision and Pattern Recognition (CVPR), 2016
Joseph Redmon
Ali Farhadi
VLMObjD
824
17,410
0
25 Dec 2016
1
Page 1 of 1