ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.06509
  4. Cited By
Step-Wise Hierarchical Alignment Network for Image-Text Matching

Step-Wise Hierarchical Alignment Network for Image-Text Matching

11 June 2021
Zhong Ji
Kexin Chen
Haoran Wang
ArXivPDFHTML

Papers citing "Step-Wise Hierarchical Alignment Network for Image-Text Matching"

20 / 20 papers shown
Title
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
Guanqi Zhan
Yuanpei Liu
Kai Han
Weidi Xie
Andrew Zisserman
VLM
135
0
0
21 Feb 2025
Image Embedding Sampling Method for Diverse Captioning
Image Embedding Sampling Method for Diverse Captioning
Sania Waheed
Na Min An
55
0
0
14 Feb 2025
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval
Naoya Sogi
Takashi Shibata
Makoto Terao
VLM
28
1
0
17 Jul 2024
Dynamic Self-adaptive Multiscale Distillation from Pre-trained
  Multimodal Large Model for Efficient Cross-modal Representation Learning
Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Zhengyang Liang
Meiyu Liang
Wei Huang
Yawen Li
Zhe Xue
21
1
0
16 Apr 2024
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing
  Image-text Retrieval
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval
Qing Ma
Jiancheng Pan
Cong Bai
18
16
0
12 Oct 2023
A Survey on Interpretable Cross-modal Reasoning
A Survey on Interpretable Cross-modal Reasoning
Dizhan Xue
Shengsheng Qian
Zuyi Zhou
Changsheng Xu
LRM
29
4
0
05 Sep 2023
Towards Fast and Accurate Image-Text Retrieval with Self-Supervised
  Fine-Grained Alignment
Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment
Jiamin Zhuang
Jing Yu
Yang Ding
Xiangyang Qu
Yue Hu
19
9
0
27 Aug 2023
Progressive Feature Mining and External Knowledge-Assisted
  Text-Pedestrian Image Retrieval
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval
Huafeng Li
Shedan Yang
Yafei Zhang
Dapeng Tao
Z. Yu
25
3
0
23 Aug 2023
Hierarchical Matching and Reasoning for Multi-Query Image Retrieval
Hierarchical Matching and Reasoning for Multi-Query Image Retrieval
Zhong Ji
Zhihao Li
Yan Zhang
Haoran Wang
Yanwei Pang
Xuelong Li
24
11
0
26 Jun 2023
Plug-and-Play Regulators for Image-Text Matching
Plug-and-Play Regulators for Image-Text Matching
Haiwen Diao
Y. Zhang
W. Liu
Xiang Ruan
Huchuan Lu
27
20
0
23 Mar 2023
Scene Graph Based Fusion Network For Image-Text Retrieval
Scene Graph Based Fusion Network For Image-Text Retrieval
Guoliang Wang
Yanlei Shang
Yongzhe Chen
24
1
0
20 Mar 2023
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text
  Retrieval
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval
Yan Zhang
Zhong Ji
Dingrong Wang
Yanwei Pang
Xuelong Li
VLM
16
21
0
17 Jan 2023
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval
Jie Guo
Meiting Wang
Yan Zhou
Bin Song
Yuhao Chi
Wei-liang Fan
Jianglong Chang
29
15
0
16 Dec 2022
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
J. Jose
13
26
0
17 Oct 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for
  Image-Text Retrieval
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
Haoran Wang
Dongliang He
Wenhao Wu
Boyang Xia
Min Yang
Fu Li
Yunlong Yu
Zhong Ji
Errui Ding
Jingdong Wang
19
22
0
21 Aug 2022
Boosting Video-Text Retrieval with Explicit High-Level Semantics
Boosting Video-Text Retrieval with Explicit High-Level Semantics
Haoran Wang
Di Xu
Dongliang He
Fu Li
Zhong Ji
Jungong Han
Errui Ding
18
11
0
08 Aug 2022
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
Boyang Xia
Wenhao Wu
Haoran Wang
Rui Su
Dongliang He
Haosen Yang
Xiaoran Fan
Wanli Ouyang
15
21
0
21 Jul 2022
HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text
  Retrieval
HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval
Feilong Chen
Xiuyi Chen
Jiaxin Shi
Duzhen Zhang
Jianlong Chang
Qi Tian
VLM
CLIP
32
6
0
24 May 2022
Image-text Retrieval: A Survey on Recent Research and Development
Image-text Retrieval: A Survey on Recent Research and Development
Min Cao
Shiping Li
Juntao Li
Liqiang Nie
Min Zhang
21
81
0
28 Mar 2022
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
Yang Jiao
Shaoxiang Chen
Zequn Jie
Jing Chen
Lin Ma
Yu-Gang Jiang
3DPC
19
46
0
10 Mar 2022
1