ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.11612
  4. Cited By
Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and
  Gallery Banks

Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks

17 October 2023
Yimu Wang
Xiangru Jian
Bo Xue
ArXivPDFHTML

Papers citing "Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks"

11 / 11 papers shown
Title
NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval
Zengrong Lin
Zheng Wang
Tianwen Qian
Pan Mu
Sixian Chan
Cong Bai
42
0
0
13 Mar 2025
Prediction hubs are context-informed frequent tokens in LLMs
Prediction hubs are context-informed frequent tokens in LLMs
Beatrix M. G. Nielsen
Iuri Macocco
Marco Baroni
123
1
0
17 Feb 2025
Adversarial Hubness in Multi-Modal Retrieval
Adversarial Hubness in Multi-Modal Retrieval
Tingwei Zhang
Fnu Suya
Rishi Jha
Collin Zhang
Vitaly Shmatikov
AAML
81
1
0
18 Dec 2024
Nearest Neighbor Normalization Improves Multimodal Retrieval
Nearest Neighbor Normalization Improves Multimodal Retrieval
Neil Chowdhury
Franklin Wang
Sumedh Shenoy
Douwe Kiela
Sarah Schwettmann
Tristan Thrush
VLM
32
3
0
31 Oct 2024
Domain Prompt Learning with Quaternion Networks
Domain Prompt Learning with Quaternion Networks
Qinglong Cao
Zhengqin Xu
Yuntian Chen
Chao Ma
Xiaokang Yang
VLM
19
10
0
12 Dec 2023
InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution
InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution
Xiangru Jian
Yimu Wang
17
4
0
20 Oct 2023
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
Xiang Fang
Daizong Liu
Pan Zhou
Yuchong Hu
77
35
0
23 Sep 2022
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text
  Understanding
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
245
554
0
28 Sep 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
Multi-modal Transformer for Video Retrieval
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
401
594
0
21 Jul 2020
Word Translation Without Parallel Data
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
MarcÁurelio Ranzato
Ludovic Denoyer
Hervé Jégou
158
1,630
0
11 Oct 2017
1