Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.05107
Cited By
Towards Zero-shot Cross-lingual Image Retrieval
24 November 2020
Pranav Aggarwal
Ajinkya Kale
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (14★)
Papers citing
"Towards Zero-shot Cross-lingual Image Retrieval"
14 / 14 papers shown
Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Min Cao
Xinyu Zhou
Ding Jiang
Bo Du
Mang Ye
Min Zhang
235
2
0
20 Oct 2025
Multilingual Vision-Language Models, A Survey
Andrei-Alexandru Manea
Jindřich Libovický
VLM
210
1
0
26 Sep 2025
Meta CLIP 2: A Worldwide Scaling Recipe
Yung-Sung Chuang
Yang Li
Dong Wang
Ching-Feng Yeh
Kehan Lyu
...
Zhuang Liu
Saining Xie
Anuj Kumar
Shang-Wen Li
Hu Xu
CLIP
VLM
477
34
0
29 Jul 2025
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Andreas Koukounas
Georgios Mastrapas
Bo Wang
Mohammad Kalim Akram
Sedigheh Eslami
Michael Gunther
Isabelle Mohr
Saba Sturua
Scott Martens
Nan Wang
VLM
951
27
0
11 Dec 2024
Do Vision and Language Encoders Represent the World Similarly?
Computer Vision and Pattern Recognition (CVPR), 2024
Mayug Maniparambil
Raiymbek Akshulakov
Y. A. D. Djilali
Sanath Narayan
Abdalgader Abubaker
K. Mangalam
Noel E. O'Connor
VLM
354
41
0
10 Jan 2024
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval
IEEE Transactions on Image Processing (IEEE TIP), 2023
Yabing Wang
Shuhui Wang
Hao Luo
Jianfeng Dong
F. Wang
Meng Han
Xun Wang
Meng Wang
243
15
0
11 Sep 2023
AltDiffusion: A Multilingual Text-to-Image Diffusion Model
AAAI Conference on Artificial Intelligence (AAAI), 2023
Fulong Ye
Guangyi Liu
Xinya Wu
Ledell Yu Wu
VLM
413
52
0
19 Aug 2023
Translation-Enhanced Multilingual Text-to-Image Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yaoyiran Li
Ching-Yun Chang
Stephen Rawls
Ivan Vulić
Anna Korhonen
280
14
0
30 May 2023
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zhongzhi Chen
Guangyi Liu
Bo Zhang
Fulong Ye
Qinghong Yang
Ledell Yu Wu
VLM
447
110
0
12 Nov 2022
MaXM: Towards Multilingual Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Soravit Changpinyo
Linting Xue
Michal Yarom
Ashish V. Thapliyal
Idan Szpektor
J. Amelot
Xi Chen
Radu Soricut
318
8
0
12 Sep 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
ACM Multimedia (ACM MM), 2022
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
315
31
0
26 Aug 2022
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition
Liang Zhang
Anwen Hu
Qin Jin
VLM
176
7
0
29 May 2022
Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Pranav Aggarwal
Ritiz Tambi
Ajinkya Kale
VLM
290
7
0
15 Sep 2021
MURAL: Multimodal, Multitask Retrieval Across Languages
Aashi Jain
Mandy Guo
Krishna Srinivasan
Ting-Li Chen
Sneha Kudugunta
Chao Jia
Yinfei Yang
Jason Baldridge
VLM
390
63
0
10 Sep 2021
1
Page 1 of 1