2206.00621
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
1 June 2022
Yan Zeng
Wangchunshu Zhou
Ao Luo
Ziming Cheng
Xinsong Zhang
VLM
Papers citing
"Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training"
24 / 24 papers shown
Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Min Cao
Xinyu Zhou
Ding Jiang
Bo Du
Mang Ye
Min Zhang
224
2
0
20 Oct 2025
Multilingual Vision-Language Models, A Survey
Andrei-Alexandru Manea
Jindřich Libovický
VLM
176
1
0
26 Sep 2025
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
International Conference on Text, Speech and Dialogue (TSD), 2025
Andrei-Alexandru Manea
Jindřich Libovický
VLM
424
1
0
30 Apr 2025
No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Youssef Mohamed
Runjia Li
Ibrahim Said Ahmad
Kilichbek Haydarov
Juil Sock
Kenneth Church
Mohamed Elhoseiny
VLM
236
16
0
06 Nov 2024
Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
ACM Multimedia (MM), 2024
Yabing Wang
Le Wang
Qiang-feng Zhou
Zhibin Wang
Hao Li
Gang Hua
Wei Tang
235
24
0
30 Sep 2024
DSCLAP: Domain-Specific Contrastive Language-Audio Pre-Training
Shengqiang Liu
D. Liu
Anna Wang
Zhiyu Zhang
Jie Ying Gao
Yali Li
CLIP
VLM
147
1
0
14 Sep 2024
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation
Yuxuan Wang
Yijun Liu
Fei Yu
Chen Huang
Kexin Li
Zhiguo Wan
Wanxiang Che
VLM
CoGe
180
8
0
01 Jul 2024
Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
Zhijie Nie
Richong Zhang
Zhangchi Feng
Hailang Huang
Xudong Liu
223
6
0
26 Jun 2024
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding
Amith Ananthram
Elias Stengel-Eskin
Carl Vondrick
Joey Tianyi Zhou
VLM
404
7
0
17 Jun 2024
Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Yujin Baek
Koanho Lee
Hyesu Lim
Jaeseok Kim
Junmo Park
Yu-Jung Heo
Du-Seong Chang
Jaegul Choo
171
3
0
04 Jun 2024
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
Zhiqiang Wang
Hongcheng Guo
Yuwei Yin
Jiaqi Bai
Bing Wang
Jiaheng Liu
Xinnian Liang
Linzheng Chai
Liqun Yang
Zhoujun Li
224
14
0
26 Mar 2024
What Is Missing in Multilingual Visual Reasoning and How to Fix It
Yueqi Song
Simran Khanuja
Graham Neubig
VLM
LRM
642
8
0
03 Mar 2024
CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yabing Wang
Fan Wang
Jianfeng Dong
Hao Luo
VLM
238
20
0
14 Dec 2023
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval
IEEE Transactions on Image Processing (IEEE TIP), 2023
Yabing Wang
Shuhui Wang
Hao Luo
Jianfeng Dong
F. Wang
Meng Han
Xun Wang
Meng Wang
238
15
0
11 Sep 2023
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
International Conference on Learning Representations (ICLR), 2023
Jinyi Hu
Yuan Yao
Chong Wang
Shanonan Wang
Yinxu Pan
...
Yankai Lin
Jiao Xue
Dahai Li
Zhiyuan Liu
Maosong Sun
MLLM
VLM
302
78
0
23 Aug 2023
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs
Gregor Geigle
Abhay Jain
Radu Timofte
Goran Glavaš
VLM
MLLM
299
44
0
13 Jul 2023
Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yasmine Karoui
R. Lebret
Negar Foroutan
Karl Aberer
MLLM
VLM
143
4
0
29 Jun 2023
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Gregor Geigle
Radu Timofte
Goran Glavaš
VLM
MLLM
181
6
0
14 Jun 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
Neural Information Processing Systems (NeurIPS), 2023
Zhongwei Wan
Che Liu
Mi Zhang
Jie Fu
Benyou Wang
Sibo Cheng
Lei Ma
César Quilodrán-Casas
Rossella Arcucci
450
99
0
31 May 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chulun Zhou
Yunlong Liang
Fandong Meng
Jinan Xu
Jinsong Su
Jie Zhou
VLM
270
4
0
13 May 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
353
61
0
01 Feb 2023
X²-VLM: All-In-One Pre-trained Model For Vision-Language Tasks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yan Zeng
Xinsong Zhang
Hang Li
Jiawei Wang
Jipeng Zhang
Wangchunshu Zhou
VLM
MLLM
281
26
0
22 Nov 2022
Improving the Cross-Lingual Generalisation in Visual Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2022
Farhad Nooralahzadeh
Rico Sennrich
278
8
0
07 Sep 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
492
11
0
11 Mar 2022