Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.03493
Cited By
v1
v2 (latest)
MULE: Multimodal Universal Language Embedding
AAAI Conference on Artificial Intelligence (AAAI), 2019
8 September 2019
Donghyun Kim
Kuniaki Saito
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MULE: Multimodal Universal Language Embedding"
28 / 28 papers shown
Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning
Yufei Wang
Adriana Kovashka
Loretta Fernández
Marc N. Coutanche
Seth Wiener
114
0
0
10 Oct 2025
Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
Morgan McCarty
Jorge Morales
LRM
148
1
0
27 Sep 2025
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Haifeng Li
Wang Guo
Haiyang Wu
Mengwei Wu
Jipeng Zhang
Qing Zhu
Yu Liu
Xin Huang
Chao Tao
218
2
0
09 Aug 2025
UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings
Jiajun Qin
Yuan Pu
Zhuolun He
Seunggeun Kim
David Z. Pan
Bei Yu
467
4
0
17 May 2025
A Hybrid Swarm Intelligence Approach for Optimizing Multimodal Large Language Models Deployment in Edge-Cloud-based Federated Learning Environments
Computer Communications (Comput. Commun.), 2025
Gaith Rjouba
Hanae Elmekki
Saidul Islam
Jamal Bentahar
Rachida Dssouli
476
8
0
04 Feb 2025
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage
Isidora Chara Tourni
Lei Guo
Hengchang Hu
Edward Edberg Halim
Prakash Ishwar
...
Boqi Chen
Margrit Betke
Fabian Zhafransyah
Sha Lai
Derry Wijaya
215
22
0
25 Jun 2024
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Gregor Geigle
Radu Timofte
Goran Glavaš
VLM
MLLM
206
6
0
14 Jun 2023
Accessible Instruction-Following Agent
Kairui Zhou
210
1
0
08 May 2023
Teaching Structured Vision&Language Concepts to Vision&Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Sivan Doveh
Assaf Arbelle
Sivan Harary
Yikang Shen
Roei Herzig
...
Donghyun Kim
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLM
CoGe
397
95
0
21 Nov 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
ACM Multimedia (ACM MM), 2022
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
315
31
0
26 Aug 2022
MuMUR : Multilingual Multimodal Universal Retrieval
Avinash Madasu
Estelle Aflalo
Gabriela Ben-Melech Stan
Shachar Rosenman
Shao-Yen Tseng
Gedas Bertasius
Vasudev Lal
528
6
0
24 Aug 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
246
13
0
05 Jul 2022
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition
Liang Zhang
Anwen Hu
Qin Jin
VLM
176
7
0
29 May 2022
Cross-lingual Adaptation for Recipe Retrieval with Mixup
International Conference on Multimedia Retrieval (ICMR), 2022
B. Zhu
Chong-Wah Ngo
Yue Yu
W. Chan
220
7
0
08 May 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
International Conference on Machine Learning (ICML), 2022
Emanuele Bugliarello
Fangyu Liu
Jonas Pfeiffer
Siva Reddy
Desmond Elliott
Edoardo Ponti
Ivan Vulić
MLLM
VLM
ELM
465
71
0
27 Jan 2022
Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings
Siqi Wang
Manyuan Lu
Nikita Moshkov
Juan C. Caicedo
Bryan A. Plummer
211
4
0
06 Dec 2021
Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Pranav Aggarwal
Ritiz Tambi
Ajinkya Kale
VLM
290
7
0
15 Sep 2021
MTVR: Multilingual Moment Retrieval in Videos
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
206
12
0
30 Jul 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Computer Vision and Pattern Recognition (CVPR), 2021
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM
VLM
284
110
0
01 Apr 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Transactions of the Association for Computational Linguistics (TACL), 2021
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
427
61
0
22 Mar 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Po-Yao (Bernie) Huang
Mandela Patrick
Junjie Hu
Graham Neubig
Florian Metze
Alexander G. Hauptmann
MLLM
VLM
381
61
0
16 Mar 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Siqi Sun
Yen-Chun Chen
Linjie Li
Shuohang Wang
Yuwei Fang
Jingjing Liu
VLM
281
91
0
16 Mar 2021
Globetrotter: Connecting Languages by Connecting Images
Dídac Surís
Dave Epstein
Carl Vondrick
VLM
384
9
0
08 Dec 2020
Towards Zero-shot Cross-lingual Image Retrieval
Pranav Aggarwal
Ajinkya Kale
VLM
286
34
0
24 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Neural Information Processing Systems (NeurIPS), 2020
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViT
CLIP
276
178
0
01 Nov 2020
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
Minheng Ni
Haoyang Huang
Lin Su
Edward Cui
Taroon Bharti
Lijuan Wang
Jianfeng Gao
Dongdong Zhang
Nan Duan
389
7
0
04 Jun 2020
Learning to Scale Multilingual Representations for Vision-Language Tasks
European Conference on Computer Vision (ECCV), 2020
Andrea Burns
Donghyun Kim
Derry Wijaya
Kate Saenko
Bryan A. Plummer
232
36
0
09 Apr 2020
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Journal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
489
145
0
22 Jul 2019
1
Page 1 of 1