ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.03493
  4. Cited By
MULE: Multimodal Universal Language Embedding
v1v2 (latest)

MULE: Multimodal Universal Language Embedding

AAAI Conference on Artificial Intelligence (AAAI), 2019
8 September 2019
Donghyun Kim
Kuniaki Saito
Kate Saenko
Stan Sclaroff
Bryan A. Plummer
    VLM
ArXiv (abs)PDFHTML

Papers citing "MULE: Multimodal Universal Language Embedding"

28 / 28 papers shown
Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning
Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning
Yufei Wang
Adriana Kovashka
Loretta Fernández
Marc N. Coutanche
Seth Wiener
114
0
0
10 Oct 2025
Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models
Morgan McCarty
Jorge Morales
LRM
148
1
0
27 Sep 2025
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges
Haifeng Li
Wang Guo
Haiyang Wu
Mengwei Wu
Jipeng Zhang
Qing Zhu
Yu Liu
Xin Huang
Chao Tao
218
2
0
09 Aug 2025
UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings
UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings
Jiajun Qin
Yuan Pu
Zhuolun He
Seunggeun Kim
David Z. Pan
Bei Yu
467
4
0
17 May 2025
A Hybrid Swarm Intelligence Approach for Optimizing Multimodal Large Language Models Deployment in Edge-Cloud-based Federated Learning Environments
A Hybrid Swarm Intelligence Approach for Optimizing Multimodal Large Language Models Deployment in Edge-Cloud-based Federated Learning EnvironmentsComputer Communications (Comput. Commun.), 2025
Gaith Rjouba
Hanae Elmekki
Saidul Islam
Jamal Bentahar
Rachida Dssouli
476
8
0
04 Feb 2025
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence
  Coverage
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage
Isidora Chara Tourni
Lei Guo
Hengchang Hu
Edward Edberg Halim
Prakash Ishwar
...
Boqi Chen
Margrit Betke
Fabian Zhafransyah
Sha Lai
Derry Wijaya
215
22
0
25 Jun 2024
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language
  Representations
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Gregor Geigle
Radu Timofte
Goran Glavaš
VLMMLLM
206
6
0
14 Jun 2023
Accessible Instruction-Following Agent
Accessible Instruction-Following Agent
Kairui Zhou
210
1
0
08 May 2023
Teaching Structured Vision&Language Concepts to Vision&Language Models
Teaching Structured Vision&Language Concepts to Vision&Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Sivan Doveh
Assaf Arbelle
Sivan Harary
Yikang Shen
Roei Herzig
...
Donghyun Kim
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLMCoGe
397
95
0
21 Nov 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
Cross-Lingual Cross-Modal Retrieval with Noise-Robust LearningACM Multimedia (ACM MM), 2022
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
315
31
0
26 Aug 2022
MuMUR : Multilingual Multimodal Universal Retrieval
MuMUR : Multilingual Multimodal Universal Retrieval
Avinash Madasu
Estelle Aflalo
Gabriela Ben-Melech Stan
Shachar Rosenman
Shao-Yen Tseng
Gedas Bertasius
Vasudev Lal
528
6
0
24 Aug 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual,
  Environment-Agnostic Representations
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
246
13
0
05 Jul 2022
Generalizing Multimodal Pre-training into Multilingual via Language
  Acquisition
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition
Liang Zhang
Anwen Hu
Qin Jin
VLM
176
7
0
29 May 2022
Cross-lingual Adaptation for Recipe Retrieval with Mixup
Cross-lingual Adaptation for Recipe Retrieval with MixupInternational Conference on Multimedia Retrieval (ICMR), 2022
B. Zhu
Chong-Wah Ngo
Yue Yu
W. Chan
220
7
0
08 May 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and
  Languages
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and LanguagesInternational Conference on Machine Learning (ICML), 2022
Emanuele Bugliarello
Fangyu Liu
Jonas Pfeiffer
Siva Reddy
Desmond Elliott
Edoardo Ponti
Ivan Vulić
MLLMVLMELM
465
71
0
27 Jan 2022
Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings
Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings
Siqi Wang
Manyuan Lu
Nikita Moshkov
Juan C. Caicedo
Bryan A. Plummer
211
4
0
06 Dec 2021
Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Pranav Aggarwal
Ritiz Tambi
Ajinkya Kale
VLM
290
7
0
15 Sep 2021
MTVR: Multilingual Moment Retrieval in Videos
MTVR: Multilingual Moment Retrieval in VideosAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
206
12
0
30 Jul 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language
  Pre-training
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2021
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLMVLM
284
110
0
01 Apr 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for
  Improved Cross-Modal Retrieval
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal RetrievalTransactions of the Association for Computational Linguistics (TACL), 2021
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
427
61
0
22 Mar 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual
  Transfer of Vision-Language Models
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Po-Yao (Bernie) Huang
Mandela Patrick
Junjie Hu
Graham Neubig
Florian Metze
Alexander G. Hauptmann
MLLMVLM
381
61
0
16 Mar 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time
  Image-Text Retrieval
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text RetrievalNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Siqi Sun
Yen-Chun Chen
Linjie Li
Shuohang Wang
Yuwei Fang
Jingjing Liu
VLM
281
91
0
16 Mar 2021
Globetrotter: Connecting Languages by Connecting Images
Globetrotter: Connecting Languages by Connecting Images
Dídac Surís
Dave Epstein
Carl Vondrick
VLM
384
9
0
08 Dec 2020
Towards Zero-shot Cross-lingual Image Retrieval
Towards Zero-shot Cross-lingual Image Retrieval
Pranav Aggarwal
Ajinkya Kale
VLM
286
34
0
24 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation
  Learning
COOT: Cooperative Hierarchical Transformer for Video-Text Representation LearningNeural Information Processing Systems (NeurIPS), 2020
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViTCLIP
276
178
0
01 Nov 2020
M3P: Learning Universal Representations via Multitask Multilingual
  Multimodal Pre-training
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
Minheng Ni
Haoyang Huang
Lin Su
Edward Cui
Taroon Bharti
Lijuan Wang
Jianfeng Gao
Dongdong Zhang
Nan Duan
389
7
0
04 Jun 2020
Learning to Scale Multilingual Representations for Vision-Language Tasks
Learning to Scale Multilingual Representations for Vision-Language TasksEuropean Conference on Computer Vision (ECCV), 2020
Andrea Burns
Donghyun Kim
Derry Wijaya
Kate Saenko
Bryan A. Plummer
232
36
0
09 Apr 2020
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and MethodsJournal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
489
145
0
22 Jul 2019
1
Page 1 of 1