MULE: Multimodal Universal Language Embedding

AAAI Conference on Artificial Intelligence (AAAI), 2019

8 September 2019

Papers citing "MULE: Multimodal Universal Language Embedding"

28 / 28 papers shown

Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning

114

10 Oct 2025

Artificial Phantasia: Evidence for Propositional Reasoning-Based Mental Imagery in Large Language Models

Morgan McCarty

Jorge Morales

LRM

148

27 Sep 2025

Remote Sensing Image Intelligent Interpretation with the Language-Centered Perspective: Principles, Methods and Challenges

218

09 Aug 2025

UniMoCo: Unified Modality Completion for Robust Multi-Modal Embeddings

467

17 May 2025

A Hybrid Swarm Intelligence Approach for Optimizing Multimodal Large Language Models Deployment in Edge-Cloud-based Federated Learning Environments

Computer Communications (Comput. Commun.), 2025

476

04 Feb 2025

Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage

...

Sha Lai

215

25 Jun 2024

Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations

Gregor Geigle

Radu Timofte

Goran Glavaš

VLM MLLM

206

14 Jun 2023

Accessible Instruction-Following Agent

Kairui Zhou

210

08 May 2023

Teaching Structured Vision&Language Concepts to Vision&Language Models

Computer Vision and Pattern Recognition (CVPR), 2022

...

397

21 Nov 2022

Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning

ACM Multimedia (ACM MM), 2022

315

26 Aug 2022

MuMUR: Multilingual Multimodal Universal Retrieval

Avinash Madasu

Estelle Aflalo

Gabriela Ben-Melech Stan

Shachar Rosenman

Shao-Yen Tseng

Gedas Bertasius

Vasudev Lal

528

24 Aug 2022

CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations

246

05 Jul 2022

Generalizing Multimodal Pre-training into Multilingual via Language Acquisition

Liang Zhang

Anwen Hu

Qin Jin

VLM

176

29 May 2022

Cross-lingual Adaptation for Recipe Retrieval with Mixup

International Conference on Multimedia Retrieval (ICMR), 2022

B. Zhu

Chong-Wah Ngo

Yue Yu

W. Chan

220

08 May 2022

IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages

International Conference on Machine Learning (ICML), 2022

Siva Reddy

465

27 Jan 2022

Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings

211

06 Dec 2021

Towards Zero-shot Cross-lingual Image Retrieval and Tagging

290

15 Sep 2021

MTVR: Multilingual Moment Retrieval in Videos

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

Jie Lei

Tamara L. Berg

Joey Tianyi Zhou

206

30 Jul 2021

UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

Computer Vision and Pattern Recognition (CVPR), 2021

284

110

01 Apr 2021

Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval

Transactions of the Association for Computational Linguistics (TACL), 2021

427

22 Mar 2021

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Po-Yao (Bernie) Huang

Mandela Patrick

Junjie Hu

Graham Neubig

Florian Metze

Alexander G. Hauptmann

MLLM VLM

381

16 Mar 2021

LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

281

16 Mar 2021

Globetrotter: Connecting Languages by Connecting Images

Dídac Surís

Dave Epstein

Carl Vondrick

VLM

384

08 Dec 2020

Towards Zero-shot Cross-lingual Image Retrieval

Pranav Aggarwal

Ajinkya Kale

VLM

286

24 Nov 2020

COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning

Neural Information Processing Systems (NeurIPS), 2020

Simon Ging

Mohammadreza Zolfaghari

Hamed Pirsiavash

Thomas Brox

ViT CLIP

276

178

01 Nov 2020

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training

389

04 Jun 2020

Learning to Scale Multilingual Representations for Vision-Language Tasks

European Conference on Computer Vision (ECCV), 2020

232

09 Apr 2020

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Journal of Artificial Intelligence Research (JAIR), 2019

489

145

22 Jul 2019