CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation

29 August 2023

Papers citing "CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation"

6 / 6 papers shown

Title
Expanding Event Modality Applications through a Robust CLIP-Based Encoder SungHeon Jeong Hanning Chen Sanggeon Yun Suhyeon Cho Wenjun Huang Xiangjian Liu Mohsen Imani 98 1 0 04 Dec 2024
Generating Faithful and Salient Text from Multimodal Data Tahsina Hashem Weiqing Wang Derry Tanti Wijaya Mohammed Eunus Ali Yuan-Fang Li 26 0 0 06 Sep 2024
Towards Zero-Shot Multimodal Machine Translation Matthieu Futeral Cordelia Schmid Benoît Sagot Rachel Bawden 30 3 0 18 Jul 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation Junnan Li Dongxu Li Caiming Xiong S. Hoi MLLM BDL VLM CLIP 390 4,110 0 28 Jan 2022
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation Haoran Xu Benjamin Van Durme Kenton W. Murray 42 57 0 09 Sep 2021
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning Krishna Srinivasan K. Raman Jiecao Chen Michael Bendersky Marc Najork VLM 197 308 0 02 Mar 2021