Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1903.08678
Cited By
v1
v2 (latest)
Probing the Need for Visual Context in Multimodal Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2019
20 March 2019
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
Loïc Barrault
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Probing the Need for Visual Context in Multimodal Machine Translation"
50 / 75 papers shown
Human + AI for Accelerating Ad Localization Evaluation
Harshit Rajgarhia
Shivali Dalmia
Mengyang Zhao
Mukherji Abhishek
Kiran Ganesh
202
0
0
16 Sep 2025
Dual-branch Prompting for Multimodal Machine Translation
Jie Wang
Zhendong Yang
Liansong Zong
Xiaobo Zhang
D. Wang
Ji Zhang
95
0
0
23 Jul 2025
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
International Conference on Applications of Natural Language to Data Bases (NLDB), 2025
Jinze Lv
Jian Chen
Zi Long
Xianghua Fu
Yin Chen
VGen
328
0
0
09 May 2025
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao Yang
312
0
0
25 Apr 2025
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
407
5
0
18 Jul 2024
Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage
Isidora Chara Tourni
Lei Guo
Hengchang Hu
Edward Edberg Halim
Prakash Ishwar
...
Boqi Chen
Margrit Betke
Fabian Zhafransyah
Sha Lai
Derry Wijaya
152
22
0
25 Jun 2024
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Huangjun Shen
Liangying Shao
Wenbo Li
Zhibin Lan
Zhanyu Liu
Jinsong Su
349
5
0
21 May 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma
Xuebo Liu
Yang Li
Jun Rao
Bei Li
Liang Ding
Lidia S. Chao
Dacheng Tao
Min Zhang
180
5
0
29 Apr 2024
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
Zi Long
Zhenhao Tang
Xianghua Fu
Jian Chen
Shilong Hou
Jinze Lyu
141
6
0
09 Apr 2024
Detecting Concrete Visual Tokens for Multimodal Machine Translation
Braeden Bowen
Vipin Vijayan
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
267
5
0
05 Mar 2024
Adding Multimodal Capabilities to a Text-only Translation Model
Vipin Vijayan
Braeden Bowen
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
LRM
276
10
0
05 Mar 2024
The Case for Evaluating Multimodal Translation Models on Text Datasets
Vipin Vijayan
Braeden Bowen
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
210
4
0
05 Mar 2024
Video-Helpful Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yihang Li
Shuichiro Shimizu
Chenhui Chu
Sadao Kurohashi
Wei Li
175
2
0
31 Oct 2023
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuxin Zuo
Bei Li
Chuanhao Lv
Tong Zheng
Tong Xiao
Jingbo Zhu
136
7
0
26 Oct 2023
Visual Question Generation in Bengali
Mahmud Hasan
Labiba Islam
J. Ruma
T. Mayeesha
Rashedur Rahman
231
1
0
12 Oct 2023
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
312
3
0
30 Aug 2023
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation
IEEE International Conference on Computer Vision (ICCV), 2023
Devaansh Gupta
Siddhant Kharbanda
Jiawei Zhou
Wanhua Li
Hanspeter Pfister
D. Wei
VLM
267
26
0
29 Aug 2023
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?
Haiwei Yang
Liang Ding
Jun Rao
Ye Liu
Li Shen
Changxing Ding
240
24
0
24 Aug 2023
Exploring Better Text Image Translation with Multimodal Codebook
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Lan
Jiawei Yu
Xiang-Yang Li
Wen Zhang
Jian Luan
Bin Wang
Degen Huang
Jinsong Su
225
20
0
27 May 2023
BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Liyan Kang
Luyang Huang
Ningxin Peng
Peihao Zhu
Zewei Sun
Shanbo Cheng
Mingxuan Wang
Degen Huang
Jinsong Su
385
15
0
23 May 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chulun Zhou
Yunlong Liang
Fandong Meng
Jinan Xu
Jinsong Su
Jie Zhou
VLM
226
4
0
13 May 2023
Multimodal Speech Recognition for Language-Guided Embodied Agents
Interspeech (Interspeech), 2023
Allen Chang
Xiaoyuan Zhu
Aarav Monga
Seoho Ahn
Tejas Srinivasan
Jesse Thomason
AuLLM
359
6
0
27 Feb 2023
Generalization algorithm of multimodal pre-training model based on graph-text self-supervised training
ICON (ICON), 2023
Xiaobing Zhang
Zhenhao Tang
Zi Long
Xianghua Fu
SSL
143
0
0
16 Feb 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
342
53
0
01 Feb 2023
Universal Multimodal Representation for Language Understanding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Zhuosheng Zhang
Kehai Chen
Rui Wang
Masao Utiyama
Eiichiro Sumita
Z. Li
Hai Zhao
SSL
291
30
0
09 Jan 2023
Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yaoming Zhu
Zewei Sun
Shanbo Cheng
Yuyang Huang
Liwei Wu
Mingxuan Wang
277
20
0
20 Dec 2022
Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Matthieu Futeral
Cordelia Schmid
Ivan Laptev
Benoît Sagot
Rachel Bawden
320
46
0
20 Dec 2022
Low-resource Neural Machine Translation with Cross-modal Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhe Yang
Qingkai Fang
Yang Feng
VLM
202
10
0
13 Oct 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ru Peng
Yawen Zeng
Jiaqi Zhao
240
24
0
10 Oct 2022
Multimodal Neural Machine Translation with Search Engine Based Image Retrieval
Workshop on Asian Translation (WAT), 2022
Zhenhao Tang
Xiaobing Zhang
Zi Long
Xianghua Fu
181
5
0
26 Jul 2022
VALHALLA: Visual Hallucination for Machine Translation
Computer Vision and Pattern Recognition (CVPR), 2022
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
458
51
0
31 May 2022
BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset
International Conference on Language Resources and Evaluation (LREC), 2022
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
237
6
0
28 May 2022
Neural Machine Translation with Phrase-Level Universal Visual Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Qingkai Fang
Yang Feng
172
53
0
19 Mar 2022
MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment Analysis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Georgios Paraskevopoulos
Efthymios Georgiou
Alexandros Potamianos
126
37
0
24 Jan 2022
VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
International Conference on Language Resources and Evaluation (LREC), 2022
Yihang Li
Shuichiro Shimizu
Weiqi Gu
Chenhui Chu
Sadao Kurohashi
237
20
0
20 Jan 2022
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
348
25
0
15 Oct 2021
A Survey on Multi-modal Summarization
Anubhav Jangra
Sourajit Mukherjee
Adam Jatowt
S. Saha
M. Hasanuzzaman
206
79
0
11 Sep 2021
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jiaoda Li
Duygu Ataman
Rico Sennrich
198
38
0
08 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
259
32
0
07 Sep 2021
ReFormer: The Relational Transformer for Image Captioning
ACM Multimedia (ACM MM), 2021
Xuewen Yang
Yingru Liu
Xin Wang
ViT
219
64
0
29 Jul 2021
Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions
Information Fusion (Inf. Fusion), 2021
Anil Rahate
Rahee Walambe
S. Ramanna
K. Kotecha
402
176
0
29 Jul 2021
BERTGEN: Multi-task Generation through BERT
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Faidon Mitzalis
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
VLM
123
7
0
07 Jun 2021
ViTA: Visual-Linguistic Translation by Aligning Object Tags
Workshop on Asian Translation (WAT), 2021
Kshitij Gupta
Devansh Gautam
R. Mamidi
156
14
0
01 Jun 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Zhiyong Wu
Lingpeng Kong
W. Bi
Xiang Li
B. Kao
LRM
140
97
0
30 May 2021
"Wikily" Supervised Neural Translation Tailored to Cross-Lingual Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Mohammad Sadegh Rasooli
Chris Callison-Burch
Derry Wijaya
CLIP
254
6
0
16 Apr 2021
Visual Cues and Error Correction for Translation Robustness
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Zhenhao Li
Marek Rei
Lucia Specia
249
6
0
12 Mar 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Computer Vision and Pattern Recognition (CVPR), 2021
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
1.2K
1,370
0
17 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
International Conference on Machine Learning (ICML), 2021
Jaemin Cho
Jie Lei
Hao Tan
Joey Tianyi Zhou
MLLM
614
611
0
04 Feb 2021
Cross-lingual Visual Pre-training for Multimodal Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ozan Caglayan
Menekse Kuyu
Mustafa Sercan Amac
Pranava Madhyastha
Erkut Erdem
Aykut Erdem
Lucia Specia
VLM
204
53
0
25 Jan 2021
Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts
S. Kutuzova
Oswin Krause
D. McCloskey
Mads Nielsen
Christian Igel
241
18
0
18 Jan 2021
1
2
Next
Page 1 of 2