1701.06521
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017
23 January 2017
Iacer Calixto
Qun Liu
Nick Campbell
Papers citing
"Incorporating Global Visual Features into Attention-Based Neural Machine Translation"
50 / 66 papers shown
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs
Ali Faraz
Akash
Shaharukh Khan
Raja Kolla
Akshat Patidar
Suranjan Goswami
Abhinav Ravi
Chandra Khatri
Shubham Agarwal
VLM
238
3
0
06 Nov 2025
Dual-branch Prompting for Multimodal Machine Translation
Jie Wang
Zhendong Yang
Liansong Zong
Xiaobo Zhang
D. Wang
Ji Zhang
160
3
0
23 Jul 2025
Multimodal Machine Translation with Visual Scene Graph Pruning
Chenyu Lu
Shiliang Sun
Jing Zhao
N. Zhang
Tengfei Song
Hao Yang
467
1
0
26 May 2025
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao Yang
354
0
0
25 Apr 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Conference on Machine Translation (WMT), 2025
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
909
10
0
27 Feb 2025
Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual Conversations for Cross-Lingual Image Captioning
Conference on Machine Translation (WMT), 2024
Siddharth Betala
Ishan Chokshi
VLM
174
1
0
23 Sep 2024
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
448
5
0
18 Jul 2024
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Huangjun Shen
Liangying Shao
Wenbo Li
Zhibin Lan
Zhanyu Liu
Jinsong Su
434
7
0
21 May 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma
Xuebo Liu
Yang Li
Jun Rao
Bei Li
Liang Ding
Lidia S. Chao
Dacheng Tao
Min Zhang
227
6
0
29 Apr 2024
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
Zi Long
Zhenhao Tang
Xianghua Fu
Jian Chen
Shilong Hou
Jinze Lyu
162
6
0
09 Apr 2024
The Case for Evaluating Multimodal Translation Models on Text Datasets
Vipin Vijayan
Braeden Bowen
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
268
4
0
05 Mar 2024
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuxin Zuo
Bei Li
Chuanhao Lv
Tong Zheng
Tong Xiao
Jingbo Zhu
172
7
0
26 Oct 2023
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
361
5
0
30 Aug 2023
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation
IEEE International Conference on Computer Vision (ICCV), 2023
Devaansh Gupta
Siddhant Kharbanda
Jiawei Zhou
Wanhua Li
Hanspeter Pfister
D. Wei
VLM
290
27
0
29 Aug 2023
Modality Influence in Multimodal Machine Learning
Abdelhamid Haouhat
Slimane Bellaouar
A. Nehar
H. Cherroun
342
3
0
10 Jun 2023
Exploring Better Text Image Translation with Multimodal Codebook
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Lan
Jiawei Yu
Xiang-Yang Li
Wen Zhang
Jian Luan
Bin Wang
Degen Huang
Jinsong Su
276
26
0
27 May 2023
Accessible Instruction-Following Agent
Kairui Zhou
206
1
0
08 May 2023
Generalization algorithm of multimodal pre-training model based on graph-text self-supervised training
ICON (ICON), 2023
Xiaobing Zhang
Zhenhao Tang
Zi Long
Xianghua Fu
SSL
188
0
0
16 Feb 2023
Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Matthieu Futeral
Cordelia Schmid
Ivan Laptev
Benoît Sagot
Rachel Bawden
348
48
0
20 Dec 2022
Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Khyathi Chandu
A. Geramifard
245
3
0
30 Oct 2022
LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hongcheng Guo
Jiaheng Liu
Haoyang Huang
Jian Yang
Zhoujun Li
Dongdong Zhang
Zheng Cui
Furu Wei
230
25
0
19 Oct 2022
Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Baijun Ji
Tong Zhang
Yicheng Zou
Bojie Hu
Sitian Shen
216
18
0
16 Oct 2022
Low-resource Neural Machine Translation with Cross-modal Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhe Yang
Qingkai Fang
Yang Feng
VLM
232
10
0
13 Oct 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ru Peng
Yawen Zeng
Jiaqi Zhao
296
26
0
10 Oct 2022
Multimodal Neural Machine Translation with Search Engine Based Image Retrieval
Workshop on Asian Translation (WAT), 2022
Zhenhao Tang
Xiaobing Zhang
Zi Long
Xianghua Fu
206
5
0
26 Jul 2022
Neural Machine Translation with Phrase-Level Universal Visual Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Qingkai Fang
Yang Feng
217
57
0
19 Mar 2022
MSCTD: A Multimodal Sentiment Chat Translation Dataset
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yunlong Liang
Fandong Meng
Jinan Xu
Yufeng Chen
Jie Zhou
256
28
0
28 Feb 2022
Supervised Visual Attention for Simultaneous Multimodal Machine Translation
Journal of Artificial Intelligence Research (JAIR), 2022
Veneta Haralampieva
Ozan Caglayan
Lucia Specia
LRM
275
4
0
23 Jan 2022
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
ACM Multimedia (ACM MM), 2021
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
232
28
0
25 Aug 2021
IITP at WAT 2021: System description for English-Hindi Multimodal Translation Task
Baban Gain
Dibyanayan Bandyopadhyay
Asif Ekbal
145
9
0
04 Jul 2021
ViTA: Visual-Linguistic Translation by Aligning Object Tags
Workshop on Asian Translation (WAT), 2021
Kshitij Gupta
Devansh Gautam
R. Mamidi
205
15
0
01 Jun 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Computer Vision and Pattern Recognition (CVPR), 2021
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM
VLM
284
110
0
01 Apr 2021
Gumbel-Attention for Multi-modal Machine Translation
Pengbo Liu
Hailong Cao
Tiejun Zhao
263
24
0
16 Mar 2021
Visual Cues and Error Correction for Translation Robustness
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Zhenhao Li
Marek Rei
Lucia Specia
290
7
0
12 Mar 2021
Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Julia Ive
A. Li
Yishu Miao
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
229
12
0
22 Feb 2021
End-to-End Video Question-Answer Generation with Generator-Pretester Network
Hung-Ting Su
Chen-Hsi Chang
Po-Wei Shen
Yu-Siang Wang
Ya-Liang Chang
Yu-Cheng Chang
Pu-Jen Cheng
Winston H. Hsu
188
39
0
05 Jan 2021
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish
Machine Translation (MT), 2020
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
274
9
0
13 Dec 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
344
6
0
19 Oct 2020
A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences
Andrew C. Merritt
Chenhui Chu
Yuki Arase
197
6
0
17 Oct 2020
Visual Pivoting for (Unsupervised) Entity Alignment
Fangyu Liu
Muhao Chen
Dan Roth
Nigel Collier
OCL
433
156
0
28 Sep 2020
Generative Imagination Elevates Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Quanyu Long
Mingxuan Wang
Lei Li
212
48
0
21 Sep 2020
Simultaneous Machine Translation with Visual Context
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ozan Caglayan
Julia Ive
Veneta Haralampieva
Pranava Madhyastha
Loïc Barrault
Lucia Specia
250
31
0
15 Sep 2020
Dynamic Context-guided Capsule Network for Multimodal Machine Translation
ACM Multimedia (ACM MM), 2020
Huan Lin
Fandong Meng
Jinsong Su
Yongjing Yin
Zhengyuan Yang
Yubin Ge
Jie Zhou
Jiebo Luo
258
94
0
04 Sep 2020
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yongjing Yin
Fandong Meng
Jinsong Su
Chulun Zhou
Zhengyuan Yang
Jie Zhou
Jiebo Luo
221
164
0
17 Jul 2020
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Po-Yao (Bernie) Huang
Junjie Hu
Xiaojun Chang
Alexander G. Hauptmann
240
56
0
06 May 2020
Visual Agreement Regularized Training for Multi-Modal Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Pengcheng Yang
Boxing Chen
Pei Zhang
Xu Sun
266
34
0
27 Dec 2019
Multimodal Machine Translation through Visuals and Speech
Machine Translation (MT), 2019
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
235
90
0
28 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation
International Workshop on Spoken Language Translation (IWSLT), 2019
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
241
8
0
29 Oct 2019
Probing Representations Learned by Multimodal Recurrent and Transformer Models
Jindrich Libovický
Pranava Madhyastha
167
1
0
29 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Journal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
474
145
0
22 Jul 2019