ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1701.06521
  4. Cited By
Incorporating Global Visual Features into Attention-Based Neural Machine
  Translation

Incorporating Global Visual Features into Attention-Based Neural Machine Translation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017
23 January 2017
Iacer Calixto
Qun Liu
Nick Campbell
ArXiv (abs)PDFHTML

Papers citing "Incorporating Global Visual Features into Attention-Based Neural Machine Translation"

50 / 66 papers shown
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs
Ali Faraz
Akash
Shaharukh Khan
Raja Kolla
Akshat Patidar
Suranjan Goswami
Abhinav Ravi
Chandra Khatri
Shubham Agarwal
VLM
238
3
0
06 Nov 2025
Dual-branch Prompting for Multimodal Machine Translation
Dual-branch Prompting for Multimodal Machine Translation
Jie Wang
Zhendong Yang
Liansong Zong
Xiaobo Zhang
D. Wang
Ji Zhang
160
3
0
23 Jul 2025
Multimodal Machine Translation with Visual Scene Graph Pruning
Multimodal Machine Translation with Visual Scene Graph Pruning
Chenyu Lu
Shiliang Sun
Jing Zhao
N. Zhang
Tengfei Song
Hao Yang
467
1
0
26 May 2025
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation
Zhuang Yu
Shiliang Sun
Jing Zhao
Tengfei Song
Hao Yang
354
0
0
25 Apr 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal TranslationConference on Machine Translation (WMT), 2025
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
909
10
0
27 Feb 2025
Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual
  Conversations for Cross-Lingual Image Captioning
Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual Conversations for Cross-Lingual Image CaptioningConference on Machine Translation (WMT), 2024
Siddharth Betala
Ishan Chokshi
VLM
174
1
0
23 Sep 2024
Towards Zero-Shot Multimodal Machine Translation
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
448
5
0
18 Jul 2024
A Survey on Multi-modal Machine Translation: Tasks, Methods and
  Challenges
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Huangjun Shen
Liangying Shao
Wenbo Li
Zhibin Lan
Zhanyu Liu
Jinsong Su
434
7
0
21 May 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma
Xuebo Liu
Yang Li
Jun Rao
Bei Li
Liang Ding
Lidia S. Chao
Dacheng Tao
Min Zhang
227
6
0
29 Apr 2024
Exploring the Necessity of Visual Modality in Multimodal Machine
  Translation using Authentic Datasets
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
Zi Long
Zhenhao Tang
Xianghua Fu
Jian Chen
Shilong Hou
Jinze Lyu
162
6
0
09 Apr 2024
The Case for Evaluating Multimodal Translation Models on Text Datasets
The Case for Evaluating Multimodal Translation Models on Text Datasets
Vipin Vijayan
Braeden Bowen
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
268
4
0
05 Mar 2024
Incorporating Probing Signals into Multimodal Machine Translation via
  Visual Question-Answering Pairs
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering PairsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuxin Zuo
Bei Li
Chuanhao Lv
Tong Zheng
Tong Xiao
Jingbo Zhu
172
7
0
26 Oct 2023
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
361
5
0
30 Aug 2023
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for
  Multimodal Machine Translation
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine TranslationIEEE International Conference on Computer Vision (ICCV), 2023
Devaansh Gupta
Siddhant Kharbanda
Jiawei Zhou
Wanhua Li
Hanspeter Pfister
D. Wei
VLM
290
27
0
29 Aug 2023
Modality Influence in Multimodal Machine Learning
Modality Influence in Multimodal Machine Learning
Abdelhamid Haouhat
Slimane Bellaouar
A. Nehar
H. Cherroun
342
3
0
10 Jun 2023
Exploring Better Text Image Translation with Multimodal Codebook
Exploring Better Text Image Translation with Multimodal CodebookAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Lan
Jiawei Yu
Xiang-Yang Li
Wen Zhang
Jian Luan
Bin Wang
Degen Huang
Jinsong Su
276
26
0
27 May 2023
Accessible Instruction-Following Agent
Accessible Instruction-Following Agent
Kairui Zhou
206
1
0
08 May 2023
Generalization algorithm of multimodal pre-training model based on
  graph-text self-supervised training
Generalization algorithm of multimodal pre-training model based on graph-text self-supervised trainingICON (ICON), 2023
Xiaobing Zhang
Zhenhao Tang
Zi Long
Xianghua Fu
SSL
188
0
0
16 Feb 2023
Tackling Ambiguity with Images: Improved Multimodal Machine Translation
  and Contrastive Evaluation
Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Matthieu Futeral
Cordelia Schmid
Ivan Laptev
Benoît Sagot
Rachel Bawden
348
48
0
20 Dec 2022
Multilingual Multimodality: A Taxonomical Survey of Datasets,
  Techniques, Challenges and Opportunities
Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Khyathi Chandu
A. Geramifard
245
3
0
30 Oct 2022
LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine
  Translation
LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hongcheng Guo
Jiaheng Liu
Haoyang Huang
Jian Yang
Zhoujun Li
Dongdong Zhang
Zheng Cui
Furu Wei
230
25
0
19 Oct 2022
Increasing Visual Awareness in Multimodal Neural Machine Translation
  from an Information Theoretic Perspective
Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic PerspectiveConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Baijun Ji
Tong Zhang
Yicheng Zou
Bojie Hu
Sitian Shen
216
18
0
16 Oct 2022
Low-resource Neural Machine Translation with Cross-modal Alignment
Low-resource Neural Machine Translation with Cross-modal AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhe Yang
Qingkai Fang
Yang Feng
VLM
232
10
0
13 Oct 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for
  Multimodal Machine Translation
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ru Peng
Yawen Zeng
Jiaqi Zhao
296
26
0
10 Oct 2022
Multimodal Neural Machine Translation with Search Engine Based Image
  Retrieval
Multimodal Neural Machine Translation with Search Engine Based Image RetrievalWorkshop on Asian Translation (WAT), 2022
Zhenhao Tang
Xiaobing Zhang
Zi Long
Xianghua Fu
206
5
0
26 Jul 2022
Neural Machine Translation with Phrase-Level Universal Visual
  Representations
Neural Machine Translation with Phrase-Level Universal Visual RepresentationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Qingkai Fang
Yang Feng
217
57
0
19 Mar 2022
MSCTD: A Multimodal Sentiment Chat Translation Dataset
MSCTD: A Multimodal Sentiment Chat Translation DatasetAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yunlong Liang
Fandong Meng
Jinan Xu
Jinan Xu
Jie Zhou
256
28
0
28 Feb 2022
Supervised Visual Attention for Simultaneous Multimodal Machine
  Translation
Supervised Visual Attention for Simultaneous Multimodal Machine TranslationJournal of Artificial Intelligence Research (JAIR), 2022
Veneta Haralampieva
Ozan Caglayan
Lucia Specia
LRM
275
4
0
23 Jan 2022
Product-oriented Machine Translation with Cross-modal Cross-lingual
  Pre-training
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-trainingACM Multimedia (ACM MM), 2021
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
232
28
0
25 Aug 2021
IITP at WAT 2021: System description for English-Hindi Multimodal
  Translation Task
IITP at WAT 2021: System description for English-Hindi Multimodal Translation Task
Baban Gain
Dibyanayan Bandyopadhyay
Asif Ekbal
145
9
0
04 Jul 2021
ViTA: Visual-Linguistic Translation by Aligning Object Tags
ViTA: Visual-Linguistic Translation by Aligning Object TagsWorkshop on Asian Translation (WAT), 2021
Kshitij Gupta
Devansh Gautam
R. Mamidi
205
15
0
01 Jun 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language
  Pre-training
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2021
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLMVLM
284
110
0
01 Apr 2021
Gumbel-Attention for Multi-modal Machine Translation
Gumbel-Attention for Multi-modal Machine Translation
Pengbo Liu
Hailong Cao
Tiejun Zhao
263
24
0
16 Mar 2021
Visual Cues and Error Correction for Translation Robustness
Visual Cues and Error Correction for Translation RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Zhenhao Li
Marek Rei
Lucia Specia
290
7
0
12 Mar 2021
Exploiting Multimodal Reinforcement Learning for Simultaneous Machine
  Translation
Exploiting Multimodal Reinforcement Learning for Simultaneous Machine TranslationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Julia Ive
A. Li
Yishu Miao
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
229
12
0
22 Feb 2021
End-to-End Video Question-Answer Generation with Generator-Pretester
  Network
End-to-End Video Question-Answer Generation with Generator-Pretester Network
Hung-Ting Su
Chen-Hsi Chang
Po-Wei Shen
Yu-Siang Wang
Ya-Liang Chang
Yu-Cheng Chang
Pu-Jen Cheng
Winston H. Hsu
188
39
0
05 Jan 2021
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision
  and Language Research in Turkish
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in TurkishMachine Translation (MT), 2020
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
274
9
0
13 Dec 2020
Multimodal Research in Vision and Language: A Review of Current and
  Emerging Trends
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
344
6
0
19 Oct 2020
A Corpus for English-Japanese Multimodal Neural Machine Translation with
  Comparable Sentences
A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences
Andrew C. Merritt
Chenhui Chu
Yuki Arase
197
6
0
17 Oct 2020
Visual Pivoting for (Unsupervised) Entity Alignment
Visual Pivoting for (Unsupervised) Entity Alignment
Fangyu Liu
Muhao Chen
Dan Roth
Nigel Collier
OCL
433
156
0
28 Sep 2020
Generative Imagination Elevates Machine Translation
Generative Imagination Elevates Machine TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2020
Quanyu Long
Mingxuan Wang
Lei Li
212
48
0
21 Sep 2020
Simultaneous Machine Translation with Visual Context
Simultaneous Machine Translation with Visual ContextConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ozan Caglayan
Julia Ive
Veneta Haralampieva
Pranava Madhyastha
Loïc Barrault
Lucia Specia
250
31
0
15 Sep 2020
Dynamic Context-guided Capsule Network for Multimodal Machine
  Translation
Dynamic Context-guided Capsule Network for Multimodal Machine TranslationACM Multimedia (ACM MM), 2020
Huan Lin
Fandong Meng
Jinsong Su
Yongjing Yin
Zhengyuan Yang
Yubin Ge
Jie Zhou
Jiebo Luo
258
94
0
04 Sep 2020
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine
  Translation
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Yongjing Yin
Fandong Meng
Jinsong Su
Chulun Zhou
Zhengyuan Yang
Jie Zhou
Jiebo Luo
221
164
0
17 Jul 2020
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual
  Pivoting
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual PivotingAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Po-Yao (Bernie) Huang
Junjie Hu
Xiaojun Chang
Alexander G. Hauptmann
240
56
0
06 May 2020
Visual Agreement Regularized Training for Multi-Modal Machine
  Translation
Visual Agreement Regularized Training for Multi-Modal Machine TranslationAAAI Conference on Artificial Intelligence (AAAI), 2019
Pengcheng Yang
Boxing Chen
Pei Zhang
Xu Sun
266
34
0
27 Dec 2019
Multimodal Machine Translation through Visuals and Speech
Multimodal Machine Translation through Visuals and SpeechMachine Translation (MT), 2019
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
235
90
0
28 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation
Transformer-based Cascaded Multimodal Speech TranslationInternational Workshop on Spoken Language Translation (IWSLT), 2019
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
241
8
0
29 Oct 2019
Probing Representations Learned by Multimodal Recurrent and Transformer
  Models
Probing Representations Learned by Multimodal Recurrent and Transformer Models
Jindrich Libovický
Pranava Madhyastha
167
1
0
29 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and MethodsJournal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
474
145
0
22 Jul 2019
12
Next
Page 1 of 2