Imagination improves Multimodal Translation

11 May 2017
Desmond Elliott
Ákos Kádár
arXiv: 1705.04350

Papers citing "Imagination improves Multimodal Translation"

50 / 116 papers shown
IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs
Ali Faraz
Akash
Shaharukh Khan
Raja Kolla
Akshat Patidar
Suranjan Goswami
Abhinav Ravi
Chandra Khatri
Shubham Agarwal
VLM
164
0
0
06 Nov 2025
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
Sabrina McCallum
Amit Parekh
Alessandro Suglia
LM&Ro
116
0
0
13 Oct 2025
Dual-branch Prompting for Multimodal Machine Translation
Jie Wang
Zhendong Yang
Liansong Zong
Xiaobo Zhang
D. Wang
Ji Zhang
90
0
0
23 Jul 2025
Multimodal Machine Translation with Visual Scene Graph Pruning
Chenyu Lu
Shiliang Sun
Jing Zhao
N. Zhang
Tengfei Song
Hao Yang
430
1
0
26 May 2025
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
International Conference on Applications of Natural Language to Data Bases (NLDB), 2025
Jinze Lv
Jian Chen
Zi Long
Xianghua Fu
Yin Chen
VGen
328
0
0
09 May 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Conference on Machine Translation (WMT), 2025
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
890
8
0
27 Feb 2025
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
402
5
0
18 Jul 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
Zhipeng Qian
Pei Zhang
Baosong Yang
Kai Fan
Yiwei Ma
Yang Li
Xiaoshuai Sun
Rongrong Ji
VLM
246
3
0
17 Jun 2024
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
Huangjun Shen
Liangying Shao
Wenbo Li
Zhibin Lan
Zhanyu Liu
Jinsong Su
333
4
0
21 May 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma
Xuebo Liu
Yang Li
Jun Rao
Bei Li
Liang Ding
Lidia S. Chao
Dacheng Tao
Min Zhang
178
5
0
29 Apr 2024
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yuxin Zuo
Bei Li
Chuanhao Lv
Tong Zheng
Tong Xiao
Jingbo Zhu
124
7
0
26 Oct 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo
Qingkai Fang
Dong Yu
Yang Feng
225
15
0
20 Oct 2023
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
301
3
0
30 Aug 2023
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Qianji Di
Wenxing Ma
Chen Ma
Tianxiang Hou
Ying Shan
Hanzi Wang
145
1
0
23 Jun 2023
Modality Influence in Multimodal Machine Learning
Abdelhamid Haouhat
Slimane Bellaouar
A. Nehar
H. Cherroun
232
3
0
10 Jun 2023
Exploring Better Text Image Translation with Multimodal Codebook
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhibin Lan
Jiawei Yu
Xiang-Yang Li
Wen Zhang
Jian Luan
Bin Wang
Degen Huang
Jinsong Su
221
19
0
27 May 2023
BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Liyan Kang
Luyang Huang
Ningxin Peng
Peihao Zhu
Zewei Sun
Shanbo Cheng
Mingxuan Wang
Degen Huang
Jinsong Su
376
15
0
23 May 2023
Large Scale Multi-Lingual Multi-Modal Summarization Dataset
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Yash Verma
Anubhav Jangra
Raghvendra Kumar
S. Saha
118
22
0
13 Feb 2023
Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yaoming Zhu
Zewei Sun
Shanbo Cheng
Yuyang Huang
Liwei Wu
Mingxuan Wang
277
20
0
20 Dec 2022
Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Matthieu Futeral
Cordelia Schmid
Ivan Laptev
Benoît Sagot
Rachel Bawden
311
46
0
20 Dec 2022
LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hongcheng Guo
Jiaheng Liu
Haoyang Huang
Jian Yang
Zhoujun Li
Dongdong Zhang
Zheng Cui
Furu Wei
190
24
0
19 Oct 2022
Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Baijun Ji
Tong Zhang
Yicheng Zou
Bojie Hu
Sitian Shen
182
17
0
16 Oct 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ru Peng
Yawen Zeng
Jiaqi Zhao
230
24
0
10 Oct 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
ACM Multimedia (ACM MM), 2022
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
275
27
0
26 Aug 2022
Neural Machine Translation with Phrase-Level Universal Visual Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Qingkai Fang
Yang Feng
171
53
0
19 Mar 2022
Supervised Visual Attention for Simultaneous Multimodal Machine Translation
Journal of Artificial Intelligence Research (JAIR), 2022
Veneta Haralampieva
Ozan Caglayan
Lucia Specia
LRM
217
4
0
23 Jan 2022
Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it?
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021
Tobias Norlund
Lovisa Hagström
Richard Johansson
272
25
0
23 Sep 2021
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
ACM Multimedia (ACM MM), 2021
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
202
25
0
25 Aug 2021
A Survey on Low-Resource Neural Machine Translation
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
192
69
0
09 Jul 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Zhiyong Wu
Lingpeng Kong
W. Bi
Xiang Li
B. Kao
LRM
137
97
0
30 May 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Computer Vision and Pattern Recognition (CVPR), 2021
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM
VLM
235
106
0
01 Apr 2021
Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Julia Ive
A. Li
Yishu Miao
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
179
12
0
22 Feb 2021
Cross-lingual Visual Pre-training for Multimodal Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ozan Caglayan
Menekse Kuyu
Mustafa Sercan Amac
Pranava Madhyastha
Erkut Erdem
Aykut Erdem
Lucia Specia
VLM
196
53
0
25 Jan 2021
Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding
AAAI Conference on Artificial Intelligence (AAAI), 2020
Dexin Wang
Deyi Xiong
105
44
0
18 Dec 2020
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish
Machine Translation (MT), 2020
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
206
9
0
13 Dec 2020
Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games
Alessandro Suglia
Antonio Vergari
Ioannis Konstas
Yonatan Bisk
E. Bastianelli
Andrea Vanzo
Oliver Lemon
OCL
157
11
0
05 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
International Conference on Computational Linguistics (COLING), 2020
Yaoyiran Li
Edoardo Ponti
Ivan Vulić
Anna Korhonen
217
20
0
02 Nov 2020
Generative Imagination Elevates Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Quanyu Long
Mingxuan Wang
Lei Li
167
46
0
21 Sep 2020
Simultaneous Machine Translation with Visual Context
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ozan Caglayan
Julia Ive
Veneta Haralampieva
Pranava Madhyastha
Loïc Barrault
Lucia Specia
145
31
0
15 Sep 2020
Dynamic Context-guided Capsule Network for Multimodal Machine Translation
ACM Multimedia (ACM MM), 2020
Huan Lin
Fandong Meng
Jinsong Su
Yongjing Yin
Zhengyuan Yang
Yubin Ge
Jie Zhou
Jiebo Luo
219
91
0
04 Sep 2020
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yongjing Yin
Fandong Meng
Jinsong Su
Chulun Zhou
Zhengyuan Yang
Jie Zhou
Jiebo Luo
188
160
0
17 Jul 2020
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
Minheng Ni
Haoyang Huang
Lin Su
Edward Cui
Taroon Bharti
Lijuan Wang
Jianfeng Gao
Dongdong Zhang
Nan Duan
285
7
0
04 Jun 2020
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Alessandro Suglia
Ioannis Konstas
Andrea Vanzo
E. Bastianelli
Desmond Elliott
Stella Frank
Oliver Lemon
135
17
0
03 Jun 2020
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Po-Yao (Bernie) Huang
Junjie Hu
Xiaojun Chang
Alexander G. Hauptmann
180
55
0
06 May 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
503
403
0
21 Apr 2020
Neural Machine Translation: Challenges, Progress and Future
Science China Technological Sciences (Sci China Technol Sci), 2020
Jiajun Zhang
Chengqing Zong
158
59
0
13 Apr 2020
Towards Multimodal Simultaneous Neural Machine Translation
Conference on Machine Translation (WMT), 2020
Aizhan Imankulova
Masahiro Kaneko
Tosho Hirasawa
Mamoru Komachi
175
15
0
07 Apr 2020
Visual Agreement Regularized Training for Multi-Modal Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Pengcheng Yang
Boxing Chen
Pei Zhang
Xu Sun
216
34
0
27 Dec 2019
Multimodal Machine Translation through Visuals and Speech
Machine Translation (MT), 2019
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
201
88
0
28 Nov 2019
Bootstrapping Disjoint Datasets for Multilingual Multimodal Representation Learning
Ákos Kádár
Grzegorz Chrupała
Afra Alishahi
Desmond Elliott
227
1
0
09 Nov 2019