v1v2 (latest)

Probing the Need for Visual Context in Multimodal Machine Translation

North American Chapter of the Association for Computational Linguistics (NAACL), 2019

20 March 2019

Pranava Madhyastha

Papers citing "Probing the Need for Visual Context in Multimodal Machine Translation"

50 / 75 papers shown

Human + AI for Accelerating Ad Localization Evaluation

202

16 Sep 2025

Dual-branch Prompting for Multimodal Machine Translation

23 Jul 2025

TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for DocumentariesInternational Conference on Applications of Natural Language to Data Bases (NLDB), 2025

328

09 May 2025

Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation

312

25 Apr 2025

Towards Zero-Shot Multimodal Machine Translation

407

18 Jul 2024

Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage

...

Sha Lai

152

25 Jun 2024

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

349

21 May 2024

3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

Liang Ding

Min Zhang

180

29 Apr 2024

Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets

141

09 Apr 2024

Detecting Concrete Visual Tokens for Multimodal Machine Translation

267

05 Mar 2024

Adding Multimodal Capabilities to a Text-only Translation Model

276

05 Mar 2024

The Case for Evaluating Multimodal Translation Models on Text Datasets

210

05 Mar 2024

Video-Helpful Multimodal Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Sadao Kurohashi

175

31 Oct 2023

Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering PairsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Jingbo Zhu

136

26 Oct 2023

Visual Question Generation in Bengali

231

12 Oct 2023

Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages

Baban Gain

Dibyanayan Bandyopadhyay

Subhabrata Mukherjee

Chandranath Adak

Asif Ekbal

312

30 Aug 2023

CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine TranslationIEEE International Conference on Computer Vision (ICCV), 2023

267

29 Aug 2023

Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

Liang Ding

Li Shen

240

24 Aug 2023

Exploring Better Text Image Translation with Multimodal CodebookAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

225

27 May 2023

BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

385

23 May 2023

RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-trainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Jie Zhou

226

13 May 2023

Multimodal Speech Recognition for Language-Guided Embodied AgentsInterspeech (Interspeech), 2023

359

27 Feb 2023

Generalization algorithm of multimodal pre-training model based on graph-text self-supervised trainingICON (ICON), 2023

143

16 Feb 2023

Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications

Muhammad Arslan Manzoor

342

01 Feb 2023

Universal Multimodal Representation for Language UnderstandingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Rui Wang

291

09 Jan 2023

Beyond Triplet: Leveraging the Most Data for Multimodal Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

277

20 Dec 2022

Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

320

20 Dec 2022

Low-resource Neural Machine Translation with Cross-modal AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

202

13 Oct 2022

Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ru Peng

Yawen Zeng

Jiaqi Zhao

240

10 Oct 2022

Multimodal Neural Machine Translation with Search Engine Based Image RetrievalWorkshop on Asian Translation (WAT), 2022

181

26 Jul 2022

VALHALLA: Visual Hallucination for Machine TranslationComputer Vision and Pattern Recognition (CVPR), 2022

458

31 May 2022

BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions DatasetInternational Conference on Language Resources and Evaluation (LREC), 2022

Mohammad Faiyaz Khan

S. M. S. Shifath

Md. Saiful Islam

237

28 May 2022

Neural Machine Translation with Phrase-Level Universal Visual RepresentationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Qingkai Fang

Yang Feng

172

19 Mar 2022

MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment AnalysisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Georgios Paraskevopoulos

Efthymios Georgiou

Alexandros Potamianos

126

24 Jan 2022

VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine TranslationInternational Conference on Language Resources and Evaluation (LREC), 2022

Sadao Kurohashi

237

20 Jan 2022

Guiding Visual Question Generation

348

15 Oct 2021

A Survey on Multi-modal Summarization

206

11 Sep 2021

Vision Matters When It Should: Sanity Checking Multimodal Machine Translation ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Jiaoda Li

Duygu Ataman

Rico Sennrich

198

08 Sep 2021

Journalistic Guidelines Aware News Image CaptioningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

259

07 Sep 2021

ReFormer: The Relational Transformer for Image CaptioningACM Multimedia (ACM MM), 2021

219

29 Jul 2021

Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future DirectionsInformation Fusion (Inf. Fusion), 2021

402

176

29 Jul 2021

BERTGEN: Multi-task Generation through BERTAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Pranava Madhyastha

123

07 Jun 2021

ViTA: Visual-Linguistic Translation by Aligning Object TagsWorkshop on Asian Translation (WAT), 2021

Kshitij Gupta

Devansh Gautam

R. Mamidi

156

01 Jun 2021

Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Zhiyong Wu

Lingpeng Kong

W. Bi

Xiang Li

B. Kao

LRM

140

30 May 2021

"Wikily" Supervised Neural Translation Tailored to Cross-Lingual TasksConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Mohammad Sadegh Rasooli

Chris Callison-Burch

Derry Wijaya

CLIP

254

16 Apr 2021

Visual Cues and Error Correction for Translation RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Zhenhao Li

Marek Rei

Lucia Specia

249

12 Mar 2021

Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual ConceptsComputer Vision and Pattern Recognition (CVPR), 2021

1.2K

1,370

17 Feb 2021

Unifying Vision-and-Language Tasks via Text GenerationInternational Conference on Machine Learning (ICML), 2021

614

611

04 Feb 2021

Cross-lingual Visual Pre-training for Multimodal Machine TranslationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Pranava Madhyastha

204

25 Jan 2021

Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts

241

18 Jan 2021