STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

2 May 2017

Papers citing "STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset"

16 / 66 papers shown

M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training

288

04 Jun 2020

Captioning Images Taken by People Who Are BlindEuropean Conference on Computer Vision (ECCV), 2020

334

203

20 Feb 2020

UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image CaptioningInternational Conference on Computational Collective Intelligence (ICCCI), 2020

133

01 Feb 2020

Multimodal Machine Translation through Visuals and SpeechMachine Translation (MT), 2019

201

28 Nov 2019

Bootstrapping Disjoint Datasets for Multilingual Multimodal Representation Learning

227

09 Nov 2019

Aligning Multilingual Word Embeddings for Cross-Modal Retrieval TaskConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Alireza Mohammadshahi

R. Lebret

Karl Aberer

118

08 Oct 2019

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and MethodsJournal of Artificial Intelligence Research (JAIR), 2019

406

142

22 Jul 2019

Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal DataAAAI Conference on Artificial Intelligence (AAAI), 2019

Shizhe Chen

Qin Jin

Alexander G. Hauptmann

SSL

02 Jun 2019

Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese

William N. Havard

Jean-Pierre Chevrot

Laurent Besacier

155

08 Feb 2019

How2: A Large-scale Dataset for Multimodal Language Understanding

253

313

01 Nov 2018

Neural Joking Machine : Humorous image captioning

106

30 May 2018

COCO-CN for Cross-Lingual Image Tagging, Captioning and Retrieval

290

181

22 May 2018

Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

188

230

19 Oct 2017

Emergent Translation in Multi-Agent Communication

Jason D. Lee

Dong Wang

Jason Weston

Douwe Kiela

215

12 Oct 2017

Image Pivoting for Learning Multilingual Multimodal Representations

152

24 Jul 2017

Cross-linguistic differences and similarities in image descriptions

194

06 Jul 2017