Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.06647
Cited By
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge
21 September 2016
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge"
17 / 67 papers shown
Title
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
Xu Yang
Hanwang Zhang
Jianfei Cai
42
74
0
01 Aug 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
17
96
0
10 Jul 2018
Topic-Guided Attention for Image Captioning
Zhihao Zhu
Zhan Xue
Zejian Yuan
14
23
0
10 Jul 2018
Natural Language Generation for Electronic Health Records
Scott H. Lee
SyDa
6
81
0
01 Jun 2018
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech
Aditya Deshpande
J. Aneja
Liwei Wang
A. Schwing
David A. Forsyth
17
146
0
31 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
19
115
0
18 May 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
14
37
0
23 Apr 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
W. Liu
24
316
0
30 Mar 2018
HoME: a Household Multimodal Environment
Simon Brodeur
Ethan Perez
Ankesh Anand
Florian Golemo
Luca Herranz-Celotti
Florian Strub
Jean Rouat
Hugo Larochelle
Aaron Courville
LM&Ro
28
103
0
29 Nov 2017
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
A. Schwing
VLM
23
359
0
24 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
A. Schwing
Svetlana Lazebnik
CoGe
24
175
0
19 Nov 2017
I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation
Hao Dong
Jingqing Zhang
Douglas McIlwraith
Yike Guo
30
58
0
20 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang
Zawlin Kyaw
Shih-Fu Chang
Tat-Seng Chua
ViT
142
560
0
27 Feb 2017
Learning Visual N-Grams from Web Data
Ang Li
Allan Jabri
Armand Joulin
L. V. D. van der Maaten
VLM
18
136
0
29 Dec 2016
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu
G. Wang
Jianfei Cai
Tsuhan Chen
23
132
0
21 Dec 2016
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
11
1,876
0
02 Dec 2016
Semantic Regularisation for Recurrent Image Annotation
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
29
103
0
16 Nov 2016
Previous
1
2