Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1502.03044
Cited By
v1
v2
v3 (latest)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Dong Wang
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
50 / 3,580 papers shown
Does Multimodality Help Human and Machine for Translation and Image Captioning?
Ozan Caglayan
Walid Aransa
Yaxing Wang
Marc Masana
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Joost van de Weijer
205
87
0
30 May 2016
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
253
747
0
26 May 2016
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
276
87
0
25 May 2016
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings
Biao Zhang
Deyi Xiong
Jinsong Su
74
20
0
25 May 2016
Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition
Xiao-Chang Liu
Jiang Wang
Shilei Wen
Errui Ding
Yuanqing Lin
150
79
0
20 May 2016
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
465
3,338
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
425
891
0
17 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
267
387
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
247
104
0
09 May 2016
Chained Predictions Using Convolutional Neural Networks
Georgia Gkioxari
Alexander Toshev
Navdeep Jaitly
BDL
226
195
0
08 May 2016
DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM
Feng Wang
Huichao Gong
Gaochao liu
Meijing Li
Chuangye Yan
Tian Xia
Xueming Li
Jianyang Zeng
123
181
0
06 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
265
88
0
04 May 2016
Multi30K: Multilingual English-German Image Descriptions
Desmond Elliott
Stella Frank
K. Simaán
Lucia Specia
VLM
281
629
0
02 May 2016
Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion
Dinesh Jayaraman
Kristen Grauman
273
95
0
30 Apr 2016
Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
Théodore Bluche
AI4TS
247
201
0
28 Apr 2016
Dialog-based Language Learning
Jason Weston
LLMAG
412
110
0
20 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DV
VLM
205
72
0
18 Apr 2016
Parallelizing Word2Vec in Shared and Distributed Memory
Shihao Ji
N. Satish
Sheng Li
Pradeep Dubey
VLM
MoE
211
72
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar Sigurdsson
Xinlei Chen
Abhinav Gupta
150
39
0
14 Apr 2016
Filling in the details: Perceiving from low fidelity images
F. Wick
Michael L. Wick
M. Pomplun
3DH
53
1
0
14 Apr 2016
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
209
525
0
13 Apr 2016
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
Théodore Bluche
J. Louradour
Ronaldo O. Messina
VLM
176
183
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
199
295
0
10 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
139
93
0
07 Apr 2016
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu
Vaibhava Goel
204
44
0
06 Apr 2016
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition
Ziyan Wang
Jiwen Lu
Ruogu Lin
Jianjiang Feng
Jie zhou
276
29
0
06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Guntis Barzdins
Steve Renals
D. Gosko
68
6
0
05 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
212
294
0
04 Apr 2016
Character-Level Question Answering with Attention
David Golub
Xiaodong He
294
189
0
04 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers
Jacob Andreas
Dan Klein
ReLM
LRM
199
185
0
02 Apr 2016
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
171
9
0
02 Apr 2016
AttSum: Joint Learning of Focusing and Summarization with Neural Attention
Ziqiang Cao
Wenjie Li
Sujian Li
Furu Wei
Yanran Li
266
119
0
01 Apr 2016
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen
Hung-yi Lee
172
69
0
31 Mar 2016
Minimal Gated Unit for Recurrent Neural Networks
Guoxiang Zhou
Jianxin Wu
Chen-Da Liu-Zhang
Zhi Zhou
185
357
0
31 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
136
8
0
30 Mar 2016
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
625
414
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
146
127
0
30 Mar 2016
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLM
FAtt
277
646
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
128
382
0
28 Mar 2016
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention
Linlin Chao
Jianhua Tao
Minghao Yang
Ya Li
Zhengqi Wen
118
31
0
28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Loris Bazzani
Hugo Larochelle
Lorenzo Torresani
322
139
0
27 Mar 2016
Neural Text Generation from Structured Data with Application to the Biography Domain
R. Lebret
David Grangier
Michael Auli
253
49
0
24 Mar 2016
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
128
230
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
211
113
0
23 Mar 2016
Semantic Object Parsing with Graph LSTM
Xiaodan Liang
Xiaohui Shen
Jiashi Feng
Liang Lin
Shuicheng Yan
327
364
0
23 Mar 2016
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
368
1,433
0
21 Mar 2016
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
266
506
0
20 Mar 2016
One-Shot Generalization in Deep Generative Models
Danilo Jimenez Rezende
S. Mohamed
Ivo Danihelka
Karol Gregor
Daan Wierstra
BDL
VLM
DRL
LRM
255
260
0
16 Mar 2016
Image Captioning with Semantic Attention
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
383
1,761
0
12 Mar 2016
Neural Discourse Relation Recognition with Semantic Memory
Biao Zhang
Deyi Xiong
Jinsong Su
87
17
0
12 Mar 2016
Previous
1
2
3
...
68
69
70
71
72
Next