ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Dong Wang
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXiv (abs)PDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,580 papers shown
Does Multimodality Help Human and Machine for Translation and Image
  Captioning?
Does Multimodality Help Human and Machine for Translation and Image Captioning?
Ozan Caglayan
Walid Aransa
Yaxing Wang
Marc Masana
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Joost van de Weijer
205
87
0
30 May 2016
Video Summarization with Long Short-term Memory
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
253
747
0
26 May 2016
Review Networks for Caption Generation
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
276
87
0
25 May 2016
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for
  Learning Bilingual Phrase Embeddings
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings
Biao Zhang
Deyi Xiong
Jinsong Su
74
20
0
25 May 2016
Localizing by Describing: Attribute-Guided Attention Localization for
  Fine-Grained Recognition
Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition
Xiao-Chang Liu
Jiang Wang
Shilei Wen
Errui Ding
Yuanqing Lin
150
79
0
20 May 2016
Generative Adversarial Text to Image Synthesis
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
465
3,338
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCLVLM
425
891
0
17 May 2016
Movie Description
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DVVGen
267
387
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
247
104
0
09 May 2016
Chained Predictions Using Convolutional Neural Networks
Chained Predictions Using Convolutional Neural Networks
Georgia Gkioxari
Alexander Toshev
Navdeep Jaitly
BDL
226
195
0
08 May 2016
DeepPicker: a Deep Learning Approach for Fully Automated Particle
  Picking in Cryo-EM
DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM
Feng Wang
Huichao Gong
Gaochao liu
Meijing Li
Chuangye Yan
Tian Xia
Xueming Li
Jianyang Zeng
123
181
0
06 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
265
88
0
04 May 2016
Multi30K: Multilingual English-German Image Descriptions
Multi30K: Multilingual English-German Image Descriptions
Desmond Elliott
Stella Frank
K. Simaán
Lucia Specia
VLM
281
629
0
02 May 2016
Look-ahead before you leap: end-to-end active recognition by forecasting
  the effect of motion
Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion
Dinesh Jayaraman
Kristen Grauman
273
95
0
30 Apr 2016
Joint Line Segmentation and Transcription for End-to-End Handwritten
  Paragraph Recognition
Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
Théodore Bluche
AI4TS
247
201
0
28 Apr 2016
Dialog-based Language Learning
Dialog-based Language Learning
Jason Weston
LLMAG
412
110
0
20 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length
  Image Tagging
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DVVLM
205
72
0
18 Apr 2016
Parallelizing Word2Vec in Shared and Distributed Memory
Parallelizing Word2Vec in Shared and Distributed Memory
Shihao Ji
N. Satish
Sheng Li
Pradeep Dubey
VLMMoE
211
72
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar Sigurdsson
Xinlei Chen
Abhinav Gupta
150
39
0
14 Apr 2016
Filling in the details: Perceiving from low fidelity images
Filling in the details: Perceiving from low fidelity images
F. Wick
Michael L. Wick
M. Pomplun
3DH
53
1
0
14 Apr 2016
Visual Storytelling
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
209
525
0
13 Apr 2016
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with
  MDLSTM Attention
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
Théodore Bluche
J. Louradour
Ronaldo O. Messina
VLM
176
183
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
199
295
0
10 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
139
93
0
07 Apr 2016
Advances in Very Deep Convolutional Neural Networks for LVCSR
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu
Vaibhava Goel
204
44
0
06 Apr 2016
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object
  Recognition
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition
Ziyan Wang
Jiwen Lu
Ruogu Lin
Jianjiang Feng
Jie zhou
276
29
0
06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in
  the SUMMA Project
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Guntis Barzdins
Steve Renals
D. Gosko
68
6
0
05 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
212
294
0
04 Apr 2016
Character-Level Question Answering with Attention
Character-Level Question Answering with Attention
David Golub
Xiaodong He
294
189
0
04 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers
Reasoning About Pragmatics with Neural Listeners and Speakers
Jacob Andreas
Dan Klein
ReLMLRM
199
185
0
02 Apr 2016
Automatic Annotation of Structured Facts in Images
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
171
9
0
02 Apr 2016
AttSum: Joint Learning of Focusing and Summarization with Neural
  Attention
AttSum: Joint Learning of Focusing and Summarization with Neural Attention
Ziqiang Cao
Wenjie Li
Sujian Li
Furu Wei
Yanran Li
266
119
0
01 Apr 2016
Neural Attention Models for Sequence Classification: Analysis and
  Application to Key Term Extraction and Dialogue Act Detection
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen
Hung-yi Lee
172
69
0
31 Mar 2016
Minimal Gated Unit for Recurrent Neural Networks
Minimal Gated Unit for Recurrent Neural Networks
Guoxiang Zhou
Jianxin Wu
Chen-Da Liu-Zhang
Zhi Zhou
185
357
0
31 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for
  Locally Robust Captioning
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
136
8
0
30 Mar 2016
Recurrent Batch Normalization
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
625
414
0
30 Mar 2016
Rich Image Captioning in the Wild
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
146
127
0
30 Mar 2016
Generating Visual Explanations
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLMFAtt
277
646
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for
  Automated Image Annotation
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
128
382
0
28 Mar 2016
Audio Visual Emotion Recognition with Temporal Alignment and Perception
  Attention
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention
Linlin Chao
Jianhua Tao
Minghao Yang
Ya Li
Zhengqi Wen
118
31
0
28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Loris Bazzani
Hugo Larochelle
Lorenzo Torresani
322
139
0
27 Mar 2016
Neural Text Generation from Structured Data with Application to the
  Biography Domain
Neural Text Generation from Structured Data with Application to the Biography Domain
R. Lebret
David Grangier
Michael Auli
253
49
0
24 Mar 2016
Attentive Contexts for Object Detection
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
128
230
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
211
113
0
23 Mar 2016
Semantic Object Parsing with Graph LSTM
Semantic Object Parsing with Graph LSTM
Xiaodan Liang
Xiaohui Shen
Jiashi Feng
Liang Lin
Shuicheng Yan
327
364
0
23 Mar 2016
Deep Learning in Bioinformatics
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE3DV
368
1,433
0
21 Mar 2016
Segmentation from Natural Language Expressions
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLMEgoV
266
506
0
20 Mar 2016
One-Shot Generalization in Deep Generative Models
One-Shot Generalization in Deep Generative Models
Danilo Jimenez Rezende
S. Mohamed
Ivo Danihelka
Karol Gregor
Daan Wierstra
BDLVLMDRLLRM
255
260
0
16 Mar 2016
Image Captioning with Semantic Attention
Image Captioning with Semantic Attention
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
383
1,761
0
12 Mar 2016
Neural Discourse Relation Recognition with Semantic Memory
Neural Discourse Relation Recognition with Semantic Memory
Biao Zhang
Deyi Xiong
Jinsong Su
87
17
0
12 Mar 2016
Previous
123...6869707172
Next