arXiv:1502.03044

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Ruslan Salakhutdinov

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,580 papers shown

Does Multimodality Help Human and Machine for Translation and Image Captioning?

Mercedes García-Martínez

Joost van de Weijer

205

87

0

30 May 2016

Video Summarization with Long Short-term Memory

Wei-Lun Chao

Kristen Grauman

253

747

0

26 May 2016

Review Networks for Caption Generation

Ruslan Salakhutdinov

William W. Cohen

276

87

0

25 May 2016

BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

74

20

0

25 May 2016

Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition

Errui Ding

150

79

0

20 May 2016

Generative Adversarial Text to Image Synthesis

Lajanugen Logeswaran

Bernt Schiele

465

3,338

0

17 May 2016

Learning Deep Representations of Fine-grained Visual Descriptions

Bernt Schiele

425

891

0

17 May 2016

Movie Description

Marcus Rohrbach

Hugo Larochelle

Aaron Courville

Bernt Schiele

267

387

0

12 May 2016

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering

Mateusz Malinowski

Marcus Rohrbach

Mario Fritz

247

104

0

09 May 2016

Chained Predictions Using Convolutional Neural Networks

Georgia Gkioxari

Alexander Toshev

226

195

0

08 May 2016

DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM

123

181

0

06 May 2016

Leveraging Visual Question Answering for Image-Caption Ranking

Devi Parikh

265

88

0

04 May 2016

Multi30K: Multilingual English-German Image Descriptions

Desmond Elliott

281

629

0

02 May 2016

Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion

Dinesh Jayaraman

Kristen Grauman

273

95

0

30 Apr 2016

Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

Théodore Bluche

247

201

0

28 Apr 2016

Dialog-based Language Learning

Jason Weston

412

110

0

20 Apr 2016

Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging

Hideki Nakayama

205

72

0

18 Apr 2016

Parallelizing Word2Vec in Shared and Distributed Memory

211

72

0

15 Apr 2016

Learning Visual Storylines with Skipping Recurrent Neural Networks

Gunnar Sigurdsson

150

39

0

14 Apr 2016

Filling in the details: Perceiving from low fidelity images

Michael L. Wick

53

1

0

14 Apr 2016

Visual Storytelling

Ting-Hao 'Kenneth' Huang

Francis Ferraro

N. Mostafazadeh

...

Devi Parikh

Lucy Vanderwende

Margaret Mitchell

209

525

0

13 Apr 2016

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

Théodore Bluche

Ronaldo O. Messina

176

183

0

12 Apr 2016

TGIF: A New Dataset and Benchmark on Animated GIF Description

Joel R. Tetreault

199

295

0

10 Apr 2016

Optimizing Performance of Recurrent Neural Networks on GPUs

Tomás Kociský

139

93

0

07 Apr 2016

Advances in Very Deep Convolutional Neural Networks for LVCSR

204

44

0

06 Apr 2016

Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition

Jie Zhou

276

29

0

06 Apr 2016

Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project

Guntis Barzdins

68

6

0

05 Apr 2016

Image Captioning with Deep Bidirectional LSTMs

Cheng Wang

Christian Bartz

Christoph Meinel

212

294

0

04 Apr 2016

Character-Level Question Answering with Attention

294

189

0

04 Apr 2016

Reasoning About Pragmatics with Neural Listeners and Speakers

199

185

0

02 Apr 2016

Automatic Annotation of Structured Facts in Images

Mohamed Elhoseiny

171

9

0

02 Apr 2016

AttSum: Joint Learning of Focusing and Summarization with Neural Attention

Sujian Li

266

119

0

01 Apr 2016

Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection

Sheng-syun Shen

172

69

0

31 Mar 2016

Minimal Gated Unit for Recurrent Neural Networks

Chen-Da Liu-Zhang

185

357

0

31 Mar 2016

Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning

Masataka Yamaguchi

Katsunori Ohnishi

136

8

0

30 Mar 2016

Recurrent Batch Normalization

Çağlar Gülçehre

Aaron Courville

625

414

0

30 Mar 2016

Rich Image Captioning in the Wild

Lei Zhang

Cornelia Carapcea

Chris Sienkiewicz

146

127

0

30 Mar 2016

Generating Visual Explanations

Lisa Anne Hendricks

Marcus Rohrbach

Bernt Schiele

277

646

0

28 Mar 2016

Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation

Dina Demner-Fushman

Ronald M. Summers

128

382

0

28 Mar 2016

Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention

118

31

0

28 Mar 2016

Recurrent Mixture Density Network for Spatiotemporal Visual Attention

Hugo Larochelle

Lorenzo Torresani

322

139

0

27 Mar 2016

Neural Text Generation from Structured Data with Application to the Biography Domain

253

49

0

24 Mar 2016

Attentive Contexts for Object Detection

Xiaodan Liang

128

230

0

24 Mar 2016

BreakingNews: Article Annotation by Image and Text Processing

Francesc Moreno-Noguer

211

113

0

23 Mar 2016

Semantic Object Parsing with Graph LSTM

Xiaodan Liang

327

364

0

23 Mar 2016

Deep Learning in Bioinformatics

368

1,433

0

21 Mar 2016

Segmentation from Natural Language Expressions

Marcus Rohrbach

266

506

0

20 Mar 2016

One-Shot Generalization in Deep Generative Models

Danilo Jimenez Rezende

Ivo Danihelka

255

260

0

16 Mar 2016

Image Captioning with Semantic Attention

383

1,761

0

12 Mar 2016

Neural Discourse Relation Recognition with Semantic Memory

87

17

0

12 Mar 2016
