Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1502.03044
Cited By

Show, Attend and Tell: Neural Image Caption Generation with Visual
Attention

v1v2v3 (latest)

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Ruslan Salakhutdinov

ArXiv (abs)PDF HTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,580 papers shown

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

Chen-Yu Lee

220

484

0

09 Mar 2016

Image Captioning and Visual Question Answering Based on Attributes and
External Knowledge

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

Qi Wu

Chunhua Shen

Anton Van Den Hengel

Peng Wang

231

374

0

09 Mar 2016

Dynamic Memory Networks for Visual and Textual Question Answering

Dynamic Memory Networks for Visual and Textual Question Answering

235

766

0

04 Mar 2016

Noisy Activation Functions

Noisy Activation Functions

Çağlar Gülçehre

Marcin Moczulski

263

301

0

01 Mar 2016

Recurrent Neural Network Grammars

Recurrent Neural Network Grammars

Miguel Ballesteros

325

539

0

25 Feb 2016

Learning to Generate with Memory

Learning to Generate with Memory

Jun Zhu

233

44

0

24 Feb 2016

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense
Image Annotations

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

...

Yannis Kalantidis

David A. Shamma

Michael S. Bernstein

Fei-Fei Li

2.0K

6,245

0

23 Feb 2016

Contextual LSTM (CLSTM) models for Large scale NLP tasks

Contextual LSTM (CLSTM) models for Large scale NLP tasks

170

218

0

19 Feb 2016

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro

Carlos Guestrin

2.5K

19,924

0

16 Feb 2016

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

Jimmy S. J. Ren

167

111

0

13 Feb 2016

Global Deconvolutional Networks for Semantic Segmentation

Global Deconvolutional Networks for Semantic Segmentation

Vladimir Nekrasov

140

12

0

12 Feb 2016

Attentive Pooling Networks

Attentive Pooling Networks

Cicero Nogueira dos Santos

202

354

0

11 Feb 2016

A Convolutional Attention Network for Extreme Summarization of Source
Code

A Convolutional Attention Network for Extreme Summarization of Source Code

Miltiadis Allamanis

Hao Peng

291

609

0

09 Feb 2016

Value Iteration Networks

Value Iteration Networks

Pieter Abbeel

440

676

0

09 Feb 2016

Predicting Clinical Events by Combining Static and Dynamic Information
Using Recurrent Neural Networks

Predicting Clinical Events by Combining Static and Dynamic Information Using Recurrent Neural Networks

Cristóbal Esteban

220

163

0

08 Feb 2016

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label
Classification

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification

André F. T. Martins

Ramón Fernández Astudillo

522

805

0

05 Feb 2016

Long-term Planning by Short-term Prediction

Long-term Planning by Short-term Prediction

Shai Shalev-Shwartz

124

62

0

04 Feb 2016

Survey on the attention based RNN model and its applications in computer
vision

Survey on the attention based RNN model and its applications in computer vision

139

128

0

25 Jan 2016

Modeling Coverage for Neural Machine Translation

Modeling Coverage for Neural Machine Translation

233

762

0

19 Jan 2016

Multimodal Pivots for Image Caption Translation

Multimodal Pivots for Image Caption Translation

Julian Hitschler

Shigehiko Schamoni

333

100

0

15 Jan 2016

Automatic Description Generation from Images: A Survey of Models,
Datasets, and Evaluation Measures

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

Raffaella Bernardi

Desmond Elliott

Nazli Ikizler-Cinbis

226

378

0

15 Jan 2016

Implicit Distortion and Fertility Models for Attention-based
Encoder-Decoder NMT Model

Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model

225

46

0

13 Jan 2016

Learning to Compose Neural Networks for Question Answering

Learning to Compose Neural Networks for Question Answering

Marcus Rohrbach

NAI KELM BDL CoGe

461

578

0

07 Jan 2016

Language to Logical Form with Neural Attention

Language to Logical Form with Neural Attention

334

761

0

06 Jan 2016

Incorporating Structural Alignment Biases into an Attentional Neural
Translation Model

Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

Cong Duy Vu Hoang

Ekaterina Vymolova

Gholamreza Haffari

217

175

0

06 Jan 2016

Mutual Information and Diverse Decoding Improve Neural Machine
Translation

Mutual Information and Diverse Decoding Improve Neural Machine Translation

Jiwei Li

Dan Jurafsky

167

126

0

04 Jan 2016

Write a Classifier: Predicting Visual Classifiers from Unstructured Text

Write a Classifier: Predicting Visual Classifiers from Unstructured Text

Mohamed Elhoseiny

368

41

0

31 Dec 2015

Feed-Forward Networks with Attention Can Solve Some Long-Term Memory
Problems

Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems

268

316

0

29 Dec 2015

Learning Transferrable Knowledge for Semantic Segmentation with Deep
Convolutional Neural Network

Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network

260

174

0

24 Dec 2015

A Planning based Framework for Essay Generation

A Planning based Framework for Essay Generation

106

5

0

18 Dec 2015

ABCNN: Attention-Based Convolutional Neural Network for Modeling
Sentence Pairs

ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

Hinrich Schütze

377

963

0

16 Dec 2015

Agreement-based Joint Training for Bidirectional Attention-based Neural
Machine Translation

Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation

Maosong Sun

Yang Liu

262

74

0

15 Dec 2015

Distilling Knowledge from Deep Networks with Applications to Healthcare
Domain

Distilling Knowledge from Deep Networks with Applications to Healthcare Domain

186

143

0

11 Dec 2015

Explaining NonLinear Classification Decisions with Deep Taylor
Decomposition

Explaining NonLinear Classification Decisions with Deep Taylor Decomposition

Sebastian Lapuschkin

Alexander Binder

Klaus-Robert Muller

257

823

0

08 Dec 2015

Thinking Required

Thinking Required

103

0

0

07 Dec 2015

Deep Attention Recurrent Q-Network

Deep Attention Recurrent Q-Network

Alexey Seleznev

Anastasiia Ignateva

198

167

0

05 Dec 2015

Attribute2Image: Conditional Image Generation from Visual Attributes

Attribute2Image: Conditional Image Generation from Visual Attributes

329

795

0

02 Dec 2015

A C-LSTM Neural Network for Text Classification

A C-LSTM Neural Network for Text Classification

275

914

0

27 Nov 2015

Recurrent Instance Segmentation

Recurrent Instance Segmentation

Bernardino Romera-Paredes

319

334

0

25 Nov 2015

Towards Universal Paraphrastic Sentence Embeddings

Towards Universal Paraphrastic Sentence Embeddings

Joey Tianyi Zhou

388

565

0

25 Nov 2015

Learning with Memory Embeddings

Learning with Memory Embeddings

Cristóbal Esteban

545

32

0

25 Nov 2015

Natural Language Understanding with Distributed Representation

Natural Language Understanding with Distributed Representation

203

55

0

24 Nov 2015

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

Li Fei-Fei

379

1,218

0

24 Nov 2015

Where To Look: Focus Regions for Visual Question Answering

Where To Look: Focus Regions for Visual Question Answering

278

477

0

23 Nov 2015

ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation

ReSeg: A Recurrent Neural Network-based Model for Semantic Segmentation

Francesco Visin

Matteo Matteucci

Aaron Courville

345

262

0

22 Nov 2015

End-to-end Learning of Action Detection from Frame Glimpses in Videos

End-to-end Learning of Action Detection from Frame Glimpses in Videos

Olga Russakovsky

Li Fei-Fei

422

622

0

22 Nov 2015

Sequence Level Training with Recurrent Neural Networks

Sequence Level Training with Recurrent Neural Networks

MarcÁurelio Ranzato

Wojciech Zaremba

758

1,723

0

20 Nov 2015

First Step toward Model-Free, Anonymous Object Tracking with Recurrent
Neural Networks

First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks

Qipeng Guo

199

52

0

19 Nov 2015

Feature-based Attention in Convolutional Neural Networks

Feature-based Attention in Convolutional Neural Networks

Grace W. Lindsay

147

17

0

19 Nov 2015

Recurrent Models for Auditory Attention in Multi-Microphone Distance
Speech Recognition

Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition

102

26

0

19 Nov 2015

1 2 3...69 70 71 72

Page 70 of 72

Pageof 72