ResearchTrend.AI
arXiv: 1502.03044
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Kelvin Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,508 papers shown
Title
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
11
269
0
10 Apr 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
9
91
0
07 Apr 2016
Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu
Vaibhava Goel
9
44
0
06 Apr 2016
Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition
Ziyan Wang
Jiwen Lu
Ruogu Lin
Jianjiang Feng
Jie Zhou
21
29
0
06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project
Guntis Barzdins
Steve Renals
D. Gosko
13
5
0
05 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
10
278
0
04 Apr 2016
Character-Level Question Answering with Attention
David Golub
Xiaodong He
19
184
0
04 Apr 2016
Reasoning About Pragmatics with Neural Listeners and Speakers
Jacob Andreas
Dan Klein
ReLM
LRM
18
173
0
02 Apr 2016
Automatic Annotation of Structured Facts in Images
Mohamed Elhoseiny
Scott D. Cohen
W. Chang
Brian L. Price
Ahmed Elgammal
10
9
0
02 Apr 2016
AttSum: Joint Learning of Focusing and Summarization with Neural Attention
Ziqiang Cao
Wenjie Li
Sujian Li
Furu Wei
Yanran Li
16
115
0
01 Apr 2016
Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection
Sheng-syun Shen
Hung-yi Lee
9
65
0
31 Mar 2016
Minimal Gated Unit for Recurrent Neural Networks
Guo-Bing Zhou
Jianxin Wu
Chen-Lin Zhang
Zhi-Hua Zhou
20
325
0
31 Mar 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
42
8
0
30 Mar 2016
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
11
409
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
17
123
0
30 Mar 2016
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLM
FAtt
22
618
0
28 Mar 2016
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation
Hoo-Chang Shin
Kirk Roberts
Le Lu
Dina Demner-Fushman
Jianhua Yao
Ronald M. Summers
11
347
0
28 Mar 2016
Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention
Linlin Chao
J. Tao
Minghao Yang
Ya Li
Zhengqi Wen
6
30
0
28 Mar 2016
Recurrent Mixture Density Network for Spatiotemporal Visual Attention
Loris Bazzani
Hugo Larochelle
Lorenzo Torresani
11
134
0
27 Mar 2016
Neural Text Generation from Structured Data with Application to the Biography Domain
R. Lebret
David Grangier
Michael Auli
14
45
0
24 Mar 2016
Attentive Contexts for Object Detection
Jianan Li
Yunchao Wei
Xiaodan Liang
Jian Dong
Tingfa Xu
Jiashi Feng
Shuicheng Yan
ObjD
12
221
0
24 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
21
105
0
23 Mar 2016
Semantic Object Parsing with Graph LSTM
Xiaodan Liang
Xiaohui Shen
Jiashi Feng
Liang Lin
Shuicheng Yan
19
354
0
23 Mar 2016
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
22
1,350
0
21 Mar 2016
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
19
426
0
20 Mar 2016
One-Shot Generalization in Deep Generative Models
Danilo Jimenez Rezende
S. Mohamed
Ivo Danihelka
Karol Gregor
Daan Wierstra
BDL
VLM
DRL
LRM
24
254
0
16 Mar 2016
Image Captioning with Semantic Attention
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
9
1,651
0
12 Mar 2016
Neural Discourse Relation Recognition with Semantic Memory
Biao Zhang
Deyi Xiong
Jinsong Su
25
16
0
12 Mar 2016
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
Chen-Yu Lee
Simon Osindero
VLM
21
458
0
09 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
A. Hengel
Peng Wang
A. Dick
11
360
0
09 Mar 2016
Dynamic Memory Networks for Visual and Textual Question Answering
Caiming Xiong
Stephen Merity
R. Socher
18
753
0
04 Mar 2016
Noisy Activation Functions
Çağlar Gülçehre
Marcin Moczulski
Misha Denil
Yoshua Bengio
9
281
0
01 Mar 2016
Recurrent Neural Network Grammars
Chris Dyer
A. Kuncoro
Miguel Ballesteros
Noah A. Smith
GNN
11
524
0
25 Feb 2016
Learning to Generate with Memory
Chongxuan Li
Jun Zhu
Bo Zhang
BDL
8
42
0
24 Feb 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
14
5,644
0
23 Feb 2016
Contextual LSTM (CLSTM) models for Large scale NLP tasks
Shalini Ghosh
Oriol Vinyals
B. Strope
Scott Roy
Tom Dean
Larry Heck
14
213
0
19 Feb 2016
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt
FaML
17
16,593
0
16 Feb 2016
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification
Jimmy S. J. Ren
Yongtao Hu
Yu-Wing Tai
Chuan Wang
Li Xu
Wenxiu Sun
Qiong Yan
19
108
0
13 Feb 2016
Global Deconvolutional Networks for Semantic Segmentation
Vladimir Nekrasov
Janghoon Ju
Jaesik Choi
SSeg
23
12
0
12 Feb 2016
Attentive Pooling Networks
Cicero Nogueira dos Santos
Ming Tan
Bing Xiang
Bowen Zhou
18
346
0
11 Feb 2016
A Convolutional Attention Network for Extreme Summarization of Source Code
Miltiadis Allamanis
Hao Peng
Charles Sutton
AI4TS
19
580
0
09 Feb 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
16
647
0
09 Feb 2016
Predicting Clinical Events by Combining Static and Dynamic Information Using Recurrent Neural Networks
Cristóbal Esteban
O. Staeck
Yinchong Yang
Volker Tresp
9
154
0
08 Feb 2016
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification
André F. T. Martins
Ramón Fernández Astudillo
8
701
0
05 Feb 2016
Long-term Planning by Short-term Prediction
Shai Shalev-Shwartz
Nir Ben-Zrihem
Aviad Cohen
Amnon Shashua
4
61
0
04 Feb 2016
Survey on the attention based RNN model and its applications in computer vision
Feng Wang
David Tax
AI4TS
AIMat
19
113
0
25 Jan 2016
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu
Zhengdong Lu
Yang Liu
Xiaohua Liu
Hang Li
10
746
0
19 Jan 2016
Multimodal Pivots for Image Caption Translation
Julian Hitschler
Shigehiko Schamoni
Stefan Riezler
17
97
0
15 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
8
363
0
15 Jan 2016
Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model
Shi Feng
Shujie Liu
Mu Li
M. Zhou
19
44
0
13 Jan 2016