ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03044
  4. Cited By
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
    DiffM
ArXivPDFHTML

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,508 papers shown
Title
"Show me the cup": Reference with Continuous Representations
"Show me the cup": Reference with Continuous Representations
Gemma Boleda
Sebastian Padó
Marco Baroni
10
3
0
28 Jun 2016
Diversified Visual Attention Networks for Fine-Grained Object
  Classification
Diversified Visual Attention Networks for Fine-Grained Object Classification
Bo Zhao
Xiao-Jun Wu
Jiashi Feng
Qiang Peng
Shuicheng Yan
14
365
0
28 Jun 2016
Sequence-Level Knowledge Distillation
Sequence-Level Knowledge Distillation
Yoon Kim
Alexander M. Rush
27
1,097
0
25 Jun 2016
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation
  Tasks
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks
Jindrich Libovický
Jindřich Helcl
Marek Tlustý
Pavel Pecina
Ondrej Bojar
6
67
0
23 Jun 2016
LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in
  Recurrent Neural Networks
LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks
Hendrik Strobelt
Sebastian Gehrmann
Hanspeter Pfister
Alexander M. Rush
HAI
26
83
0
23 Jun 2016
Tagger: Deep Unsupervised Perceptual Grouping
Tagger: Deep Unsupervised Perceptual Grouping
Klaus Greff
Antti Rasmus
Mathias Berglund
T. Hao
Jürgen Schmidhuber
Harri Valpola
OCL
16
161
0
21 Jun 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise
  Questions
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
Arijit Ray
Gordon A. Christie
Mohit Bansal
Dhruv Batra
Devi Parikh
16
56
0
21 Jun 2016
Drawing and Recognizing Chinese Characters with Recurrent Neural Network
Drawing and Recognizing Chinese Characters with Recurrent Neural Network
Xu-Yao Zhang
Fei Yin
Yanming Zhang
Cheng-Lin Liu
Yoshua Bengio
42
320
0
21 Jun 2016
Using Visual Analytics to Interpret Predictive Machine Learning Models
Using Visual Analytics to Interpret Predictive Machine Learning Models
Josua Krause
Adam Perer
E. Bertini
HAI
36
65
0
17 Jun 2016
FVQA: Fact-based Visual Question Answering
FVQA: Fact-based Visual Question Answering
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
33
453
0
17 Jun 2016
Model-Agnostic Interpretability of Machine Learning
Model-Agnostic Interpretability of Machine Learning
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt
FaML
24
827
0
16 Jun 2016
A Correlational Encoder Decoder Architecture for Pivot Based Sequence
  Generation
A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation
Amrita Saha
Mitesh M. Khapra
A. Chandar
Janarthanan Rajendran
Kyunghyun Cho
17
18
0
15 Jun 2016
Unsupervised Learning of Predictors from Unpaired Input-Output Samples
Unsupervised Learning of Predictors from Unpaired Input-Output Samples
Jianshu Chen
Po-Sen Huang
Xiaodong He
Jianfeng Gao
Li Deng
OOD
SSL
16
8
0
15 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
28
60
0
15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional
  Attention
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
11
44
0
15 Jun 2016
End-to-End Comparative Attention Networks for Person Re-identification
End-to-End Comparative Attention Networks for Person Re-identification
Hao Liu
Jiashi Feng
Meibin Qi
Jianguo Jiang
Shuicheng Yan
17
575
0
14 Jun 2016
Rationalizing Neural Predictions
Rationalizing Neural Predictions
Tao Lei
Regina Barzilay
Tommi Jaakkola
31
804
0
13 Jun 2016
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Training Recurrent Answering Units with Joint Loss Minimization for VQA
Hyeonwoo Noh
Bohyung Han
24
71
0
12 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and
  Delexicalized Data
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
8
29
0
11 Jun 2016
Human Attention in Visual Question Answering: Do Humans and Deep
  Networks Look at the Same Regions?
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das
Harsh Agrawal
C. L. Zitnick
Devi Parikh
Dhruv Batra
30
466
0
11 Jun 2016
Conditional Generation and Snapshot Learning in Neural Dialogue Systems
Conditional Generation and Snapshot Learning in Neural Dialogue Systems
Tsung-Hsien Wen
Milica Gasic
N. Mrksic
L. Rojas-Barahona
Pei-hao Su
Stefan Ultes
David Vandyke
S. Young
12
78
0
10 Jun 2016
Sequence-to-Sequence Learning as Beam-Search Optimization
Sequence-to-Sequence Learning as Beam-Search Optimization
Sam Wiseman
Alexander M. Rush
21
589
0
09 Jun 2016
Progressive Attention Networks for Visual Attribute Prediction
Progressive Attention Networks for Visual Attribute Prediction
Paul Hongsuck Seo
Zhe-nan Lin
Scott D. Cohen
Xiaohui Shen
Bohyung Han
8
41
0
08 Jun 2016
SE3-Nets: Learning Rigid Body Motion using Deep Neural Networks
SE3-Nets: Learning Rigid Body Motion using Deep Neural Networks
Arunkumar Byravan
D. Fox
3DPC
11
267
0
08 Jun 2016
Iterative Alternating Neural Attention for Machine Reading
Iterative Alternating Neural Attention for Machine Reading
Alessandro Sordoni
Philip Bachman
Adam Trischler
Yoshua Bengio
CLL
AIMat
19
118
0
07 Jun 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
149
1,465
0
06 Jun 2016
Attention Correctness in Neural Image Captioning
Attention Correctness in Neural Image Captioning
Chenxi Liu
Junhua Mao
Fei Sha
Alan Yuille
3DV
30
220
0
31 May 2016
End-to-End Instance Segmentation with Recurrent Attention
End-to-End Instance Segmentation with Recurrent Attention
Mengye Ren
R. Zemel
SSeg
22
61
0
30 May 2016
Does Multimodality Help Human and Machine for Translation and Image
  Captioning?
Does Multimodality Help Human and Machine for Translation and Image Captioning?
Ozan Caglayan
Walid Aransa
Yaxing Wang
Marc Masana
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Joost van de Weijer
20
85
0
30 May 2016
Video Summarization with Long Short-term Memory
Video Summarization with Long Short-term Memory
Ke Zhang
Wei-Lun Chao
Fei Sha
Kristen Grauman
27
682
0
26 May 2016
Review Networks for Caption Generation
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
24
85
0
25 May 2016
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for
  Learning Bilingual Phrase Embeddings
BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings
Biao Zhang
Deyi Xiong
Jinsong Su
6
20
0
25 May 2016
Localizing by Describing: Attribute-Guided Attention Localization for
  Fine-Grained Recognition
Localizing by Describing: Attribute-Guided Attention Localization for Fine-Grained Recognition
Xiao-Chang Liu
Jiang Wang
Shilei Wen
Errui Ding
Yuanqing Lin
6
76
0
20 May 2016
Generative Adversarial Text to Image Synthesis
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
17
3,124
0
17 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
170
840
0
17 May 2016
Movie Description
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
30
353
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
11
101
0
09 May 2016
Chained Predictions Using Convolutional Neural Networks
Chained Predictions Using Convolutional Neural Networks
Georgia Gkioxari
Alexander Toshev
Navdeep Jaitly
BDL
16
190
0
08 May 2016
DeepPicker: a Deep Learning Approach for Fully Automated Particle
  Picking in Cryo-EM
DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM
Feng Wang
Huichao Gong
Gaochao liu
Meijing Li
Chuangye Yan
Tian Xia
Xueming Li
Jianyang Zeng
22
168
0
06 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
8
83
0
04 May 2016
Multi30K: Multilingual English-German Image Descriptions
Multi30K: Multilingual English-German Image Descriptions
Desmond Elliott
Stella Frank
K. Simaán
Lucia Specia
VLM
22
579
0
02 May 2016
Look-ahead before you leap: end-to-end active recognition by forecasting
  the effect of motion
Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion
Dinesh Jayaraman
Kristen Grauman
17
90
0
30 Apr 2016
Joint Line Segmentation and Transcription for End-to-End Handwritten
  Paragraph Recognition
Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
Théodore Bluche
AI4TS
18
189
0
28 Apr 2016
Dialog-based Language Learning
Dialog-based Language Learning
Jason Weston
LLMAG
11
108
0
20 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length
  Image Tagging
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DV
VLM
16
69
0
18 Apr 2016
Parallelizing Word2Vec in Shared and Distributed Memory
Parallelizing Word2Vec in Shared and Distributed Memory
Shihao Ji
N. Satish
Sheng R. Li
Pradeep Dubey
VLM
MoE
14
72
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar A. Sigurdsson
Xinlei Chen
Abhinav Gupta
18
38
0
14 Apr 2016
Filling in the details: Perceiving from low fidelity images
Filling in the details: Perceiving from low fidelity images
F. Wick
Michael L. Wick
M. Pomplun
3DH
6
1
0
14 Apr 2016
Visual Storytelling
Visual Storytelling
Ting-Hao 'Kenneth' Huang
Huang
Francis Ferraro
N. Mostafazadeh
Ishan Misra
...
C. L. Zitnick
Devi Parikh
Lucy Vanderwende
Michel Galley
Margaret Mitchell
VGen
11
464
0
13 Apr 2016
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with
  MDLSTM Attention
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
Théodore Bluche
J. Louradour
Ronaldo O. Messina
VLM
11
170
0
12 Apr 2016
Previous
123...666768697071
Next