Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

10 February 2015

Jimmy Ba

Aaron Courville

Papers citing "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"

50 / 3,583 papers shown

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

Wei Liu

485

1,793

17 Nov 2016

Instance-aware Image and Sentence Matching with Selective Multimodal LSTM

Yan Huang

Wei Wang

Liang Wang

220

229

17 Nov 2016

DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows

209

17 Nov 2016

Semantic Regularisation for Recurrent Image Annotation

Feng Liu

Tao Xiang

Timothy M. Hospedales

Wankou Yang

Changyin Sun

174

108

16 Nov 2016

A Semi-supervised Framework for Image Captioning

Wenhu Chen

Aurelien Lucchi

Thomas Hofmann

218

16 Nov 2016

The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

Jordan L. Boyd-Graber

Hal Daumé

L. Davis

210

113

16 Nov 2016

Diversity encouraged learning of unsupervised LSTM ensemble for neural activity video prediction

106

15 Nov 2016

Hierarchical Object Detection with Deep Reinforcement Learning

160

110

11 Nov 2016

Getting Started with Neural Models for Semantic Matching in Web Search

Maarten de Rijke

155

08 Nov 2016

Memory-augmented Attention Modelling for Videos

274

07 Nov 2016

Latent Attention For If-Then Program Synthesis

124

07 Nov 2016

Hierarchical Question Answering for Long Documents

282

170

06 Nov 2016

Boosting Image Captioning with Attributes

Yingwei Pan

Tao Mei

303

650

05 Nov 2016

Categorical Reparameterization with Gumbel-Softmax

1.1K

5,977

03 Nov 2016

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

728

2,726

02 Nov 2016

Dual Attention Networks for Multimodal Reasoning and Matching

Hyeonseob Nam

Jung-Woo Ha

Jeonghee Kim

237

703

02 Nov 2016

Phased LSTM: Accelerating Recurrent Network Training for Long or Event-based Sequences

223

477

29 Oct 2016

Professor Forcing: A New Algorithm for Training Recurrent Networks

Aaron Courville

327

649

27 Oct 2016

Cross-Modal Scene Networks

Carl Vondrick

Antonio Torralba

180

117

27 Oct 2016

Can Active Memory Replace Attention?

Lukasz Kaiser

Samy Bengio

174

27 Oct 2016

Jointly Learning to Align and Convert Graphemes to Phonemes with Neural Attention Models

Shubham Toshniwal

Karen Livescu

128

20 Oct 2016

Lexicon Integrated CNN Models with Attention for Sentiment Analysis

Bonggun Shin

Timothy Lee

Jinho Choi

176

117

20 Oct 2016

Using Fast Weights to Attend to the Recent Past

Jimmy Ba

297

303

20 Oct 2016

Learning Robust Video Synchronization without Annotations

P. Wieschollek

Ido Freeman

Hendrik P. A. Lensch

232

19 Oct 2016

Spatio-Temporal Attention Models for Grounded Video Captioning

M. Zanfir

Elisabeta Marinoiu

C. Sminchisescu

231

17 Oct 2016

Recurrent 3D Attentional Networks for End-to-End Active Object Recognition

Computational Visual Media (CVM), 2016

Kai Xu

198

14 Oct 2016

Video Fill in the Blank with Merging LSTMs

Amir Mazaheri

Dong Zhang

M. Shah

144

13 Oct 2016

Generating captions without looking beyond objects

Hendrik Heuer

Christof Monz

A. Smeulders

12 Oct 2016

Attention and Anticipation in Fast Visual-Inertial Navigation

IEEE International Conference on Robotics and Automation (ICRA), 2016

Luca Carlone

S. Karaman

178

11 Oct 2016

Latent Sequence Decompositions

International Conference on Learning Representations (ICLR), 2016

355

10 Oct 2016

End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering

Computer Vision and Pattern Recognition (CVPR), 2016

410

239

10 Oct 2016

Understanding intermediate layers using linear classifier probes

International Conference on Learning Representations (ICLR), 2016

Guillaume Alain

Yoshua Bengio

FAtt

563

1,187

05 Oct 2016

Visual Question Answering: Datasets, Algorithms, and Future Challenges

Computer Vision and Image Understanding (CVIU), 2016

Kushal Kafle

Christopher Kanan

OOD

267

258

05 Oct 2016

A Survey of Multi-View Representation Learning

654

587

03 Oct 2016

Controlling Output Length in Neural Encoder-Decoders

Graham Neubig

228

251

30 Sep 2016

Variational Autoencoder for Deep Learning of Images, Labels and Captions

Lawrence Carin

200

815

28 Sep 2016

Character Sequence Models for ColorfulWords

28 Sep 2016

Learning Language-Visual Embedding for Movie Understanding with Natural-Language

Atousa Torabi

Niket Tandon

Leonid Sigal

162

106

26 Sep 2016

Visual Fashion-Product Search at SK Planet

253

26 Sep 2016

Language as a Latent Variable: Discrete Generative Models for Sentence Compression

Yishu Miao

Phil Blunsom

502

225

23 Sep 2016

The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)

Andrew Shin

Yoshitaka Ushiku

Tatsuya Harada

171

21 Sep 2016

Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge

256

902

21 Sep 2016

Enhanced LSTM for Natural Language Inference

Qian Chen

532

1,173

20 Sep 2016

Image-to-Markup Generation with Coarse-to-Fine Attention

243

260

16 Sep 2016

Predicting Shot Making in Basketball Learnt from Adversarial Multiagent Trajectories

299

15 Sep 2016

Multimodal Attention for Neural Machine Translation

Ozan Caglayan

Loïc Barrault

Fethi Bougares

156

13 Sep 2016

Read, Tag, and Parse All at Once, or Fully-neural Dependency Parsing

J. Chorowski

Michal Zapotoczny

Paweł Rychlikowski

180

12 Sep 2016

The Role of Context Selection in Object Detection

125

09 Sep 2016

Optimizing Recurrent Neural Networks Architectures under Time Constraints

248

29 Aug 2016

A Boundary Tilting Persepective on the Phenomenon of Adversarial Examples

T. Tanay

Lewis D. Griffin

AAML

240

282

27 Aug 2016