Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.5726
Cited By
CIDEr: Consensus-based Image Description Evaluation
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,136 papers shown
Title
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
26
191
0
28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu
Zhongwen Xu
Yi Yang
24
76
0
28 Nov 2016
On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems
Besmira Nushi
Ece Kamar
Eric Horvitz
Donald Kossmann
34
77
0
24 Nov 2016
Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling
Zhe Gan
Chunyuan Li
Changyou Chen
Yunchen Pu
Qinliang Su
Lawrence Carin
BDL
UQCV
50
41
0
23 Nov 2016
Semantic Compositional Networks for Visual Captioning
Zhe Gan
Chuang Gan
Xiaodong He
Yunchen Pu
Kenneth Tran
Jianfeng Gao
Lawrence Carin
Li Deng
CoGe
42
425
0
23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text
Yunchen Pu
Martin Renqiang Min
Zhe Gan
Lawrence Carin
36
14
0
23 Nov 2016
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Tegan Maharaj
Nicolas Ballas
Anna Rohrbach
Aaron Courville
C. Pal
VGen
13
107
0
23 Nov 2016
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
19
329
0
23 Nov 2016
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li-Jia Li
VLM
19
169
0
21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
25
373
0
20 Nov 2016
Recurrent Memory Addressing for describing videos
A. Jain
Abhinav Agarwalla
Kumar Krishna Agrawal
Pabitra Mitra
30
10
0
20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
13
1,649
0
17 Nov 2016
A Semi-supervised Framework for Image Captioning
Wenhu Chen
Aurélien Lucchi
Thomas Hofmann
29
9
0
16 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard E. Turner
Douglas Eck
27
168
0
09 Nov 2016
Crowdsourcing in Computer Vision
Adriana Kovashka
Olga Russakovsky
Li Fei-Fei
Kristen Grauman
HAI
VLM
3DV
34
149
0
07 Nov 2016
Boosting Image Captioning with Attributes
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
VLM
31
620
0
05 Nov 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering
Youngjae Yu
Hyungjin Ko
Jongwook Choi
Gunhee Kim
6
229
0
10 Oct 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
17
540
0
07 Oct 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Kushal Kafle
Christopher Kanan
OOD
25
235
0
05 Oct 2016
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Yunchen Pu
Zhe Gan
Ricardo Henao
Xin Yuan
Chunyuan Li
Andrew Stevens
Lawrence Carin
BDL
CoGe
19
745
0
28 Sep 2016
Visual Fashion-Product Search at SK Planet
Taewan Kim
Seyeong Kim
Sangil Na
Hayoon Kim
Moonki Kim
Beyeongki Jeon
9
6
0
26 Sep 2016
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DV
VLM
19
122
0
22 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
44
14
0
21 Sep 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
19
848
0
21 Sep 2016
Multimodal Attention for Neural Machine Translation
Ozan Caglayan
Loïc Barrault
Fethi Bougares
26
75
0
13 Sep 2016
Measuring Machine Intelligence Through Visual Question Answering
C. L. Zitnick
Aishwarya Agrawal
Stanislaw Antol
Margaret Mitchell
Dhruv Batra
Devi Parikh
19
37
0
31 Aug 2016
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
27
69
0
25 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Yusuke Sugano
Andreas Bulling
16
68
0
18 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation
Rakshith Shetty
Jorma T. Laaksonen
11
94
0
17 Aug 2016
DeepDiary: Automatic Caption Generation for Lifelogging Image Streams
Chenyou Fan
David J. Crandall
DiffM
9
5
0
12 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
17
5
0
09 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
34
1,883
0
29 Jul 2016
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
A. Hengel
27
413
0
20 Jul 2016
Domain Adaptation for Neural Networks by Parameter Augmentation
Yusuke Watanabe
Kazuma Hashimoto
Yoshimasa Tsuruoka
OOD
16
6
0
01 Jul 2016
Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles
Stefan Lee
Senthil Purushwalkam
Michael Cogswell
Viresh Ranjan
David J. Crandall
Dhruv Batra
BDL
UQCV
OOD
23
175
0
24 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
31
60
0
15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
14
44
0
15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
13
29
0
11 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments
Karan Sharma
Arun C. S. Kumar
S. Bhandarkar
12
0
0
04 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
24
2
0
26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Andrew Shin
Katsunori Ohnishi
Tatsuya Harada
12
31
0
18 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
353
0
12 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
14
83
0
04 May 2016
Video Description using Bidirectional Recurrent Neural Networks
Álvaro Peris
Marc Bolaños
P. Radeva
F. Casacuberta
9
33
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
22
269
0
10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Gordon A. Christie
A. Laddha
Aishwarya Agrawal
Stanislaw Antol
Yash Goyal
K. Kochersberger
Dhruv Batra
15
30
0
07 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
20
117
0
06 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
10
278
0
04 Apr 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
45
8
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
19
123
0
30 Mar 2016
Previous
1
2
3
...
41
42
43
Next