Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1411.5726
Cited By
v1
v2 (latest)
CIDEr: Consensus-based Image Description Evaluation
Computer Vision and Pattern Recognition (CVPR), 2014
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CIDEr: Consensus-based Image Description Evaluation"
50 / 2,351 papers shown
Title
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
151
66
0
15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
202
44
0
15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
222
30
0
11 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments
Karan Sharma
Arun C. S. Kumar
S. Bhandarkar
77
0
0
04 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
116
2
0
26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Andrew Shin
Katsunori Ohnishi
Tatsuya Harada
123
33
0
18 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
234
387
0
12 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking
Xiaoyu Lin
Devi Parikh
CoGe
225
88
0
04 May 2016
Video Description using Bidirectional Recurrent Neural Networks
Álvaro Peris
Marc Bolaños
Petia Radeva
F. Casacuberta
160
34
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
199
295
0
10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
Gordon A. Christie
A. Laddha
Aishwarya Agrawal
Stanislaw Antol
Yash Goyal
K. Kochersberger
Dhruv Batra
289
31
0
07 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
164
121
0
06 Apr 2016
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang
Haojin Yang
Christian Bartz
Christoph Meinel
VLM
204
292
0
04 Apr 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning
Andrew Shin
Masataka Yamaguchi
Katsunori Ohnishi
Tatsuya Harada
124
8
0
30 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
145
127
0
30 Mar 2016
Generating Visual Explanations
Lisa Anne Hendricks
Zeynep Akata
Marcus Rohrbach
Jeff Donahue
Bernt Schiele
Trevor Darrell
VLM
FAtt
257
644
0
28 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
194
113
0
23 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
Qi Wu
Chunhua Shen
Anton Van Den Hengel
Peng Wang
A. Dick
206
373
0
09 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
924
6,192
0
23 Feb 2016
Survey on the attention based RNN model and its applications in computer vision
Feng Wang
David Tax
AI4TS
AIMat
123
127
0
25 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
214
374
0
15 Jan 2016
Neural Self Talk: Image Understanding via Continuous Questioning and Answering
Yezhou Yang
Yi Li
Cornelia Fermuller
Yiannis Aloimonos
117
24
0
10 Dec 2015
Video captioning with recurrent networks based on frame- and video-level features and visual content classification
Rakshith Shetty
Jorma T. Laaksonen
130
31
0
09 Dec 2015
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
273
1,212
0
24 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations
Nicolas Ballas
Weitong Chen
C. Pal
Aaron Courville
MDE
283
756
0
19 Nov 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
149
45
0
15 Nov 2015
Oracle performance for visual captioning
Weitong Chen
Nicolas Ballas
Dong Wang
John R. Smith
Yoshua Bengio
VLM
399
9
0
14 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan
Zhongwen Xu
Yi Yang
Leilei Gan
Yueting Zhuang
154
391
0
11 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
660
1,547
0
07 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
Wenyuan Xu
344
571
0
26 Oct 2015
Multilingual Image Description with Neural Sequence Models
Desmond Elliott
Stella Frank
Eva Hasler
VLM
250
77
0
15 Oct 2015
Image Representations and New Domains in Neural Image Captioning
Jack Hessel
Nicolas Savva
Michael J. Wilber
VLM
98
16
0
09 Aug 2015
Describing Multimedia Content using Attention-based Encoder--Decoder Networks
Dong Wang
Aaron Courville
Yoshua Bengio
184
432
0
04 Jul 2015
deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets
Michel Galley
Chris Brockett
Alessandro Sordoni
Yangfeng Ji
Michael Auli
Chris Quirk
Margaret Mitchell
Jianfeng Gao
W. Dolan
279
159
0
23 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Neural Information Processing Systems (NeurIPS), 2015
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
816
2,176
0
09 Jun 2015
The Long-Short Story of Movie Description
German Conference on Pattern Recognition (DAGM), 2015
Anna Rohrbach
Marcus Rohrbach
Bernt Schiele
VLM
139
117
0
04 Jun 2015
What value do explicit high level concepts have in vision to language problems?
Computer Vision and Pattern Recognition (CVPR), 2015
Qi Wu
Chunhua Shen
Lingqiao Liu
A. Dick
Anton Van Den Hengel
312
459
0
03 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Neural Information Processing Systems (NeurIPS), 2015
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
283
519
0
21 May 2015
Exploring Nearest Neighbor Approaches for Image Captioning
Jacob Devlin
Saurabh Gupta
Ross B. Girshick
Margaret Mitchell
C. L. Zitnick
282
199
0
17 May 2015
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
364
1,468
0
03 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
983
6,054
0
03 May 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao
Xu Wei
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
174
160
0
25 Apr 2015
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Nayeon Lee
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollar
C. L. Zitnick
674
2,723
0
01 Apr 2015
Describing Videos by Exploiting Temporal Structure
Weitong Chen
Atousa Torabi
Dong Wang
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
438
1,092
0
27 Feb 2015
VIP: Finding Important People in Images
Clint Solomon Mathialagan
Andrew C. Gallagher
Dhruv Batra
141
30
0
19 Feb 2015
Image Specificity
M. Jas
Devi Parikh
178
40
0
16 Feb 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
International Conference on Learning Representations (ICLR), 2014
Junhua Mao
Wenyuan Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
VLM
560
1,272
0
20 Dec 2014
Deep Visual-Semantic Alignments for Generating Image Descriptions
Computer Vision and Pattern Recognition (CVPR), 2014
A. Karpathy
Li Fei-Fei
498
5,850
0
07 Dec 2014
From Captions to Visual Concepts and Back
Computer Vision and Pattern Recognition (CVPR), 2014
Hao Fang
Saurabh Gupta
F. Iandola
R. Srivastava
Li Deng
...
Xiaodong He
Margaret Mitchell
John C. Platt
C. L. Zitnick
Geoffrey Zweig
VLM
377
1,346
0
18 Nov 2014
Show and Tell: A Neural Image Caption Generator
Computer Vision and Pattern Recognition (CVPR), 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
572
6,349
0
17 Nov 2014
Previous
1
2
3
...
46
47
48
Next