Papers citing 'CIDEr: Consensus-based Image Description Evaluation'

Title
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 151 66 0 15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional Attention Luowei Zhou Chenliang Xu Parker A. Koch Jason J. Corso VLM 202 44 0 15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data Shikhar Sharma Jing He Kaheer Suleman Hannes Schulz Philip Bachman 222 30 0 11 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments Karan Sharma Arun C. S. Kumar S. Bhandarkar 77 0 0 04 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey Hirokatsu Kataoka Yudai Miyashita Tomoaki K. Yamabe Soma Shirakabe Shin-ichi Sato ... Kaori Abe Takaaki Imanari Naomichi Kobayashi Shinichiro Morita Akio Nakamura 116 2 0 26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences Andrew Shin Katsunori Ohnishi Tatsuya Harada 123 33 0 18 May 2016
Movie Description Anna Rohrbach Atousa Torabi Marcus Rohrbach Niket Tandon C. Pal Hugo Larochelle Aaron Courville Bernt Schiele 3DV VGen 234 387 0 12 May 2016
Leveraging Visual Question Answering for Image-Caption Ranking Xiaoyu Lin Devi Parikh CoGe 225 88 0 04 May 2016
Video Description using Bidirectional Recurrent Neural Networks Álvaro Peris Marc Bolaños Petia Radeva F. Casacuberta 160 34 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 199 295 0 10 Apr 2016
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes Gordon A. Christie A. Laddha Aishwarya Agrawal Stanislaw Antol Yash Goyal K. Kochersberger Dhruv Batra 289 31 0 07 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Subhashini Venugopalan Lisa Anne Hendricks Raymond J. Mooney Kate Saenko VLM 164 121 0 06 Apr 2016
Image Captioning with Deep Bidirectional LSTMs Cheng Wang Haojin Yang Christian Bartz Christoph Meinel VLM 204 292 0 04 Apr 2016
Dense Image Representation with Spatial Pyramid VLAD Coding of CNN for Locally Robust Captioning Andrew Shin Masataka Yamaguchi Katsunori Ohnishi Tatsuya Harada 124 8 0 30 Mar 2016
Rich Image Captioning in the Wild Kenneth Tran Xiaodong He Lei Zhang Jian Sun Cornelia Carapcea Chris Thrasher Chris Buehler Chris Sienkiewicz VLM 145 127 0 30 Mar 2016
Generating Visual Explanations Lisa Anne Hendricks Zeynep Akata Marcus Rohrbach Jeff Donahue Bernt Schiele Trevor Darrell VLM FAtt 257 644 0 28 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing Arnau Ramisa F. Yan Francesc Moreno-Noguer K. Mikolajczyk 194 113 0 23 Mar 2016
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge Qi Wu Chunhua Shen Anton Van Den Hengel Peng Wang A. Dick 206 373 0 09 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata ... Yannis Kalantidis Li Li David A. Shamma Michael S. Bernstein Fei-Fei Li 924 6,192 0 23 Feb 2016
Survey on the attention based RNN model and its applications in computer vision Feng Wang David Tax AI4TS AIMat 123 127 0 25 Jan 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures Raffaella Bernardi Ruken Cakici Desmond Elliott Aykut Erdem Erkut Erdem Nazli Ikizler-Cinbis Frank Keller A. Muscat Barbara Plank EGVM VLM 214 374 0 15 Jan 2016
Neural Self Talk: Image Understanding via Continuous Questioning and Answering Yezhou Yang Yi Li Cornelia Fermuller Yiannis Aloimonos 117 24 0 10 Dec 2015
Video captioning with recurrent networks based on frame- and video-level features and visual content classification Rakshith Shetty Jorma T. Laaksonen 130 31 0 09 Dec 2015
DenseCap: Fully Convolutional Localization Networks for Dense Captioning Justin Johnson A. Karpathy Li Fei-Fei VLM 273 1,212 0 24 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations Nicolas Ballas Weitong Chen C. Pal Aaron Courville MDE 283 756 0 19 Nov 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 149 45 0 15 Nov 2015
Oracle performance for visual captioning Weitong Chen Nicolas Ballas Dong Wang John R. Smith Yoshua Bengio VLM 399 9 0 14 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning Pingbo Pan Zhongwen Xu Yi Yang Leilei Gan Yueting Zhuang 154 391 0 11 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions Junhua Mao Jonathan Huang Alexander Toshev Oana-Maria Camburu Alan Yuille Kevin Patrick Murphy ObjD 660 1,547 0 07 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks Haonan Yu Jiang Wang Zhiheng Huang Yi Yang Wenyuan Xu 344 571 0 26 Oct 2015
Multilingual Image Description with Neural Sequence Models Desmond Elliott Stella Frank Eva Hasler VLM 250 77 0 15 Oct 2015
Image Representations and New Domains in Neural Image Captioning Jack Hessel Nicolas Savva Michael J. Wilber VLM 98 16 0 09 Aug 2015
Describing Multimedia Content using Attention-based Encoder--Decoder Networks Dong Wang Aaron Courville Yoshua Bengio 184 432 0 04 Jul 2015
deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets Michel Galley Chris Brockett Alessandro Sordoni Yangfeng Ji Michael Auli Chris Quirk Margaret Mitchell Jianfeng Gao W. Dolan 279 159 0 23 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural NetworksNeural Information Processing Systems (NeurIPS), 2015 Samy Bengio Oriol Vinyals Navdeep Jaitly Noam M. Shazeer 816 2,176 0 09 Jun 2015
The Long-Short Story of Movie DescriptionGerman Conference on Pattern Recognition (DAGM), 2015 Anna Rohrbach Marcus Rohrbach Bernt Schiele VLM 139 117 0 04 Jun 2015
What value do explicit high level concepts have in vision to language problems?Computer Vision and Pattern Recognition (CVPR), 2015 Qi Wu Chunhua Shen Lingqiao Liu A. Dick Anton Van Den Hengel 312 459 0 03 Jun 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question AnsweringNeural Information Processing Systems (NeurIPS), 2015 Haoyuan Gao Junhua Mao Jie Zhou Zhiheng Huang Lei Wang Wenyuan Xu 283 519 0 21 May 2015
Exploring Nearest Neighbor Approaches for Image Captioning Jacob Devlin Saurabh Gupta Ross B. Girshick Margaret Mitchell C. L. Zitnick 282 199 0 17 May 2015
Sequence to Sequence -- Video to Text Subhashini Venugopalan Marcus Rohrbach Jeff Donahue Raymond J. Mooney Trevor Darrell Kate Saenko 364 1,468 0 03 May 2015
VQA: Visual Question Answering Aishwarya Agrawal Jiasen Lu Stanislaw Antol Margaret Mitchell C. L. Zitnick Dhruv Batra Devi Parikh CoGe 983 6,054 0 03 May 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images Junhua Mao Xu Wei Yi Yang Jiang Wang Zhiheng Huang Alan Yuille 174 160 0 25 Apr 2015
Microsoft COCO Captions: Data Collection and Evaluation Server Xinlei Chen Hao Fang Nayeon Lee Ramakrishna Vedantam Saurabh Gupta Piotr Dollar C. L. Zitnick 674 2,723 0 01 Apr 2015
Describing Videos by Exploiting Temporal Structure Weitong Chen Atousa Torabi Dong Wang Nicolas Ballas C. Pal Hugo Larochelle Aaron Courville 438 1,092 0 27 Feb 2015
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra 141 30 0 19 Feb 2015
Image Specificity M. Jas Devi Parikh 178 40 0 16 Feb 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)International Conference on Learning Representations (ICLR), 2014 Junhua Mao Wenyuan Xu Yi Yang Jiang Wang Zhiheng Huang Alan Yuille VLM 560 1,272 0 20 Dec 2014
Deep Visual-Semantic Alignments for Generating Image DescriptionsComputer Vision and Pattern Recognition (CVPR), 2014 A. Karpathy Li Fei-Fei 498 5,850 0 07 Dec 2014
From Captions to Visual Concepts and BackComputer Vision and Pattern Recognition (CVPR), 2014 Hao Fang Saurabh Gupta F. Iandola R. Srivastava Li Deng ... Xiaodong He Margaret Mitchell John C. Platt C. L. Zitnick Geoffrey Zweig VLM 377 1,346 0 18 Nov 2014
Show and Tell: A Neural Image Caption GeneratorComputer Vision and Pattern Recognition (CVPR), 2014 Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 3DV 572 6,349 0 17 Nov 2014