CIDEr: Consensus-based Image Description Evaluation

Computer Vision and Pattern Recognition (CVPR), 2014

20 November 2014

Ramakrishna Vedantam

C. L. Zitnick

Devi Parikh

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,346 papers shown

Attentive Explanations: Justifying Decisions and Pointing to the Evidence Dong Huk Park Lisa Anne Hendricks Zeynep Akata Bernt Schiele Trevor Darrell Marcus Rohrbach AAML 191 80 0 14 Dec 2016
Text-guided Attention Model for Image Captioning Jonghwan Mun Minsu Cho Bohyung Han VLM 110 95 0 12 Dec 2016
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning Jiasen Lu Caiming Xiong Devi Parikh R. Socher 400 1,564 0 06 Dec 2016
Areas of Attention for Image Captioning M. Pedersoli Thomas Lucas Cordelia Schmid Jakob Verbeek 236 215 0 03 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Yash Goyal Tejas Khot D. Summers-Stay Dhruv Batra Devi Parikh CoGe 968 3,749 0 02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search Peter Anderson Basura Fernando Mark Johnson Stephen Gould 315 247 0 02 Dec 2016
Self-critical Sequence Training for Image Captioning Steven J. Rennie E. Marcheret Youssef Mroueh Jerret Ross Vaibhava Goel 499 2,032 0 02 Dec 2016
Improved Image Captioning via Policy Gradient optimization of SPIDEr Siqi Liu Zhenhai Zhu Ning Ye S. Guadarrama Kevin Patrick Murphy 498 474 0 01 Dec 2016
Video Captioning with Multi-Faceted Attention Xiang Long Chuang Gan Gerard de Melo 157 88 0 01 Dec 2016
NewsQA: A Machine Comprehension Dataset Adam Trischler Tong Wang Xingdi Yuan Justin Harris Alessandro Sordoni Philip Bachman Kaheer Suleman 545 924 0 29 Nov 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi C. Grana Rita Cucchiara 263 196 0 28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Linchao Zhu Zhongwen Xu Yi Yang 150 78 0 28 Nov 2016
On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems Besmira Nushi Ece Kamar Eric Horvitz Donald Kossmann 153 82 0 24 Nov 2016
Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling Zhe Gan Chunyuan Li Changyou Chen Yunchen Pu Qinliang Su Lawrence Carin BDL UQCV 239 42 0 23 Nov 2016
Semantic Compositional Networks for Visual Captioning Zhe Gan Chuang Gan Xiaodong He Yunchen Pu Kenneth Tran Jianfeng Gao Lawrence Carin Li Deng CoGe 261 443 0 23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text Yunchen Pu Martin Renqiang Min Zhe Gan Lawrence Carin 186 14 0 23 Nov 2016
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering Tegan Maharaj Nicolas Ballas Anna Rohrbach Aaron Courville C. Pal VGen 157 113 0 23 Nov 2016
Video Captioning with Transferred Semantic Attributes Yingwei Pan Ting Yao Houqiang Li Tao Mei 138 336 0 23 Nov 2016
Dense Captioning with Joint Inference and Visual Context L. Yang K. Tang Jianchao Yang Li Li VLM 210 177 0 21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs J. Krause Justin Johnson Ranjay Krishna Li Fei-Fei VLM 205 398 0 20 Nov 2016
Recurrent Memory Addressing for describing videos A. Jain Abhinav Agarwalla Kumar Krishna Agrawal Pabitra Mitra 128 10 0 20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning Long Chen Hanwang Zhang Jun Xiao Liqiang Nie Jian Shao Wei Liu Tat-Seng Chua 416 1,775 0 17 Nov 2016
A Semi-supervised Framework for Image Captioning Wenhu Chen Aurelien Lucchi Thomas Hofmann 209 9 0 16 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control Natasha Jaques S. Gu Dzmitry Bahdanau José Miguel Hernández-Lobato Richard Turner Douglas Eck 407 198 0 09 Nov 2016
Crowdsourcing in Computer Vision Adriana Kovashka Olga Russakovsky Li Fei-Fei Kristen Grauman HAI VLM 3DV 100 130 0 07 Nov 2016
Boosting Image Captioning with Attributes Ting Yao Yingwei Pan Yehao Li Zhaofan Qiu Tao Mei VLM 284 647 0 05 Nov 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering. Computer Vision and Pattern Recognition (CVPR), 2016 Youngjae Yu Hyungjin Ko Jongwook Choi Gunhee Kim 376 239 0 10 Oct 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models Ashwin K. Vijayakumar Michael Cogswell Ramprasaath R. Selvaraju Q. Sun Stefan Lee David J. Crandall Dhruv Batra 309 602 0 07 Oct 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges. Computer Vision and Image Understanding (CVIU), 2016 Kushal Kafle Christopher Kanan OOD 236 256 0 05 Oct 2016
Variational Autoencoder for Deep Learning of Images, Labels and Captions Yunchen Pu Zhe Gan Ricardo Henao Xin Yuan Chunyuan Li Andrew Stevens Lawrence Carin BDL CoGe 153 809 0 28 Sep 2016
Visual Fashion-Product Search at SK Planet Taewan Kim Seyeong Kim Sangil Na Hayoon Kim Moonki Kim Beyeongki Jeon 232 6 0 26 Sep 2016
Deep Learning for Video Classification and Captioning Zuxuan Wu Ting Yao Yanwei Fu Yu-Gang Jiang 3DV VLM 131 139 0 22 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA) Andrew Shin Yoshitaka Ushiku Tatsuya Harada 133 16 0 21 Sep 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 237 896 0 21 Sep 2016
Multimodal Attention for Neural Machine Translation Ozan Caglayan Loïc Barrault Fethi Bougares 143 79 0 13 Sep 2016
Measuring Machine Intelligence Through Visual Question Answering C. L. Zitnick Aishwarya Agrawal Stanislaw Antol Margaret Mitchell Dhruv Batra Devi Parikh 149 38 0 31 Aug 2016
Title Generation for User Generated Videos. European Conference on Computer Vision (ECCV), 2016 Kuo-Hao Zeng Tseng-Hung Chen Juan Carlos Niebles Min Sun 160 71 0 25 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning Yusuke Sugano Andreas Bulling 212 72 0 18 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation. ACM Multimedia (MM), 2016 Rakshith Shetty Jorma T. Laaksonen 131 94 0 17 Aug 2016
DeepDiary: Automatic Caption Generation for Lifelogging Image Streams Chenyou Fan David J. Crandall DiffM 78 5 0 12 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task Ashkan Mokarian Mateusz Malinowski Mario Fritz 234 5 0 09 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation Peter Anderson Basura Fernando Mark Johnson Stephen Gould EGVM 310 2,141 0 29 Jul 2016
Visual Question Answering: A Survey of Methods and Datasets Qi Wu Damien Teney Peng Wang Chunhua Shen A. Dick Anton Van Den Hengel 309 448 0 20 Jul 2016
Domain Adaptation for Neural Networks by Parameter Augmentation Yusuke Watanabe Kazuma Hashimoto Yoshimasa Tsuruoka OOD 133 6 0 01 Jul 2016
Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles. Neural Information Processing Systems (NeurIPS), 2016 Stefan Lee Senthil Purushwalkam Michael Cogswell Viresh Ranjan David J. Crandall Dhruv Batra BDL UQCV OOD 242 189 0 24 Jun 2016
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 143 66 0 15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional Attention Luowei Zhou Chenliang Xu Parker A. Koch Jason J. Corso VLM 202 44 0 15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data Shikhar Sharma Jing He Kaheer Suleman Hannes Schulz Philip Bachman 202 30 0 11 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments Karan Sharma Arun C. S. Kumar S. Bhandarkar 65 0 0 04 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey Hirokatsu Kataoka Yudai Miyashita Tomoaki K. Yamabe Soma Shirakabe Shin-ichi Sato ... Kaori Abe Takaaki Imanari Naomichi Kobayashi Shinichiro Morita Akio Nakamura 116 2 0 26 May 2016