ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.5726
  4. Cited By
CIDEr: Consensus-based Image Description Evaluation
v1v2 (latest)

CIDEr: Consensus-based Image Description Evaluation

Computer Vision and Pattern Recognition (CVPR), 2014
20 November 2014
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
ArXiv (abs)PDFHTML

Papers citing "CIDEr: Consensus-based Image Description Evaluation"

50 / 2,346 papers shown
Title
Attentive Explanations: Justifying Decisions and Pointing to the
  Evidence
Attentive Explanations: Justifying Decisions and Pointing to the Evidence
Dong Huk Park
Lisa Anne Hendricks
Zeynep Akata
Bernt Schiele
Trevor Darrell
Marcus Rohrbach
AAML
191
80
0
14 Dec 2016
Text-guided Attention Model for Image Captioning
Text-guided Attention Model for Image Captioning
Jonghwan Mun
Minsu Cho
Bohyung Han
VLM
110
95
0
12 Dec 2016
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image
  Captioning
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
400
1,564
0
06 Dec 2016
Areas of Attention for Image Captioning
Areas of Attention for Image Captioning
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek
236
215
0
03 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
968
3,749
0
02 Dec 2016
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Guided Open Vocabulary Image Captioning with Constrained Beam Search
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
315
247
0
02 Dec 2016
Self-critical Sequence Training for Image Captioning
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
499
2,032
0
02 Dec 2016
Improved Image Captioning via Policy Gradient optimization of SPIDEr
Improved Image Captioning via Policy Gradient optimization of SPIDEr
Siqi Liu
Zhenhai Zhu
Ning Ye
S. Guadarrama
Kevin Patrick Murphy
498
474
0
01 Dec 2016
Video Captioning with Multi-Faceted Attention
Video Captioning with Multi-Faceted Attention
Xiang Long
Chuang Gan
Gerard de Melo
157
88
0
01 Dec 2016
NewsQA: A Machine Comprehension Dataset
NewsQA: A Machine Comprehension Dataset
Adam Trischler
Tong Wang
Xingdi Yuan
Justin Harris
Alessandro Sordoni
Philip Bachman
Kaheer Suleman
545
924
0
29 Nov 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
263
196
0
28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu
Zhongwen Xu
Yi Yang
150
78
0
28 Nov 2016
On Human Intellect and Machine Failures: Troubleshooting Integrative
  Machine Learning Systems
On Human Intellect and Machine Failures: Troubleshooting Integrative Machine Learning Systems
Besmira Nushi
Ece Kamar
Eric Horvitz
Donald Kossmann
153
82
0
24 Nov 2016
Scalable Bayesian Learning of Recurrent Neural Networks for Language
  Modeling
Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling
Zhe Gan
Chunyuan Li
Changyou Chen
Yunchen Pu
Qinliang Su
Lawrence Carin
BDLUQCV
239
42
0
23 Nov 2016
Semantic Compositional Networks for Visual Captioning
Semantic Compositional Networks for Visual Captioning
Zhe Gan
Chuang Gan
Xiaodong He
Yunchen Pu
Kenneth Tran
Jianfeng Gao
Lawrence Carin
Li Deng
CoGe
261
443
0
23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text
Adaptive Feature Abstraction for Translating Video to Text
Yunchen Pu
Martin Renqiang Min
Zhe Gan
Lawrence Carin
186
14
0
23 Nov 2016
A dataset and exploration of models for understanding video data through
  fill-in-the-blank question-answering
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Tegan Maharaj
Nicolas Ballas
Anna Rohrbach
Aaron Courville
C. Pal
VGen
157
113
0
23 Nov 2016
Video Captioning with Transferred Semantic Attributes
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
138
336
0
23 Nov 2016
Dense Captioning with Joint Inference and Visual Context
Dense Captioning with Joint Inference and Visual Context
L. Yang
K. Tang
Jianchao Yang
Li Li
VLM
210
177
0
21 Nov 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
205
398
0
20 Nov 2016
Recurrent Memory Addressing for describing videos
Recurrent Memory Addressing for describing videos
A. Jain
Abhinav Agarwalla
Kumar Krishna Agrawal
Pabitra Mitra
128
10
0
20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
416
1,775
0
17 Nov 2016
A Semi-supervised Framework for Image Captioning
A Semi-supervised Framework for Image Captioning
Wenhu Chen
Aurelien Lucchi
Thomas Hofmann
209
9
0
16 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models
  with KL-control
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
407
198
0
09 Nov 2016
Crowdsourcing in Computer Vision
Crowdsourcing in Computer Vision
Adriana Kovashka
Olga Russakovsky
Li Fei-Fei
Kristen Grauman
HAIVLM3DV
100
130
0
07 Nov 2016
Boosting Image Captioning with Attributes
Boosting Image Captioning with Attributes
Ting Yao
Yingwei Pan
Yehao Li
Zhaofan Qiu
Tao Mei
VLM
284
647
0
05 Nov 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and
  Question Answering
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2016
Youngjae Yu
Hyungjin Ko
Jongwook Choi
Gunhee Kim
376
239
0
10 Oct 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence
  Models
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
309
602
0
07 Oct 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Visual Question Answering: Datasets, Algorithms, and Future ChallengesComputer Vision and Image Understanding (CVIU), 2016
Kushal Kafle
Christopher Kanan
OOD
236
256
0
05 Oct 2016
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Variational Autoencoder for Deep Learning of Images, Labels and Captions
Yunchen Pu
Zhe Gan
Ricardo Henao
Xin Yuan
Chunyuan Li
Andrew Stevens
Lawrence Carin
BDLCoGe
153
809
0
28 Sep 2016
Visual Fashion-Product Search at SK Planet
Visual Fashion-Product Search at SK Planet
Taewan Kim
Seyeong Kim
Sangil Na
Hayoon Kim
Moonki Kim
Beyeongki Jeon
232
6
0
26 Sep 2016
Deep Learning for Video Classification and Captioning
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DVVLM
131
139
0
22 Sep 2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question
  Answering (FSVQA)
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)
Andrew Shin
Yoshitaka Ushiku
Tatsuya Harada
133
16
0
21 Sep 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning
  Challenge
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
237
896
0
21 Sep 2016
Multimodal Attention for Neural Machine Translation
Multimodal Attention for Neural Machine Translation
Ozan Caglayan
Loïc Barrault
Fethi Bougares
143
79
0
13 Sep 2016
Measuring Machine Intelligence Through Visual Question Answering
Measuring Machine Intelligence Through Visual Question Answering
C. L. Zitnick
Aishwarya Agrawal
Stanislaw Antol
Margaret Mitchell
Dhruv Batra
Devi Parikh
149
38
0
31 Aug 2016
Title Generation for User Generated Videos
Title Generation for User Generated VideosEuropean Conference on Computer Vision (ECCV), 2016
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
160
71
0
25 Aug 2016
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Seeing with Humans: Gaze-Assisted Neural Image Captioning
Yusuke Sugano
Andreas Bulling
212
72
0
18 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for
  Video Caption Generation
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption GenerationACM Multimedia (MM), 2016
Rakshith Shetty
Jorma T. Laaksonen
131
94
0
17 Aug 2016
DeepDiary: Automatic Caption Generation for Lifelogging Image Streams
DeepDiary: Automatic Caption Generation for Lifelogging Image Streams
Chenyou Fan
David J. Crandall
DiffM
78
5
0
12 Aug 2016
Mean Box Pooling: A Rich Image Representation and Output Embedding for
  the Visual Madlibs Task
Mean Box Pooling: A Rich Image Representation and Output Embedding for the Visual Madlibs Task
Ashkan Mokarian
Mateusz Malinowski
Mario Fritz
234
5
0
09 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
310
2,141
0
29 Jul 2016
Visual Question Answering: A Survey of Methods and Datasets
Visual Question Answering: A Survey of Methods and Datasets
Qi Wu
Damien Teney
Peng Wang
Chunhua Shen
A. Dick
Anton Van Den Hengel
309
448
0
20 Jul 2016
Domain Adaptation for Neural Networks by Parameter Augmentation
Domain Adaptation for Neural Networks by Parameter Augmentation
Yusuke Watanabe
Kazuma Hashimoto
Yoshimasa Tsuruoka
OOD
133
6
0
01 Jul 2016
Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles
Stochastic Multiple Choice Learning for Training Diverse Deep EnsemblesNeural Information Processing Systems (NeurIPS), 2016
Stefan Lee
Senthil Purushwalkam
Michael Cogswell
Viresh Ranjan
David J. Crandall
Dhruv Batra
BDLUQCVOOD
242
189
0
24 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
143
66
0
15 Jun 2016
Watch What You Just Said: Image Captioning with Text-Conditional
  Attention
Watch What You Just Said: Image Captioning with Text-Conditional Attention
Luowei Zhou
Chenliang Xu
Parker A. Koch
Jason J. Corso
VLM
202
44
0
15 Jun 2016
Natural Language Generation in Dialogue using Lexicalized and
  Delexicalized Data
Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data
Shikhar Sharma
Jing He
Kaheer Suleman
Hannes Schulz
Philip Bachman
202
30
0
11 Jun 2016
Automated Image Captioning for Rapid Prototyping and Resource
  Constrained Environments
Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments
Karan Sharma
Arun C. S. Kumar
S. Bhandarkar
65
0
0
04 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
116
2
0
26 May 2016
Previous
123...454647
Next