ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.08822
  4. Cited By
SPICE: Semantic Propositional Image Caption Evaluation

SPICE: Semantic Propositional Image Caption Evaluation

29 July 2016
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
    EGVM
ArXiv (abs)PDFHTML

Papers citing "SPICE: Semantic Propositional Image Caption Evaluation"

50 / 1,002 papers shown
Normalized and Geometry-Aware Self-Attention Network for Image
  Captioning
Normalized and Geometry-Aware Self-Attention Network for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2020
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
312
218
0
19 Mar 2020
Object-Centric Image Generation from Layouts
Object-Centric Image Generation from LayoutsAAAI Conference on Artificial Intelligence (AAAI), 2020
Tristan Sylvain
Pengchuan Zhang
Yoshua Bengio
R. Devon Hjelm
Shikhar Sharma
EGVMOCL
331
106
0
16 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Deconfounded Image Captioning: A Causal RetrospectIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
222
149
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Better Captioning with Sequence-Level ExplorationComputer Vision and Pattern Recognition (CVPR), 2020
Jia Chen
Qin Jin
143
12
0
08 Mar 2020
Captioning Images with Novel Objects via Online Vocabulary Expansion
Captioning Images with Novel Objects via Online Vocabulary Expansion
Mikihiro Tanaka
Tatsuya Harada
3DV
219
2
0
06 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions
Show, Edit and Tell: A Framework for Editing Image CaptionsComputer Vision and Pattern Recognition (CVPR), 2020
Fawaz Sammani
Luke Melas-Kyriazi
KELMDiffM
153
65
0
06 Mar 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with
  Abstract Scene Graphs
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene GraphsComputer Vision and Pattern Recognition (CVPR), 2020
Shizhe Chen
Qin Jin
Peng Wang
Qi Wu
DiffM
333
240
0
01 Mar 2020
Exploring and Distilling Cross-Modal Information for Image Captioning
Exploring and Distilling Cross-Modal Information for Image CaptioningInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Fenglin Liu
Xuancheng Ren
Yuanxin Liu
Kai Lei
Xu Sun
ViT
195
55
0
28 Feb 2020
Visual Commonsense R-CNN
Visual Commonsense R-CNNComputer Vision and Pattern Recognition (CVPR), 2020
Tan Wang
Jianqiang Huang
Hanwang Zhang
Qianru Sun
SSLObjDCML
270
280
0
27 Feb 2020
Analysis of diversity-accuracy tradeoff in image captioning
Analysis of diversity-accuracy tradeoff in image captioning
Ruotian Luo
Gregory Shakhnarovich
134
15
0
27 Feb 2020
Captioning Images Taken by People Who Are Blind
Captioning Images Taken by People Who Are BlindEuropean Conference on Computer Vision (ECCV), 2020
Danna Gurari
Yinan Zhao
Meng Zhang
Nilavra Bhattacharya
334
203
0
20 Feb 2020
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings
Latent Normalizing Flows for Many-to-Many Cross-Domain MappingsInternational Conference on Learning Representations (ICLR), 2020
Shweta Mahajan
Iryna Gurevych
Stefan Roth
DRL
170
38
0
16 Feb 2020
3D Dynamic Scene Graphs: Actionable Spatial Perception with Places,
  Objects, and Humans
3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans
Antoni Rosinol
Arjun Gupta
Marcus Abate
Jingang Shi
Luca Carlone
306
225
0
15 Feb 2020
CBAG: Conditional Biomedical Abstract Generation
CBAG: Conditional Biomedical Abstract GenerationPLoS ONE (PLOS ONE), 2020
Justin Sybrandt
Ilya Safro
MedImAI4CE
146
10
0
13 Feb 2020
Sparse and Structured Visual Attention
Sparse and Structured Visual AttentionInternational Conference on Information Photonics (ICIP), 2019
Pedro Henrique Martins
S. Becker
Zita Marinho
Michael Arens
167
9
0
13 Feb 2020
Show, Recall, and Tell: Image Captioning with Recall Mechanism
Show, Recall, and Tell: Image Captioning with Recall MechanismAAAI Conference on Artificial Intelligence (AAAI), 2020
Li Wang
Zechen Bai
Yonghua Zhang
Hongtao Lu
269
71
0
15 Jan 2020
In Defense of Grid Features for Visual Question Answering
In Defense of Grid Features for Visual Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2020
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OODObjD
338
358
0
10 Jan 2020
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning
  Models
Explain and Improve: LRP-Inference Fine-Tuning for Image Captioning ModelsInformation Fusion (Inf. Fusion), 2020
Jiamei Sun
Sebastian Lapuschkin
Wojciech Samek
Alexander Binder
FAtt
645
36
0
04 Jan 2020
Going Beneath the Surface: Evaluating Image Captioning for
  Grammaticality, Truthfulness and Diversity
Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity
Huiyuan Xie
Tom Sherborne
A. Kuhnle
Ann A. Copestake
DiffM
124
9
0
19 Dec 2019
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual DialogAAAI Conference on Artificial Intelligence (AAAI), 2019
Feilong Chen
Fandong Meng
Jiaming Xu
Peng Li
Bo Xu
Jie Zhou
189
34
0
18 Dec 2019
Meshed-Memory Transformer for Image Captioning
Meshed-Memory Transformer for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2019
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
263
1,037
0
17 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Action Genome: Actions as Composition of Spatio-temporal Scene GraphsComputer Vision and Pattern Recognition (CVPR), 2019
Jingwei Ji
Ranjay Krishna
Li Fei-Fei
Juan Carlos Niebles
266
391
0
15 Dec 2019
Fast Image Caption Generation with Position Alignment
Fast Image Caption Generation with Position Alignment
Z. Fei
144
42
0
13 Dec 2019
Connecting Vision and Language with Localized Narratives
Connecting Vision and Language with Localized NarrativesEuropean Conference on Computer Vision (ECCV), 2019
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
504
288
0
06 Dec 2019
Generating Videos of Zero-Shot Compositions of Actions and Objects
Generating Videos of Zero-Shot Compositions of Actions and Objects
Megha Nawhal
Mengyao Zhai
Andreas M. Lehrmann
Leonid Sigal
Greg Mori
230
1
0
05 Dec 2019
Better Understanding Hierarchical Visual Relationship for Image Caption
Better Understanding Hierarchical Visual Relationship for Image Caption
Z. Fei
122
1
0
04 Dec 2019
Learning to Relate from Captions and Bounding Boxes
Learning to Relate from Captions and Bounding BoxesAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Sarthak Garg
Joel Ruben Antony Moniz
Anshu Aviral
Priyatham Bollimpalli
162
4
0
01 Dec 2019
Identifying Model Weakness with Adversarial Examiner
Identifying Model Weakness with Adversarial ExaminerAAAI Conference on Artificial Intelligence (AAAI), 2019
Michelle Shu
Chenxi Liu
Weichao Qiu
Alan Yuille
AAMLELM
196
23
0
25 Nov 2019
Injecting Prior Knowledge into Image Caption Generation
Injecting Prior Knowledge into Image Caption Generation
A. Goel
Basura Fernando
Thanh-Son Nguyen
Hakan Bilen
161
0
0
22 Nov 2019
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
Reinforcing an Image Caption Generator Using Off-Line Human FeedbackAAAI Conference on Artificial Intelligence (AAAI), 2019
Paul Hongsuck Seo
Piyush Sharma
Tomer Levinboim
Bohyung Han
Radu Soricut
OffRL
184
23
0
21 Nov 2019
On Architectures for Including Visual Information in Neural Language
  Models for Image Description
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
139
2
0
09 Nov 2019
CommonGen: A Constrained Text Generation Challenge for Generative
  Commonsense Reasoning
CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning
Bill Yuchen Lin
Wangchunshu Zhou
Minghan Shen
Pei Zhou
Chandra Bhagavatula
Yu Xing
Xiang Ren
LRM
405
16
0
09 Nov 2019
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset
Sahana Ramnath
Amrita Saha
Soumen Chakrabarti
Mitesh M. Khapra
3DV
121
15
0
03 Nov 2019
Hidden State Guidance: Improving Image Captioning using An Image
  Conditioned Autoencoder
Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Jialin Wu
Raymond J. Mooney
134
0
0
31 Oct 2019
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Semantic Object Accuracy for Generative Text-to-Image SynthesisIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
433
179
0
29 Oct 2019
Cross-modal Scene Graph Matching for Relationship-aware Image-Text
  Retrieval
Cross-modal Scene Graph Matching for Relationship-aware Image-Text RetrievalIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Sijin Wang
Ruiping Wang
Ziwei Yao
Shiguang Shan
Xilin Chen
3DV
205
239
0
11 Oct 2019
Automatic Quality Estimation for Natural Language Generation: Ranting
  (Jointly Rating and Ranking)
Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)International Conference on Natural Language Generation (INLG), 2019
Ondrej Dusek
Karin Sevegnani
Ioannis Konstas
Verena Rieser
ALM
144
10
0
10 Oct 2019
Semantic-aware Image Deblurring
Semantic-aware Image Deblurring
Fuhai Chen
Rongrong Ji
Chengpeng Dai
Xiaoshuai Sun
Chia-Wen Lin
Jiayi Ji
Baochang Zhang
Feiyue Huang
Liujuan Cao
BDLVLM
208
7
0
09 Oct 2019
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and CameraIEEE International Conference on Computer Vision (ICCV), 2019
Iro Armeni
Zhi-Yang He
JunYoung Gwak
Amir Zamir
Martin Fischer
Jitendra Malik
Silvio Savarese
3DV3DPC
303
422
0
06 Oct 2019
Generalization in Generation: A closer look at Exposure Bias
Generalization in Generation: A closer look at Exposure BiasConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Florian Schmidt
257
117
0
01 Oct 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
154
40
0
22 Sep 2019
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event
  Captioning
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event CaptioningIEEE International Conference on Computer Vision (ICCV), 2019
Tanzila Rahman
Bicheng Xu
Leonid Sigal
197
86
0
22 Sep 2019
Adaptively Aligned Image Captioning via Adaptive Attention Time
Adaptively Aligned Image Captioning via Adaptive Attention TimeNeural Information Processing Systems (NeurIPS), 2019
Lun Huang
Wenmin Wang
Yaxian Xia
Jie Chen
198
67
0
19 Sep 2019
Communication-based Evaluation for Natural Language Generation
Communication-based Evaluation for Natural Language Generation
Benjamin Newman
Reuben Cohn-Gordon
Christopher Potts
149
8
0
16 Sep 2019
Automatically Extracting Challenge Sets for Non local Phenomena in
  Neural Machine Translation
Automatically Extracting Challenge Sets for Non local Phenomena in Neural Machine TranslationConference on Computational Natural Language Learning (CoNLL), 2019
Leshem Choshen
Omri Abend
295
19
0
15 Sep 2019
Scene Graph Parsing by Attention Graph
Scene Graph Parsing by Attention Graph
Martin Andrews
Yew Ken Chia
Sam Witteveen
GNN
99
12
0
13 Sep 2019
What Makes A Good Story? Designing Composite Rewards for Visual
  Storytelling
What Makes A Good Story? Designing Composite Rewards for Visual StorytellingAAAI Conference on Artificial Intelligence (AAAI), 2019
Junjie Hu
Yu Cheng
Zhe Gan
Jingjing Liu
Jianfeng Gao
Graham Neubig
229
72
0
11 Sep 2019
Compositional Generalization in Image Captioning
Compositional Generalization in Image CaptioningConference on Computational Natural Language Learning (CoNLL), 2019
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
253
49
0
10 Sep 2019
FDA: Feature Disruptive Attack
FDA: Feature Disruptive AttackIEEE International Conference on Computer Vision (ICCV), 2019
Aditya Ganeshan
S. VivekB.
R. Venkatesh Babu
AAML
276
131
0
10 Sep 2019
Hierarchy Parsing for Image Captioning
Hierarchy Parsing for Image CaptioningIEEE International Conference on Computer Vision (ICCV), 2019
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
VLM
212
181
0
09 Sep 2019
Previous
123...161718192021
Next
Page 17 of 21
Pageof 21