Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1607.08822
Cited By
SPICE: Semantic Propositional Image Caption Evaluation
29 July 2016
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SPICE: Semantic Propositional Image Caption Evaluation"
50 / 1,002 papers shown
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
115
0
0
09 Aug 2022
Distinctive Image Captioning via CLIP Guided Group Optimization
Youyuan Zhang
Jiuniu Wang
Hao Wu
Wenjia Xu
VLM
376
9
0
08 Aug 2022
Prompt Tuning for Generative Multimodal Pretrained Models
Han Yang
Junyang Lin
An Yang
Peng Wang
Chang Zhou
Hongxia Yang
VLM
LRM
VPVLM
183
37
0
04 Aug 2022
Retrieval-Augmented Transformer for Image Captioning
International Conference on Content-Based Multimedia Indexing (CBMI), 2022
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
202
69
0
26 Jul 2022
Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations
ACM Multimedia (ACM MM), 2022
Qian Yang
Yunxin Li
Baotian Hu
Lin Ma
Yuxin Ding
Min Zhang
240
11
0
23 Jul 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
233
33
0
22 Jul 2022
Efficient Modeling of Future Context for Image Captioning
ACM Multimedia (ACM MM), 2022
Zhengcong Fei
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
216
17
0
22 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
285
383
0
20 Jul 2022
GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features
European Conference on Computer Vision (ECCV), 2022
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
ViT
218
148
0
20 Jul 2022
Explicit Image Caption Editing
European Conference on Computer Vision (ECCV), 2022
Zhen Wang
Long Chen
Wenbo Ma
G. Han
Yulei Niu
Jian Shao
Jun Xiao
191
14
0
20 Jul 2022
Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation
Chao Zheng
Lianli Gao
Xinyu Lyu
Pengpeng Zeng
Abdulmotaleb El Saddik
Hengtao Shen
178
26
0
16 Jul 2022
Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xinyu Lyu
Lianli Gao
Pengpeng Zeng
Hengtao Shen
Jingkuan Song
224
21
0
11 Jul 2022
Predicting Word Learning in Children from the Performance of Computer Vision Systems
Annual Meeting of the Cognitive Science Society (CogSci), 2022
Sunayana Rane
Mira L. Nencheva
Zeyu Wang
C. Lew‐Williams
Olga Russakovsky
Thomas Griffiths
252
3
0
07 Jul 2022
Dual-Stream Transformer for Generic Event Boundary Captioning
Xin Gu
Hanhua Ye
Guang Chen
Yufei Wang
Libo Zhang
Longyin Wen
132
4
0
07 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Signal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
226
10
0
04 Jul 2022
Rethinking Surgical Captioning: End-to-End Window-Based MLP Transformer Using Patches
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Mengya Xu
Mobarakol Islam
Hongliang Ren
MedIm
180
14
0
30 Jun 2022
ZoDIAC: Zoneout Dropout Injection Attention Calculation
Zanyar Zohourianshahzadi
Terrance Boult
Jugal Kalita
293
0
0
28 Jun 2022
From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zihao Zhu
NAI
ReLM
GNN
245
4
0
25 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
753
1,371
0
22 Jun 2022
REVECA -- Rich Encoder-decoder framework for Video Event CAptioner
Jaehyuk Heo
YongGi Jeong
Sunwoo Kim
Jaehee Kim
Pilsung Kang
104
0
0
18 Jun 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Neural Information Processing Systems (NeurIPS), 2022
Zi-Yi Dou
Aishwarya Kamath
Zhe Gan
Pengchuan Zhang
Jianfeng Wang
...
Ce Liu
Yann LeCun
Nanyun Peng
Jianfeng Gao
Lijuan Wang
VLM
ObjD
296
152
0
15 Jun 2022
Measuring Representational Harms in Image Captioning
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Angelina Wang
Solon Barocas
Kristen Laird
Hanna M. Wallach
258
60
0
14 Jun 2022
Comprehending and Ordering Semantics for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2022
Yehao Li
Yingwei Pan
Ting Yao
Tao Mei
193
114
0
14 Jun 2022
Language Models are General-Purpose Interfaces
Y. Hao
Haoyu Song
Li Dong
Shaohan Huang
Zewen Chi
Wenhui Wang
Shuming Ma
Furu Wei
MLLM
226
110
0
13 Jun 2022
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Rachit Bansal
Milan Aggarwal
S. Bhatia
Jivat Neet Kaur
Balaji Krishnamurthy
180
4
0
12 Jun 2022
Improving Image Captioning with Control Signal of Sentence Quality
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhangzi Zhu
Hong Qu
274
0
0
07 Jun 2022
Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Andrew Koh
Soham Dinesh Tiwari
Chng Eng Siong
135
1
0
04 Jun 2022
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Neural Information Processing Systems (NeurIPS), 2022
Yujia Xie
Luowei Zhou
Xiyang Dai
Lu Yuan
Nguyen Bach
Ce Liu
Michael Zeng
VLM
MLLM
189
30
0
03 Jun 2022
BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset
International Conference on Language Resources and Evaluation (LREC), 2022
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
237
6
0
28 May 2022
GIT: A Generative Image-to-text Transformer for Vision and Language
Jianfeng Wang
Zhengyuan Yang
Xiaowei Hu
Linjie Li
Kevin Qinghong Lin
Zhe Gan
Zicheng Liu
Ce Liu
Lijuan Wang
VLM
613
714
0
27 May 2022
A Survey on Long-Tailed Visual Recognition
International Journal of Computer Vision (IJCV), 2022
Pu Cao
He Jiang
Q. Song
Jun Guo
307
163
0
27 May 2022
Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach
Chao Zhao
Faeze Brahman
Tenghao Huang
Snigdha Chaturvedi
LRM
204
5
0
26 May 2022
Prompt-based Learning for Unpaired Image Captioning
IEEE transactions on multimedia (IEEE TMM), 2022
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chen Chen
VLM
229
45
0
26 May 2022
Fine-grained Image Captioning with CLIP Reward
Jaemin Cho
Seunghyun Yoon
Ajinkya Kale
Franck Dernoncourt
Trung Bui
Joey Tianyi Zhou
CLIP
391
99
0
26 May 2022
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models
Neural Information Processing Systems (NeurIPS), 2022
Jin-Hwa Kim
Yunji Kim
Jiyoung Lee
Kang Min Yoo
Sang-Woo Lee
EGVM
349
41
0
25 May 2022
Context Matters for Image Descriptions for Accessibility: Challenges for Referenceless Evaluation Metrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Elisa Kreiss
Cynthia L. Bennett
Shayan Hooshmand
E. Zelikman
Meredith Ringel Morris
Christopher Potts
246
35
0
21 May 2022
What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
David M. Chan
Austin Myers
Sudheendra Vijayanarasimhan
David A. Ross
Bryan Seybold
John F. Canny
205
7
0
12 May 2022
Automated Audio Captioning: An Overview of Recent Progress and New Challenges
EURASIP Journal on Audio, Speech, and Music Processing (EURASIP J. Audio Speech Music Process.), 2022
Xinhao Mei
Xubo Liu
Mark D. Plumbley
Wenwu Wang
299
54
0
12 May 2022
Beyond the Status Quo: A Contemporary Survey of Advances and Challenges in Audio Captioning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Xuenan Xu
Zeyu Xie
Mengyue Wu
K. Yu
286
23
0
11 May 2022
RoViST:Learning Robust Metrics for Visual Storytelling
Eileen Wang
S. Han
Josiah Poon
166
13
0
08 May 2022
Language Models Can See: Plugging Visual Controls in Text Generation
Yixuan Su
Tian Lan
Yahui Liu
Fangyu Liu
Dani Yogatama
Yan Wang
Lingpeng Kong
Nigel Collier
VLM
MLLM
274
111
0
05 May 2022
Reducing Predictive Feature Suppression in Resource-Constrained Contrastive Image-Caption Retrieval
Maurits J. R. Bleeker
Andrew Yates
Maarten de Rijke
300
4
0
28 Apr 2022
Controllable Image Captioning
Luka Maxwell
364
0
0
28 Apr 2022
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text
Computer Vision and Pattern Recognition (CVPR), 2022
Pinaki Nath Chowdhury
A. Bhunia
Aneeshan Sain
Subhadeep Koley
Tao Xiang
Yi-Zhe Song
407
37
0
25 Apr 2022
Caption Feature Space Regularization for Audio Captioning
Yiming Zhang
Hong Yu
Ruoyi Du
Zhanyu Ma
Yuan Dong
256
3
0
18 Apr 2022
Non-Parallel Text Style Transfer with Self-Parallel Supervision
International Conference on Learning Representations (ICLR), 2022
Ruibo Liu
Chongyang Gao
Chenyan Jia
Guangxuan Xu
Soroush Vosoughi
VLM
176
18
0
18 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
IEEE Transactions on Image Processing (IEEE TIP), 2022
Gen Luo
Weihao Ye
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
158
57
0
16 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
188
23
0
08 Apr 2022
GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval
European Conference on Computer Vision (ECCV), 2022
Yuxuan Wang
Difei Gao
Licheng Yu
Stan Weixian Lei
Matt Feiszli
Mike Zheng Shou
590
29
0
01 Apr 2022
Reproducibility Issues for BERT-based Evaluation Metrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yanran Chen
Jonas Belouadi
Steffen Eger
415
20
0
30 Mar 2022
Previous
1
2
3
...
10
11
12
...
19
20
21
Next
Page 11 of 21
Page
of 21
Go