Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1607.08822
Cited By
SPICE: Semantic Propositional Image Caption Evaluation
29 July 2016
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SPICE: Semantic Propositional Image Caption Evaluation"
50 / 1,002 papers shown
Local Interpretations for Explainable Natural Language Processing: A Survey
ACM Computing Surveys (CSUR), 2021
Siwen Luo
Michal Guerquin
S. Han
Josiah Poon
MILM
418
64
0
20 Mar 2021
Constrained Text Generation with Global Guidance -- Case Study on CommonGen
Yixian Liu
Liwen Zhang
Wenjuan Han
Yue Zhang
Kewei Tu
179
10
0
12 Mar 2021
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
International Journal of Computer Vision (IJCV), 2021
Andrew Shin
Masato Ishii
T. Narihira
310
51
0
06 Mar 2021
Causal Attention for Vision-Language Tasks
Computer Vision and Pattern Recognition (CVPR), 2021
Xu Yang
Hanwang Zhang
Guojun Qi
Jianfei Cai
CML
239
195
0
05 Mar 2021
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RA-L), 2021
A. Magassouba
K. Sugiura
Hisashi Kawai
148
14
0
01 Mar 2021
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Xuenan Xu
Heinrich Dinkel
Mengyue Wu
Zeyu Xie
Kai Yu
171
64
0
23 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
463
276
0
20 Feb 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Computer Vision and Pattern Recognition (CVPR), 2021
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
1.2K
1,370
0
17 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
126
24
0
14 Feb 2021
The Role of the Input in Natural Language Video Description
IEEE transactions on multimedia (TMM), 2020
S. Cascianelli
G. Costante
Alessandro Devo
Thomas Alessandro Ciarfuglia
P. Valigi
M. L. Fravolini
154
5
0
09 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
International Conference on Machine Learning (ICML), 2021
Jaemin Cho
Jie Lei
Hao Tan
Joey Tianyi Zhou
MLLM
614
611
0
04 Feb 2021
The Role of Syntactic Planning in Compositional Image Captioning
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Emanuele Bugliarello
Desmond Elliott
CoGe
107
15
0
28 Jan 2021
On the Evaluation of Vision-and-Language Navigation Instructions
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
487
59
0
26 Jan 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
158
8
0
25 Jan 2021
Fast Sequence Generation with Multi-Agent Reinforcement Learning
Longteng Guo
Jing Liu
Xinxin Zhu
Hanqing Lu
LRM
181
8
0
24 Jan 2021
Macroscopic Control of Text Generation for Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
196
4
0
20 Jan 2021
Diagnostic Captioning: A Survey
Knowledge and Information Systems (KAIS), 2021
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
237
32
0
18 Jan 2021
Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs
Antoni Rosinol
Andrew Violette
Marcus Abate
Nathan Hughes
Yun Chang
Jingang Shi
Arjun Gupta
Luca Carlone
3DV
448
300
0
18 Jan 2021
Dual-Level Collaborative Transformer for Image Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Yunpeng Luo
Jiayi Ji
Xiaoshuai Sun
Liujuan Cao
Yongjian Wu
Feiyue Huang
Chia-Wen Lin
Rongrong Ji
ViT
247
329
0
16 Jan 2021
On-the-Fly Attention Modulation for Neural Generation
Findings (Findings), 2021
Yue Dong
Chandra Bhagavatula
Ximing Lu
Jena D. Hwang
Antoine Bosselut
Jackie C.K. Cheung
Yejin Choi
298
17
0
02 Jan 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
191
74
0
31 Dec 2020
Image-to-Image Retrieval by Learning Similarity between Scene Graphs
AAAI Conference on Artificial Intelligence (AAAI), 2020
Sangwoong Yoon
Woo-Young Kang
Sungwook Jeon
SeongEun Lee
C. Han
Jonghun Park
Eun-Sol Kim
3DH
220
54
0
29 Dec 2020
WEmbSim: A Simple yet Effective Metric for Image Captioning
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2020
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
113
2
0
24 Dec 2020
LCEval: Learned Composite Metric for Caption Evaluation
International Journal of Computer Vision (IJCV), 2019
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
136
8
0
24 Dec 2020
SubICap: Towards Subword-informed Image Captioning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Naeha Sharif
Bennamoun
Wei Liu
Syed Afaq Ali Shah
112
2
0
24 Dec 2020
Lexically-constrained Text Generation through Commonsense Knowledge Extraction and Injection
Yikang Li
P. Goel
Varsha Kuppur Rajendra
H. Singh
Jonathan M Francis
Kaixin Ma
Eric Nyberg
A. Oltramari
208
7
0
19 Dec 2020
AutoCaption: Image Captioning with Neural Architecture Search
Xinxin Zhu
Weining Wang
Longteng Guo
Jing Liu
277
11
0
16 Dec 2020
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
103
1
0
14 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
206
201
0
13 Dec 2020
MiniVLM: A Smaller and Faster Vision-Language Model
Jianfeng Wang
Xiaowei Hu
Pengchuan Zhang
Xiujun Li
Lijuan Wang
Guang Dai
Jianfeng Gao
Zicheng Liu
VLM
MLLM
265
70
0
13 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
AAAI Conference on Artificial Intelligence (AAAI), 2020
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
224
34
0
10 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
AAAI Conference on Artificial Intelligence (AAAI), 2020
Qi Zhu
Chenyu Gao
Peng Wang
Qi Wu
208
58
0
09 Dec 2020
Towards Annotation-Free Evaluation of Cross-Lingual Image Captioning
Aozhu Chen
Xinyi Huang
Hailan Lin
Xirong Li
217
5
0
09 Dec 2020
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Zhengyuan Yang
Yijuan Lu
Jianfeng Wang
Xi Yin
D. Florêncio
Lijuan Wang
Cha Zhang
Lei Zhang
Jiebo Luo
VLM
266
159
0
08 Dec 2020
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang
Renda Bao
Qi Wu
Si Liu
349
28
0
07 Dec 2020
An Enhanced Knowledge Injection Model for Commonsense Generation
International Conference on Computational Linguistics (COLING), 2020
Zhihao Fan
Yeyun Gong
Zhongyu Wei
Siyuan Wang
Ya-Chieh Huang
Jian Jiao
Xuanjing Huang
Nan Duan
Ruofei Zhang
262
30
0
01 Dec 2020
Language-Driven Region Pointer Advancement for Controllable Image Captioning
International Conference on Computational Linguistics (COLING), 2020
Annika Lindh
R. Ross
John D. Kelleher
122
14
0
30 Nov 2020
A Comprehensive Review on Recent Methods and Challenges of Video Description
Ashutosh Kumar Singh
Thoudam Doren Singh
Sivaji Bandyopadhyay
3DV
VLM
203
5
0
30 Nov 2020
FFCI: A Framework for Interpretable Automatic Evaluation of Summarization
Journal of Artificial Intelligence Research (JAIR), 2020
Fajri Koto
Timothy Baldwin
Jey Han Lau
HILM
261
40
0
27 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language
Hassan Akbari
Hamid Palangi
Jianwei Yang
Sudha Rao
Asli Celikyilmaz
Roland Fernandez
P. Smolensky
Jianfeng Gao
Shih-Fu Chang
210
3
0
18 Nov 2020
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze
Ece Takmaz
Sandro Pezzelle
Lisa Beinborn
Raquel Fernández
217
25
0
09 Nov 2020
A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems
Craig Thomson
Ehud Reiter
140
53
0
08 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces
Neural Information Processing Systems (NeurIPS), 2020
Shweta Mahajan
Stefan Roth
206
46
0
02 Nov 2020
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Jia-Hong Huang
Chao-Han Huck Yang
Fangyu Liu
Meng Tian
Yi-Chieh Liu
...
Kang Wang
Hiromasa Morikawa
Hernghua Chang
Jesper N. Tegnér
M. Worring
MedIm
213
66
0
01 Nov 2020
Fusion Models for Improved Visual Captioning
M. Kalimuthu
Aditya Mogadala
Marius Mosbach
Dietrich Klakow
VLM
197
2
0
28 Oct 2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
International Conference on Learning Representations (ICLR), 2020
Wangchunshu Zhou
Dong-Ho Lee
Ravi Kiran Selvam
Seyeon Lee
Bill Yuchen Lin
Xiang Ren
LRM
VLM
258
73
0
24 Oct 2020
WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information
An Tran
Konstantinos Drossos
Maria Sandsten
208
19
0
21 Oct 2020
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images
ACM Computing Surveys (ACM CSUR), 2020
Pablo Messina
Pablo Pino
Denis Parra
Alvaro Soto
Cecilia Besa
S. Uribe
Marcelo andía
C. Tejos
Claudia Prieto
Daniel Capurro
MedIm
327
82
0
20 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
282
6
0
19 Oct 2020
Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey
Khyathi Chandu
A. Black
261
0
0
14 Oct 2020
Previous
1
2
3
...
14
15
16
...
19
20
21
Next
Page 15 of 21
Page
of 21
Go