Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1711.09151
Cited By
Convolutional Image Captioning
24 November 2017
J. Aneja
Aditya Deshpande
Alex Schwing
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Convolutional Image Captioning"
50 / 105 papers shown
Title
ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization
T. Nguyen
Thanh-Tung Phan-Nguyen
Gia-Huy Dinh
Lam-Huy Nguyen
M. Tran
T. Le
60
0
0
01 Sep 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
International Journal of Computer Vision (IJCV), 2024
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
315
0
0
03 Apr 2025
ChatBEV: A Visual Language Model that Understands BEV Maps
Qingyao Xu
Tian Jin
Guang Chen
Yanfeng Wang
Yujiao Shi
307
2
0
18 Mar 2025
Pixels to Prose: Understanding the art of Image Captioning
Hrishikesh Singh
Aarti Sharma
Millie Pant
3DV
VLM
186
2
0
28 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
328
0
0
09 Aug 2024
Compressed Image Captioning using CNN-based Encoder-Decoder Framework
Md Alif
Mahmudul Hasan
Shovon Bhowmick
195
2
0
28 Apr 2024
Context-Guided Spatio-Temporal Video Grounding
Computer Vision and Pattern Recognition (CVPR), 2024
Xin Gu
Hengrui Fan
Yan Huang
Tiejian Luo
Libo Zhang
228
37
0
03 Jan 2024
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
337
29
0
24 Sep 2023
Diagnosing Human-object Interaction Detectors
International Journal of Computer Vision (IJCV), 2023
Fangrui Zhu
Yiming Xie
Weidi Xie
Huaizu Jiang
197
10
0
16 Aug 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023
Ruiyang Xia
Decheng Liu
Jie Li
Lin Yuan
N. Wang
Xinbo Gao
139
34
0
06 Jul 2023
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language
Mihai Masala
Nicolae Cudlenco
Traian Rebedea
Marius Leordeanu
152
0
0
22 May 2023
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users
Journal of Information Processing (JIP), 2023
Wataru Kawabe
Yusuke Sugano
VLM
136
2
0
11 May 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Journal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
312
63
0
14 Feb 2023
Overcoming Catastrophic Forgetting by XAI
Giang Nguyen
162
0
0
25 Nov 2022
Improving Radiology Summarization with Radiograph and Anatomy Prompts
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jinpeng Hu
Zhihong Chen
Yang Liu
Xiang Wan
Tsung-Hui Chang
MedIm
148
10
0
15 Oct 2022
M^4I: Multi-modal Models Membership Inference
Neural Information Processing Systems (NeurIPS), 2022
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
189
35
0
15 Sep 2022
Facial Expression Recognition and Image Description Generation in Vietnamese
Fuzzy Systems and Data Mining (FSDM), 2022
Khang Nhut Lam
Kim Thi-Thanh Nguyen
Loc Huu Nguy
Jugal Kalita
3DH
CVBM
150
1
0
12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
99
0
0
09 Aug 2022
Retrieval-Augmented Transformer for Image Captioning
International Conference on Content-Based Multimedia Indexing (CBMI), 2022
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
168
68
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Signal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
199
9
0
04 Jul 2022
Measuring Representational Harms in Image Captioning
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Angelina Wang
Solon Barocas
Kristen Laird
Hanna M. Wallach
194
60
0
14 Jun 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search
IEEE Transactions on Image Processing (IEEE TIP), 2022
Tianlin Li
Zhe Chen
Bo Jiang
Jin Tang
Bin Luo
Dacheng Tao
296
21
0
19 May 2022
Diverse Image Captioning with Grounded Style
German Conference on Pattern Recognition (GCPR), 2022
Franz Klein
Shweta Mahajan
S. Roth
159
8
0
03 May 2022
Controllable Image Captioning
Luka Maxwell
280
0
0
28 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
153
23
0
08 Apr 2022
CaMEL: Mean Teacher Learning for Image Captioning
International Conference on Pattern Recognition (ICPR), 2022
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
180
37
0
21 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
ACM Computing Surveys (ACM CSUR), 2022
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
310
141
0
31 Jan 2022
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2022
Yingying Zhao
Yuhu Chang
Yutian Lu
Yujiang Wang
Mingzhi Dong
...
Robert P. Dick
Fan Yang
Tun Lu
Ning Gu
L. Shang
172
15
0
24 Jan 2022
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
117
1
0
23 Jan 2022
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
A. Rajagopal
V. Nirmala
Arun Muthuraj Vedamanickam
152
0
0
04 Jan 2022
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
168
57
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Computer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
295
230
0
29 Nov 2021
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
102
7
0
17 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
103
20
0
11 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
206
32
0
07 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
168
19
0
20 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics
Yehao Li
Yingwei Pan
Jingwen Chen
Ting Yao
Tao Mei
VLM
166
36
0
18 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
IEEE International Conference on Computer Vision (ICCV), 2021
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
238
282
0
09 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ACM Multimedia (ACM MM), 2021
Xuewen Yang
Yingru Liu
Xin Wang
ViT
183
62
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
379
342
0
14 Jul 2021
Multi-Modal Image Captioning for the Visually Impaired
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Hiba Ahsan
Nikita Bhalla
Daivat Bhatt
Kaivankumar Shah
147
27
0
17 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching
Computer Vision and Pattern Recognition (CVPR), 2021
Shiyang Yan
Li Yu
Yuan Xie
234
34
0
21 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLM
VGen
95
26
0
07 Apr 2021
Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking
Tianlin Li
Zhe Chen
Jin Tang
Bin Luo
Yaowei Wang
Yonghong Tian
Feng Wu
182
49
0
30 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation
Sulabh Katiyar
S. Borgohain
99
0
0
08 Mar 2021
Comparative evaluation of CNN architectures for Image Caption Generation
International Journal of Advanced Computer Science and Applications (IJACSA), 2021
Sulabh Katiyar
S. Borgohain
111
27
0
23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
128
15
0
22 Feb 2021
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
83
1
0
14 Dec 2020
Robust Image Captioning
Daniel Yarnell
Xian Wang
93
0
0
06 Dec 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
IEEE transactions on multimedia (TMM), 2020
Litao Yu
Jian Zhang
Qiang Wu
280
56
0
02 Nov 2020
1
2
3
Next