Convolutional Image Captioning

24 November 2017

Papers citing "Convolutional Image Captioning"

50 / 105 papers shown

Title
ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization T. Nguyen Thanh-Tung Phan-Nguyen Gia-Huy Dinh Lam-Huy Nguyen M. Tran T. Le 60 0 0 01 Sep 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and AttentionInternational Journal of Computer Vision (IJCV), 2024 Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 315 0 0 03 Apr 2025
ChatBEV: A Visual Language Model that Understands BEV Maps Qingyao Xu Tian Jin Guang Chen Yanfeng Wang Yujiao Shi 307 2 0 18 Mar 2025
Pixels to Prose: Understanding the art of Image Captioning Hrishikesh Singh Aarti Sharma Millie Pant 3DV VLM 186 2 0 28 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis Uri Berger Gabriel Stanovsky Omri Abend Lea Frermann 328 0 0 09 Aug 2024
Compressed Image Captioning using CNN-based Encoder-Decoder Framework Md Alif Mahmudul Hasan Shovon Bhowmick 195 2 0 28 Apr 2024
Context-Guided Spatio-Temporal Video GroundingComputer Vision and Pattern Recognition (CVPR), 2024 Xin Gu Hengrui Fan Yan Huang Tiejian Luo Libo Zhang 228 37 0 03 Jan 2024
Survey of Social Bias in Vision-Language Models Nayeon Lee Yejin Bang Holy Lovenia Samuel Cahyawijaya Wenliang Dai Pascale Fung VLM 337 29 0 24 Sep 2023
Diagnosing Human-object Interaction DetectorsInternational Journal of Computer Vision (IJCV), 2023 Fangrui Zhu Yiming Xie Weidi Xie Huaizu Jiang 197 10 0 16 Aug 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake DetectionIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023 Ruiyang Xia Decheng Liu Jie Li Lin Yuan N. Wang Xinbo Gao 139 34 0 06 Jul 2023
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language Mihai Masala Nicolae Cudlenco Traian Rebedea Marius Leordeanu 152 0 0 22 May 2023
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert UsersJournal of Information Processing (JIP), 2023 Wataru Kawabe Yusuke Sugano VLM 136 2 0 11 May 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future DirectionsJournal of Computing and Information Science in Engineering (JCISE), 2023 Binyang Song Ruilin Zhou Faez Ahmed AI4CE 312 63 0 14 Feb 2023
Overcoming Catastrophic Forgetting by XAI Giang Nguyen 162 0 0 25 Nov 2022
Improving Radiology Summarization with Radiograph and Anatomy PromptsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Jinpeng Hu Zhihong Chen Yang Liu Xiang Wan Tsung-Hui Chang MedIm 148 10 0 15 Oct 2022
M^4I: Multi-modal Models Membership InferenceNeural Information Processing Systems (NeurIPS), 2022 Pingyi Hu Zihan Wang Ruoxi Sun Hu Wang Minhui Xue 189 35 0 15 Sep 2022
Facial Expression Recognition and Image Description Generation in VietnameseFuzzy Systems and Data Mining (FSDM), 2022 Khang Nhut Lam Kim Thi-Thanh Nguyen Loc Huu Nguy Jugal Kalita 3DH CVBM 150 1 0 12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2 Xinghui Zhou Xin Jin Jianwen Lv Heng Huang Ming Mao Shuai Cui CoGe 99 0 0 09 Aug 2022
Retrieval-Augmented Transformer for Image CaptioningInternational Conference on Content-Based Multimedia Indexing (CBMI), 2022 Sara Sarto Marcella Cornia Lorenzo Baraldi Rita Cucchiara 168 68 0 26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metricsSignal processing. Image communication (SPIC), 2022 Othón González-Chávez Guillermo Ruiz Daniela Moctezuma Tania A. Ramirez-delreal 199 9 0 04 Jul 2022
Measuring Representational Harms in Image CaptioningConference on Fairness, Accountability and Transparency (FAccT), 2022 Angelina Wang Solon Barocas Kristen Laird Hanna M. Wallach 194 60 0 14 Jun 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchIEEE Transactions on Image Processing (IEEE TIP), 2022 Tianlin Li Zhe Chen Bo Jiang Jin Tang Bin Luo Dacheng Tao 296 21 0 19 May 2022
Diverse Image Captioning with Grounded StyleGerman Conference on Pattern Recognition (GCPR), 2022 Franz Klein Shweta Mahajan S. Roth 159 8 0 03 May 2022
Controllable Image Captioning Luka Maxwell 280 0 0 28 Apr 2022
On Distinctive Image Captioning via Comparing and ReweightingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 153 23 0 08 Apr 2022
CaMEL: Mean Teacher Learning for Image CaptioningInternational Conference on Pattern Recognition (ICPR), 2022 Manuele Barraco Matteo Stefanini Marcella Cornia S. Cascianelli Lorenzo Baraldi Rita Cucchiara ViT VLM 180 37 0 21 Feb 2022
Deep Learning Approaches on Image Captioning: A ReviewACM Computing Surveys (ACM CSUR), 2022 Taraneh Ghandi H. Pourreza H. Mahyar VLM 310 141 0 31 Jan 2022
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear DevicesProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2022 Yingying Zhao Yuhu Chang Yutian Lu Yujiang Wang Mingzhi Dong ... Robert P. Dick Fan Yang Tun Lu Ning Gu L. Shang 172 15 0 24 Jan 2022
An Integrated Approach for Video Captioning and Applications Soheyla Amirian T. Taha Khaled Rasheed H. Arabnia 117 1 0 23 Jan 2022
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety A. Rajagopal V. Nirmala Arun Muthuraj Vedamanickam 152 0 0 04 Jan 2022
Neural Attention for Image Captioning: Review of Outstanding Methods Zanyar Zohourianshahzadi Jugal Kalita VLM 168 57 0 29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic ArithmeticComputer Vision and Pattern Recognition (CVPR), 2021 Yoad Tewel Yoav Shalev Idan Schwartz Lior Wolf VLM 295 230 0 29 Nov 2021
Cross Modification Attention Based Deliberation Model for Image Captioning Zheng Lian Yanan Zhang Haichang Li Rui Wang Xiaohui Hu 102 7 0 17 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach Faisal Muhammad Shah Mayeesha Humaira Md Abidur Rahman Khan Jim Amit Saha Ami Shimul Paul 103 20 0 11 Sep 2021
Journalistic Guidelines Aware News Image CaptioningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Xuewen Yang Svebor Karaman Joel R. Tetreault Alex Jaimes 206 32 0 07 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 168 19 0 20 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics Yehao Li Yingwei Pan Jingwen Chen Ting Yao Tao Mei VLM 166 36 0 18 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language ModelsIEEE International Conference on Computer Vision (ICCV), 2021 Zheyuan Liu Cristian Rodriguez-Opazo Damien Teney Stephen Gould VLM 238 282 0 09 Aug 2021
ReFormer: The Relational Transformer for Image CaptioningACM Multimedia (ACM MM), 2021 Xuewen Yang Yingru Liu Xin Wang ViT 183 62 0 29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 379 342 0 14 Jul 2021
Multi-Modal Image Captioning for the Visually ImpairedNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 Hiba Ahsan Nikita Bhalla Daivat Bhatt Kaivankumar Shah 147 27 0 17 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text MatchingComputer Vision and Pattern Recognition (CVPR), 2021 Shiyang Yan Li Yu Yuan Xie 234 34 0 21 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Soheyla Amirian Khaled Rasheed T. Taha H. Arabnia VLM VGen 95 26 0 07 Apr 2021
Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking Tianlin Li Zhe Chen Jin Tang Bin Luo Yaowei Wang Yonghong Tian Feng Wu 182 49 0 30 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation Sulabh Katiyar S. Borgohain 99 0 0 08 Mar 2021
Comparative evaluation of CNN architectures for Image Caption GenerationInternational Journal of Advanced Computer Science and Applications (IJACSA), 2021 Sulabh Katiyar S. Borgohain 111 27 0 23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation Sulabh Katiyar S. Borgohain VLM 128 15 0 22 Feb 2021
Intrinsic Image Captioning Evaluation Chao Zeng Sam Kwong 83 1 0 14 Dec 2020
Robust Image Captioning Daniel Yarnell Xian Wang 93 0 0 06 Dec 2020
Dual Attention on Pyramid Feature Maps for Image CaptioningIEEE transactions on multimedia (TMM), 2020 Litao Yu Jian Zhang Qiang Wu 280 56 0 02 Nov 2020