
Areas of Attention for Image Captioning

3 December 2016

Papers citing "Areas of Attention for Image Captioning"

50 / 51 papers shown

Title
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation. Computer Science Review (CSR), 2025 Israa A. Albadarneh Bassam Hammo Omar Al-Kadi VLM 151 2 0 03 Jun 2025
An Ensemble Model with Attention Based Mechanism for Image Captioning. Computers & electrical engineering (Comput. Electr. Eng.), 2025 Israa Al Badarneh Bassam Hammo Omar Al-Kadi 324 11 0 28 Jan 2025
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing Ching-Hsun Tseng Shin-Jye Lee Po-Wei Cheng Chien Lee Chih-Chieh Hung 96 0 0 18 Sep 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning Mozhgan Pourkeshavarz Shahabedin Nabavi Mohsen Moghaddam M. Shamsfard 158 4 0 08 Feb 2023
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation Jie Ruan Yue Wu Xiaojun Wan Yuesheng Zhu 123 1 0 20 Nov 2022
M^4I: Multi-modal Models Membership Inference. Neural Information Processing Systems (NeurIPS), 2022 Pingyi Hu Zihan Wang Ruoxi Sun Hu Wang Minhui Xue 193 35 0 15 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM. VNU Journal of Science Computer Science and Communication Engineering (JCSCE), 2022 Thanh Van Nguyen Long H. Nguyen Nhat Truong Pham Liu Tai Nguyen Van Huong Do Hai Nguyen Ngoc Duy Nguyen VLM ViT 130 1 0 03 Sep 2022
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers. AAAI Conference on Artificial Intelligence (AAAI), 2022 Namgyu Kang Byeonghyeon Lee Youngjoon Hong S. Yun Eunbyung Park PINN AI4CE 201 23 0 26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics. Signal processing. Image communication (SPIC), 2022 Othón González-Chávez Guillermo Ruiz Daniela Moctezuma Tania A. Ramirez-delreal 199 9 0 04 Jul 2022
Image Captioning based on Feature Refinement and Reflective Decoding G. Alabduljabbar Hafida Benhidour Said Kerrache 3DV 112 3 0 16 Jun 2022
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism Rashid Khan Shujah Islam Khadija Kanwal Mansoor Iqbal Md. Imran Hossain Z. Ye 3DV 82 20 0 03 Mar 2022
Self-Supervised Image-to-Text and Text-to-Image Synthesis Anindya Sundar Das S. Saha SSL 89 6 0 09 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods Zanyar Zohourianshahzadi Jugal Kalita VLM 168 57 0 29 Nov 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation G. O. D. Santos Esther Luna Colombini Sandra Avila 139 38 0 28 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 379 343 0 14 Jul 2021
Attention, please! A survey of Neural Attention Models in Deep Learning. Artificial Intelligence Review (AIR), 2021 Alana de Santana Correia Esther Luna Colombini HAI 308 249 0 31 Mar 2021
Generalizing Face Forgery Detection with High-frequency Features. Computer Vision and Pattern Recognition (CVPR), 2021 Yucheng Luo Yong Zhang Junchi Yan Wei Liu CVBM 193 469 0 23 Mar 2021
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation. Neural Networks (NN), 2021 Jungjun Kim Dong-Gyu Lee Jialin Wu Hong G Jung Seong-Whan Lee ObjD 157 23 0 22 Jan 2021
Boost Image Captioning with Knowledge Reasoning. Machine-mediated learning (ML), 2020 Feicheng Huang Zhiwen Wang Haiyang Wei Canlong Zhang Huifang Ma 91 27 0 02 Nov 2020
Image Captioning with Attention for Smart Local Tourism using EfficientNet D. H. Fudholi Yurio Windiatmoko Nurdi Afrianto Prastyo Eko Susanto Magfirah Suyuti A. Hidayatullah R. Rahmadi 3DH 109 13 0 18 Sep 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning. Neural Information Processing Systems (NeurIPS), 2020 Riccardo Del Chiaro Bartlomiej Twardowski Andrew D. Bagdanov Joost van de Weijer CLL VLM 173 49 0 13 Jul 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching. European Conference on Computer Vision (ECCV), 2020 Tianlang Chen Jiajun Deng Jiebo Luo 429 75 0 07 Mar 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching. AAAI Conference on Artificial Intelligence (AAAI), 2020 Tianlang Chen Jiebo Luo 138 70 0 20 Feb 2020
Meshed-Memory Transformer for Image Captioning. Computer Vision and Pattern Recognition (CVPR), 2019 Marcella Cornia Matteo Stefanini Lorenzo Baraldi Rita Cucchiara 210 1,017 0 17 Dec 2019
Predicting the Politics of an Image Using Webly Supervised Data. Neural Information Processing Systems (NeurIPS), 2019 Christopher Thomas Adriana Kovashka SSL 195 24 0 31 Oct 2019
Cross Attention Network for Few-shot Classification. Neural Information Processing Systems (NeurIPS), 2019 Rui Hou Hong Chang Bingpeng Ma Shiguang Shan Xilin Chen 414 738 0 17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style. IEEE International Conference on Computer Vision (ICCV), 2019 Hongwei Ge Zehang Yan Kai Zhang Mingde Zhao Liang Sun 119 25 0 15 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability. IEEE International Conference on Robotics and Automation (ICRA), 2019 Marcella Cornia Lorenzo Baraldi Rita Cucchiara 257 29 0 07 Oct 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Soravit Changpinyo Bo Pang Piyush Sharma Radu Soricut ObjD 223 20 0 04 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods. Journal of Artificial Intelligence Research (JAIR), 2019 Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 376 141 0 22 Jul 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding Jian Zheng S. Krishnamurthy Ruxin Chen Min-Hung Chen Zhenhao Ge Xiaohua Li 127 4 0 16 Jun 2019
Multi-scale self-guided attention for medical image segmentation. IEEE journal of biomedical and health informatics (IEEE JBHI), 2019 Ashish Sinha Jose Dolz SSeg 287 479 0 07 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering. Annual Meeting of the Association for Computational Linguistics (ACL), 2019 Jialin Wu Zeyuan Hu Raymond J. Mooney 220 45 0 03 Jun 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding Ning Xie Farley Lai Derek Doran Asim Kadav CoGe 304 346 0 20 Jan 2019
Robust Change Captioning Dong Huk Park Trevor Darrell Anna Rohrbach 161 5 0 08 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions Marcella Cornia Lorenzo Baraldi Rita Cucchiara DiffM 219 194 0 26 Nov 2018
Gated Hierarchical Attention for Image Captioning Qingzhong Wang Antoni B. Chan 124 19 0 30 Oct 2018
Area Attention Yang Li Lukasz Kaiser Samy Bengio Si Si 366 25 0 23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning Md Zakir Hossain Ferdous Sohel M. Shiratuddin Hamid Laga VLM 3DV 294 839 0 06 Oct 2018
Facial Action Unit Detection Using Attention and Relation Learning Zhiwen Shao Zhilei Liu Jianfei Cai Yunsheng Wu Lizhuang Ma ViT 197 131 0 10 Aug 2018
Joint Image Captioning and Question Answering Jialin Wu Zeyuan Hu Raymond J. Mooney 105 14 0 22 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning YunLong Yu Zhong Ji Yanwei Fu Jichang Guo Yanwei Pang Zhongfei Zhang VLM 126 27 0 21 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text A. Mathews Lexing Xie Xuming He VLM 137 116 0 18 May 2018
Token-level and sequence-level loss smoothing for RNN language models Maha Elbayad Laurent Besacier Jakob Verbeek 143 19 0 14 May 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data Xihui Liu Jiaming Song Jing Shao Dapeng Chen Xiaogang Wang 213 142 0 22 Mar 2018
Teaching Machines to Code: Neural Markup Generation with Visual Attention Sumeet S. Singh 129 9 0 15 Feb 2018
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays Xiaosong Wang Yifan Peng Le Lu Zhiyong Lu Ronald M. Summers MedIm 165 514 0 12 Jan 2018
ADVISE: Symbolism and External Knowledge for Decoding Advertisements Keren Ye Adriana Kovashka 156 57 0 17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries Bohan Zhuang Qi Wu Chunhua Shen Ian Reid Anton Van Den Hengel ObjD 151 142 0 17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould Lei Zhang AIMat 530 4,504 0 25 Jul 2017