Attention Correctness in Neural Image Captioning

31 May 2016

Papers citing "Attention Correctness in Neural Image Captioning"

31 / 31 papers shown

Title
VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception Jiyoung Lee Seung Wook Kim Seunghyun Won Joonseok Lee Marzyeh Ghassemi James Thorne Jaeseok Choi O.-Kil Kwon E. Choi 22 1 0 03 Aug 2023
Contrastive Language-Image Pretrained Models are Zero-Shot Human Scanpath Predictors Dario Zanca Andrea Zugarini S.J. Dietz Thomas Altstidl Mark A. Turban Ndjeuha Leo Schwinn Bjoern M. Eskofier VLM 9 1 0 21 May 2023
An Image captioning algorithm based on the Hybrid Deep Learning Technique (CNN+GRU) Rana Adnan Ahmad Muhammad Azhar Hina Sattar 21 10 0 06 Jan 2023
Prophet Attention: Predicting Attention with Future Attention for Image Captioning Fenglin Liu Xuancheng Ren Xian Wu Wei Fan Yuexian Zou Xu Sun 21 46 0 19 Oct 2022
Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network Hao Xing Darius Burschka GNN 3DH 17 7 0 12 Jul 2022
A General Survey on Attention Mechanisms in Deep Learning Gianni Brauwers Flavius Frasincar 23 296 0 27 Mar 2022
CNN Attention Guidance for Improved Orthopedics Radiographic Fracture Classification Zhibin Liao Kewen Liao Haifeng Shen M. F. van Boxel J. Prijs R. Jaarsma J. Doornberg A. Hengel Johan W. Verjans 21 14 0 21 Mar 2022
Keyword localisation in untranscribed speech using visually grounded speech models Kayode Olaleye Dan Oneaţă Herman Kamper 19 7 0 02 Feb 2022
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing Pengfei Liu Weizhe Yuan Jinlan Fu Zhengbao Jiang Hiroaki Hayashi Graham Neubig VLM SyDa 23 3,828 0 28 Jul 2021
CASTing Your Model: Learning to Localize Improves Self-Supervised Representations Ramprasaath R. Selvaraju Karan Desai Justin Johnson Nikhil Naik SSL 14 79 0 08 Dec 2020
Dual Attention on Pyramid Feature Maps for Image Captioning Litao Yu Jian Andrew Zhang Qiang Wu 16 47 0 02 Nov 2020
On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries Tianze Shi Chen Zhao Jordan L. Boyd-Graber Hal Daumé Lillian Lee 16 78 0 21 Oct 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework C. Sur 11 7 0 16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC) C. Sur 23 16 0 15 Feb 2020
Scene Graph Parsing by Attention Graph Martin Andrews Yew Ken Chia Sam Witteveen GNN 19 11 0 13 Sep 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering Jie Lei Licheng Yu Tamara L. Berg Mohit Bansal 28 227 0 25 Apr 2019
Grounded Video Description Luowei Zhou Yannis Kalantidis Xinlei Chen Jason J. Corso Marcus Rohrbach 27 190 0 17 Dec 2018
Neural Sign Language Translation based on Human Keypoint Estimation Sang-Ki Ko Chang Jo Kim Hyedong Jung C. Cho SLR 22 207 0 28 Nov 2018
A Comprehensive Survey of Deep Learning for Image Captioning Md. Zakir Hossain Ferdous Sohel M. Shiratuddin Hamid Laga VLM 3DV 28 760 0 06 Oct 2018
Distinctive-attribute Extraction for Image Captioning Boeun Kim Young Han Lee Hyedong Jung C. Cho 17 6 0 25 Jul 2018
Discriminability objective for training descriptive captions Ruotian Luo Brian L. Price Scott D. Cohen Gregory Shakhnarovich 19 202 0 12 Mar 2018
Netizen-Style Commenting on Fashion Photos: Dataset and Diversity Measures Wen Hua Lin Kuan-Ting Chen HungYueh Chiang Winston H. Hsu 23 10 0 31 Jan 2018
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning Hongge Chen Huan Zhang Pin-Yu Chen Jinfeng Yi Cho-Jui Hsieh GAN AAML 27 49 0 06 Dec 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network Zizhao Zhang Yuanpu Xie Fuyong Xing M. McGough L. Yang MedIm 13 301 0 08 Jul 2017
Recurrent Multimodal Interaction for Referring Image Segmentation Chenxi Liu Zhe-nan Lin Xiaohui Shen Jimei Yang Xin Lu Alan Yuille EgoV 36 234 0 23 Mar 2017
MAT: A Multimodal Attentive Translator for Image Captioning Chang Liu F. Sun Changhu Wang Feng Wang Alan Yuille 12 58 0 18 Feb 2017
Comprehension-guided referring expressions Ruotian Luo Gregory Shakhnarovich ObjD 27 171 0 12 Jan 2017
An Empirical Study of Language CNN for Image Captioning Jiuxiang Gu G. Wang Jianfei Cai Tsuhan Chen 17 132 0 21 Dec 2016
Areas of Attention for Image Captioning M. Pedersoli Thomas Lucas Cordelia Schmid Jakob Verbeek 25 205 0 03 Dec 2016
Neural Machine Translation with Supervised Attention Lemao Liu Masao Utiyama A. Finch Eiichiro Sumita 21 156 0 14 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 216 7,924 0 17 Aug 2015