1612.01033
Areas of Attention for Image Captioning
3 December 2016
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek
Papers citing
"Areas of Attention for Image Captioning"
50 / 51 papers shown
Title
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation
Computer Science Review (CSR), 2025
Israa A. Albadarneh
Bassam Hammo
Omar Al-Kadi
VLM
151
2
0
03 Jun 2025
An Ensemble Model with Attention Based Mechanism for Image Captioning
Computers & electrical engineering (Comput. Electr. Eng.), 2025
Israa Al Badarneh
Bassam Hammo
Omar Al-Kadi
324
11
0
28 Jan 2025
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing
Ching-Hsun Tseng
Shin-Jye Lee
Po-Wei Cheng
Chien Lee
Chih-Chieh Hung
96
0
0
18 Sep 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
150
4
0
08 Feb 2023
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation
Jie Ruan
Yue Wu
Xiaojun Wan
Yuesheng Zhu
123
1
0
20 Nov 2022
M^4I: Multi-modal Models Membership Inference
Neural Information Processing Systems (NeurIPS), 2022
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
193
35
0
15 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
VNU Journal of Science Computer Science and Communication Engineering (JCSCE), 2022
Thanh Van Nguyen
Long H. Nguyen
Nhat Truong Pham
Liu Tai Nguyen
Van Huong Do
Hai Nguyen
Ngoc Duy Nguyen
VLM
ViT
130
1
0
03 Sep 2022
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers
AAAI Conference on Artificial Intelligence (AAAI), 2022
Namgyu Kang
Byeonghyeon Lee
Youngjoon Hong
S. Yun
Eunbyung Park
PINN
AI4CE
201
23
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Signal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
199
9
0
04 Jul 2022
Image Captioning based on Feature Refinement and Reflective Decoding
G. Alabduljabbar
Hafida Benhidour
Said Kerrache
3DV
112
3
0
16 Jun 2022
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Rashid Khan
Shujah Islam
Khadija Kanwal
Mansoor Iqbal
Md. Imran Hossain
Z. Ye
3DV
82
20
0
03 Mar 2022
Self-Supervised Image-to-Text and Text-to-Image Synthesis
Anindya Sundar Das
S. Saha
SSL
89
6
0
09 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
168
57
0
29 Nov 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
139
38
0
28 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
379
343
0
14 Jul 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Artificial Intelligence Review (AIR), 2021
Alana de Santana Correia
Esther Luna Colombini
HAI
308
249
0
31 Mar 2021
Generalizing Face Forgery Detection with High-frequency Features
Computer Vision and Pattern Recognition (CVPR), 2021
Yucheng Luo
Yong Zhang
Junchi Yan
Wei Liu
CVBM
193
469
0
23 Mar 2021
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Neural Networks (NN), 2021
Jungjun Kim
Dong-Gyu Lee
Jialin Wu
Hong G Jung
Seong-Whan Lee
ObjD
157
23
0
22 Jan 2021
Boost Image Captioning with Knowledge Reasoning
Machine-mediated learning (ML), 2020
Feicheng Huang
Zhiwen Wang
Haiyang Wei
Canlong Zhang
Huifang Ma
91
27
0
02 Nov 2020
Image Captioning with Attention for Smart Local Tourism using EfficientNet
D. H. Fudholi
Yurio Windiatmoko
Nurdi Afrianto
Prastyo Eko Susanto
Magfirah Suyuti
A. Hidayatullah
R. Rahmadi
3DH
109
13
0
18 Sep 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Neural Information Processing Systems (NeurIPS), 2020
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLL
VLM
173
49
0
13 Jul 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
European Conference on Computer Vision (ECCV), 2020
Tianlang Chen
Jiajun Deng
Jiebo Luo
429
75
0
07 Mar 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2020
Tianlang Chen
Jiebo Luo
138
70
0
20 Feb 2020
Meshed-Memory Transformer for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2019
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
210
1,017
0
17 Dec 2019
Predicting the Politics of an Image Using Webly Supervised Data
Neural Information Processing Systems (NeurIPS), 2019
Christopher Thomas
Adriana Kovashka
SSL
191
24
0
31 Oct 2019
Cross Attention Network for Few-shot Classification
Neural Information Processing Systems (NeurIPS), 2019
Rui Hou
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
410
738
0
17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
IEEE International Conference on Computer Vision (ICCV), 2019
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
119
25
0
15 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
IEEE International Conference on Robotics and Automation (ICRA), 2019
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
257
29
0
07 Oct 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
223
20
0
04 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Journal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
376
141
0
22 Jul 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
127
4
0
16 Jun 2019
Multi-scale self-guided attention for medical image segmentation
IEEE journal of biomedical and health informatics (IEEE JBHI), 2019
Ashish Sinha
Jose Dolz
SSeg
283
479
0
07 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
220
45
0
03 Jun 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
304
346
0
20 Jan 2019
Robust Change Captioning
Dong Huk Park
Trevor Darrell
Anna Rohrbach
161
5
0
08 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
219
194
0
26 Nov 2018
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
124
19
0
30 Oct 2018
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
366
25
0
23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
294
839
0
06 Oct 2018
Facial Action Unit Detection Using Attention and Relation Learning
Zhiwen Shao
Zhilei Liu
Jianfei Cai
Yunsheng Wu
Lizhuang Ma
ViT
197
131
0
10 Aug 2018
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
105
14
0
22 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
126
27
0
21 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
137
116
0
18 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
143
19
0
14 May 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Jiaming Song
Jing Shao
Dapeng Chen
Xiaogang Wang
213
142
0
22 Mar 2018
Teaching Machines to Code: Neural Markup Generation with Visual Attention
Sumeet S. Singh
129
9
0
15 Feb 2018
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays
Xiaosong Wang
Yifan Peng
Le Lu
Zhiyong Lu
Ronald M. Summers
MedIm
165
514
0
12 Jan 2018
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
156
57
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
151
142
0
17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
530
4,504
0
25 Jul 2017