Areas of Attention for Image Captioning

3 December 2016
M. Pedersoli
Thomas Lucas
Cordelia Schmid
Jakob Verbeek

Papers citing "Areas of Attention for Image Captioning"

50 / 51 papers shown
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation
Computer Science Review (CSR), 2025
Israa A. Albadarneh
Bassam Hammo
Omar Al-Kadi
VLM
151
2
0
03 Jun 2025
An Ensemble Model with Attention Based Mechanism for Image Captioning
Computers & electrical engineering (Comput. Electr. Eng.), 2025
Israa Al Badarneh
Bassam Hammo
Omar Al-Kadi
324
11
0
28 Jan 2025
A Novel Method of Fuzzy Topic Modeling based on Transformer Processing
Ching-Hsun Tseng
Shin-Jye Lee
Po-Wei Cheng
Chien Lee
Chih-Chieh Hung
96
0
0
18 Sep 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
158
4
0
08 Feb 2023
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation
Jie Ruan
Yue Wu
Xiaojun Wan
Yuesheng Zhu
123
1
0
20 Nov 2022
M^4I: Multi-modal Models Membership Inference
Neural Information Processing Systems (NeurIPS), 2022
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
193
35
0
15 Sep 2022
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
VNU Journal of Science Computer Science and Communication Engineering (JCSCE), 2022
Thanh Van Nguyen
Long H. Nguyen
Nhat Truong Pham
Liu Tai Nguyen
Van Huong Do
Hai Nguyen
Ngoc Duy Nguyen
VLM, ViT
130
1
0
03 Sep 2022
PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers
AAAI Conference on Artificial Intelligence (AAAI), 2022
Namgyu Kang
Byeonghyeon Lee
Youngjoon Hong
S. Yun
Eunbyung Park
PINN, AI4CE
201
23
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Signal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
199
9
0
04 Jul 2022
Image Captioning based on Feature Refinement and Reflective Decoding
G. Alabduljabbar
Hafida Benhidour
Said Kerrache
3DV
112
3
0
16 Jun 2022
A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Rashid Khan
Shujah Islam
Khadija Kanwal
Mansoor Iqbal
Md. Imran Hossain
Z. Ye
3DV
82
20
0
03 Mar 2022
Self-Supervised Image-to-Text and Text-to-Image Synthesis
Anindya Sundar Das
S. Saha
SSL
89
6
0
09 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
168
57
0
29 Nov 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
139
38
0
28 Sep 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV, VLM, MLLM
379
343
0
14 Jul 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Artificial Intelligence Review (AIR), 2021
Alana de Santana Correia
Esther Luna Colombini
HAI
308
249
0
31 Mar 2021
Generalizing Face Forgery Detection with High-frequency Features
Computer Vision and Pattern Recognition (CVPR), 2021
Yucheng Luo
Yong Zhang
Junchi Yan
Wei Liu
CVBM
193
469
0
23 Mar 2021
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Neural Networks (NN), 2021
Jungjun Kim
Dong-Gyu Lee
Jialin Wu
Hong G Jung
Seong-Whan Lee
ObjD
157
23
0
22 Jan 2021
Boost Image Captioning with Knowledge Reasoning
Machine-mediated learning (ML), 2020
Feicheng Huang
Zhiwen Wang
Haiyang Wei
Canlong Zhang
Huifang Ma
91
27
0
02 Nov 2020
Image Captioning with Attention for Smart Local Tourism using EfficientNet
D. H. Fudholi
Yurio Windiatmoko
Nurdi Afrianto
Prastyo Eko Susanto
Magfirah Suyuti
A. Hidayatullah
R. Rahmadi
3DH
109
13
0
18 Sep 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Neural Information Processing Systems (NeurIPS), 2020
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLL, VLM
173
49
0
13 Jul 2020
Adaptive Offline Quintuplet Loss for Image-Text Matching
European Conference on Computer Vision (ECCV), 2020
Tianlang Chen
Jiajun Deng
Jiebo Luo
429
75
0
07 Mar 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
AAAI Conference on Artificial Intelligence (AAAI), 2020
Tianlang Chen
Jiebo Luo
138
70
0
20 Feb 2020
Meshed-Memory Transformer for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2019
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
210
1,017
0
17 Dec 2019
Predicting the Politics of an Image Using Webly Supervised Data
Neural Information Processing Systems (NeurIPS), 2019
Christopher Thomas
Adriana Kovashka
SSL
195
24
0
31 Oct 2019
Cross Attention Network for Few-shot Classification
Neural Information Processing Systems (NeurIPS), 2019
Rui Hou
Hong Chang
Bingpeng Ma
Shiguang Shan
Xilin Chen
414
738
0
17 Oct 2019
Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style
IEEE International Conference on Computer Vision (ICCV), 2019
Hongwei Ge
Zehang Yan
Kai Zhang
Mingde Zhao
Liang Sun
119
25
0
15 Oct 2019
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
IEEE International Conference on Robotics and Automation (ICRA), 2019
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
257
29
0
07 Oct 2019
Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Soravit Changpinyo
Bo Pang
Piyush Sharma
Radu Soricut
ObjD
223
20
0
04 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Journal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
376
141
0
22 Jul 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
127
4
0
16 Jun 2019
Multi-scale self-guided attention for medical image segmentation
IEEE journal of biomedical and health informatics (IEEE JBHI), 2019
Ashish Sinha
Jose Dolz
SSeg
287
479
0
07 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
220
45
0
03 Jun 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
304
346
0
20 Jan 2019
Robust Change Captioning
Dong Huk Park
Trevor Darrell
Anna Rohrbach
161
5
0
08 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
DiffM
219
194
0
26 Nov 2018
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
124
19
0
30 Oct 2018
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
366
25
0
23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM, 3DV
294
839
0
06 Oct 2018
Facial Action Unit Detection Using Attention and Relation Learning
Zhiwen Shao
Zhilei Liu
Jianfei Cai
Yunsheng Wu
Lizhuang Ma
ViT
197
131
0
10 Aug 2018
Joint Image Captioning and Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
105
14
0
22 May 2018
Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning
YunLong Yu
Zhong Ji
Yanwei Fu
Jichang Guo
Yanwei Pang
Zhongfei Zhang
VLM
126
27
0
21 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
137
116
0
18 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
143
19
0
14 May 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Jiaming Song
Jing Shao
Dapeng Chen
Xiaogang Wang
213
142
0
22 Mar 2018
Teaching Machines to Code: Neural Markup Generation with Visual Attention
Sumeet S. Singh
129
9
0
15 Feb 2018
TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays
Xiaosong Wang
Yifan Peng
Le Lu
Zhiyong Lu
Ronald M. Summers
MedIm
165
514
0
12 Jan 2018
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
156
57
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
151
142
0
17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
530
4,504
0
25 Jul 2017