ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05594
  4. Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning
v1v2 (latest)

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
ArXiv (abs)PDFHTML

Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

50 / 440 papers shown
CNN-based fully automatic wrist cartilage volume quantification in MR
  Image
CNN-based fully automatic wrist cartilage volume quantification in MR Image
Nikita Vladimirov
E. Brui
A. Levchuk
Vladimir A. Fokin
A. Efimtcev
D. Bendahan
75
0
0
22 Jun 2022
Toward An Optimal Selection of Dialogue Strategies: A Target-Driven
  Approach for Intelligent Outbound Robots
Toward An Optimal Selection of Dialogue Strategies: A Target-Driven Approach for Intelligent Outbound Robots
Ruifeng Qian
Shijie Li
Mengjiao Bao
Huan Chen
Yu Che
105
2
0
22 Jun 2022
Dual Windows Are Significant: Learning from Mediastinal Window and
  Focusing on Lung Window
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung WindowCAAI International Conference on Artificial Intelligence (ICCAI), 2022
Qiuli Wang
Xin Tan
Chen Liu
149
2
0
08 Jun 2022
From Pixels to Objects: Cubic Visual Attention for Visual Question
  Answering
From Pixels to Objects: Cubic Visual Attention for Visual Question AnsweringInternational Joint Conference on Artificial Intelligence (IJCAI), 2018
Jingkuan Song
Pengpeng Zeng
Lianli Gao
Heng Tao Shen
144
66
0
04 Jun 2022
Egocentric Video-Language Pretraining
Egocentric Video-Language PretrainingNeural Information Processing Systems (NeurIPS), 2022
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLMEgoV
268
247
0
03 Jun 2022
WaveMix: A Resource-efficient Neural Network for Image Analysis
WaveMix: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan
Kavitha Viswanathan
S. AnanduA
A. Sethi
449
28
0
28 May 2022
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual
  Context for Image Captioning
Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2022
Chia-Wen Kuo
Z. Kira
260
80
0
09 May 2022
A survey on attention mechanisms for medical applications: are we moving
  towards better algorithms?
A survey on attention mechanisms for medical applications: are we moving towards better algorithms?IEEE Access (IEEE Access), 2022
Tiago Gonçalves
Isabel Rio-Torto
Luís F. Teixeira
J. S. Cardoso
OODMedIm
208
52
0
26 Apr 2022
Attention in Reasoning: Dataset, Analysis, and Modeling
Attention in Reasoning: Dataset, Analysis, and ModelingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Shi Chen
Ming Jiang
Jinhui Yang
Qi Zhao
LRM
142
4
0
20 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Visual Attention Methods in Deep Learning: An In-Depth SurveyInformation Fusion (Inf. Fusion), 2022
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Lin Wang
338
245
0
16 Apr 2022
Image Captioning In the Transformer Age
Image Captioning In the Transformer Age
Yangliu Xu
Li Li
Haiyang Xu
Songfang Huang
Fei Huang
Jianfei Cai
ViT
149
7
0
15 Apr 2022
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural
  Network for Multisource Remote Sensing Data Classification
A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data ClassificationIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Hengchao Li
Wen-Shuai Hu
Wei Li
Jun Li
Q. Du
Antonio J. Plaza
126
110
0
09 Apr 2022
Revisiting Near/Remote Sensing with Geospatial Attention
Revisiting Near/Remote Sensing with Geospatial AttentionComputer Vision and Pattern Recognition (CVPR), 2022
Scott Workman
M. U. Rafique
Hunter Blanton
Nathan Jacobs
170
21
0
04 Apr 2022
Point-Unet: A Context-aware Point-based Neural Network for Volumetric
  Segmentation
Point-Unet: A Context-aware Point-based Neural Network for Volumetric SegmentationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Ngoc-Vuong Ho
Tan H. Nguyen
Gia-Han Diep
Ngan Le
Binh-Son Hua
3DPC
222
28
0
16 Mar 2022
Dynamic Instance Domain Adaptation
Dynamic Instance Domain AdaptationIEEE Transactions on Image Processing (IEEE TIP), 2022
Zhongying Deng
Kaiyang Zhou
Da Li
Junjun He
Yi-Zhe Song
Tao Xiang
OOD
250
43
0
09 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with
  Transformers
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Kailai Li
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
417
505
0
09 Mar 2022
Structure-Aware Flow Generation for Human Body Reshaping
Structure-Aware Flow Generation for Human Body ReshapingComputer Vision and Pattern Recognition (CVPR), 2022
Jianqiang Ren
Xingtai Lv
Biwen Lei
Miaomiao Cui
Xuansong Xie
3DH
134
9
0
09 Mar 2022
Adaptive Cross-Layer Attention for Image Restoration
Adaptive Cross-Layer Attention for Image Restoration
Yancheng Wang
N. Xu
Yingzhen Yang
270
4
0
04 Mar 2022
ADVISE: ADaptive Feature Relevance and VISual Explanations for
  Convolutional Neural Networks
ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural NetworksThe Visual Computer (TVC), 2022
Mohammad Mahdi Dehshibi
Mona Ashtari-Majlan
Gereziher W. Adhane
David Masip
AAMLFAtt
173
4
0
02 Mar 2022
Video Question Answering: Datasets, Algorithms and Challenges
Video Question Answering: Datasets, Algorithms and ChallengesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yaoyao Zhong
Junbin Xiao
Wei Ji
Yicong Li
Wei Deng
Tat-Seng Chua
325
114
0
02 Mar 2022
Sensing accident-prone features in urban scenes for proactive driving
  and accident prevention
Sensing accident-prone features in urban scenes for proactive driving and accident prevention
Sumit Mishra
Praveenbalaji Rajendran
L. Vecchietti
Dongsoo Har
262
16
0
25 Feb 2022
Faithful learning with sure data for lung nodule diagnosis
Faithful learning with sure data for lung nodule diagnosis
Hanxiao Zhang
Liang Chen
Xiao Gu
Minghui Zhang
Yulei Qin
Feng Yao
Zhexin Wang
Yun Gu
Guangyao Yang
96
1
0
25 Feb 2022
Visual Attention Network
Visual Attention NetworkComputational Visual Media (CVM), 2022
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViTVLM
469
869
0
20 Feb 2022
A Frustratingly Simple Approach for End-to-End Image Captioning
A Frustratingly Simple Approach for End-to-End Image Captioning
Ziyang Luo
Yadong Xi
Rongsheng Zhang
Jing Ma
VLMMLLM
237
19
0
30 Jan 2022
Synthesizing Tensor Transformations for Visual Self-attention
Synthesizing Tensor Transformations for Visual Self-attention
Xian Wei
Xihao Wang
Hai Lan
Jiaming Lei
Ya-Han Huang
Hui Yu
Jian Yang
ViT
69
0
0
05 Jan 2022
Learning Spatially-Adaptive Squeeze-Excitation Networks for Image
  Synthesis and Image Recognition
Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition
Jianghao Shen
Tianfu Wu
ViT
215
0
0
29 Dec 2021
Associative Adversarial Learning Based on Selective Attack
Associative Adversarial Learning Based on Selective Attack
Runqi Wang
Xiaoyue Duan
Baochang Zhang
Shenjun Xue
Wentao Zhu
David Doermann
G. Guo
AAML
260
0
0
28 Dec 2021
DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant
  Brain Image Segmentation
DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image SegmentationACM Symposium on Applied Computing (SAC), 2021
Dinh-Hieu Hoang
Gia-Han Diep
Minh-Triet Tran
Ngan T. H Le
152
11
0
27 Dec 2021
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU
  Signals
Attention-Based Sensor Fusion for Human Activity Recognition Using IMU Signals
Wenjin Tao
Haodong Chen
Md Moniruzzaman
M. C. Leu
Zhaozheng Yi
Ruwen Qin
85
15
0
20 Dec 2021
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch
  Fusion
Improving Face-Based Age Estimation with Attention-Based Dynamic Patch FusionIEEE Transactions on Image Processing (TIP), 2021
Haoyi Wang
Victor Sanchez
Chang-Tsun Li
CVBM
122
41
0
19 Dec 2021
Event-guided Deblurring of Unknown Exposure Time Videos
Event-guided Deblurring of Unknown Exposure Time Videos
Taewoo Kim
Jungmin Lee
Lin Wang
Kuk-Jin Yoon
296
43
0
13 Dec 2021
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Unsupervised Domain-Specific Deblurring using Scale-Specific Attention
Praveen Kandula
N. Rajagopalan.A.
198
0
0
12 Dec 2021
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Couplformer:Rethinking Vision Transformer with Coupling Attention Map
Hai Lan
Xihao Wang
Xian Wei
ViT
166
3
0
10 Dec 2021
Rethinking the Two-Stage Framework for Grounded Situation Recognition
Rethinking the Two-Stage Framework for Grounded Situation Recognition
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Tat-Seng Chua
189
37
0
10 Dec 2021
Classification-Then-Grounding: Reformulating Video Scene Graphs as
  Temporal Bipartite Graphs
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Kaifeng Gao
Long Chen
Yulei Niu
Jian Shao
Jun Xiao
223
36
0
08 Dec 2021
ADD: Frequency Attention and Multi-View based Knowledge Distillation to
  Detect Low-Quality Compressed Deepfake Images
ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
B. Le
Simon S. Woo
AAML
187
105
0
07 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
188
57
0
29 Nov 2021
TDAM: Top-Down Attention Module for Contextually Guided Feature
  Selection in CNNs
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNsEuropean Conference on Computer Vision (ECCV), 2021
Shantanu Jaiswal
Basura Fernando
Cheston Tan
ViT
235
18
0
26 Nov 2021
ClipCap: CLIP Prefix for Image Captioning
ClipCap: CLIP Prefix for Image Captioning
Ron Mokady
Amir Hertz
Amit H. Bermano
CLIPVLM
308
789
0
18 Nov 2021
Image-specific Convolutional Kernel Modulation for Single Image
  Super-resolution
Image-specific Convolutional Kernel Modulation for Single Image Super-resolution
Yuanfei Huang
Jie Li
Yanting Hu
Xinbo Gao
Huan Huang
SupR
185
0
0
16 Nov 2021
Attention Mechanisms in Computer Vision: A Survey
Attention Mechanisms in Computer Vision: A SurveyComputational Visual Media (CVM), 2021
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
286
2,082
0
15 Nov 2021
Co-segmentation Inspired Attention Module for Video-based Computer
  Vision Tasks
Co-segmentation Inspired Attention Module for Video-based Computer Vision TasksComputer Vision and Image Understanding (CVIU), 2021
Arulkumar Subramaniam
Jayesh Vaidya
Muhammed Ameen
Athira M. Nambiar
Anurag Mittal
356
7
0
14 Nov 2021
Local Multi-Head Channel Self-Attention for Facial Expression
  Recognition
Local Multi-Head Channel Self-Attention for Facial Expression Recognition
Roberto Pecoraro
Valerio Basile
Viviana Bono
Sara Gallo
ViT
300
62
0
14 Nov 2021
Explaining Face Presentation Attack Detection Using Natural Language
Explaining Face Presentation Attack Detection Using Natural Language
H. Mirzaalian
Mohamed E. Hussein
L. Spinoulas
Jonathan May
Wael AbdAlmageed
CVBMFAttAAML
90
5
0
08 Nov 2021
Dense Prediction with Attentive Feature Aggregation
Dense Prediction with Attentive Feature AggregationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Yung-Hsu Yang
Thomas E. Huang
Min Sun
Samuel Rota Buló
Peter Kontschieder
Feng Yu
195
8
0
01 Nov 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal
  Information for Video Recognition
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
208
1
0
29 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with
  Recurrent Layer Aggregation
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer AggregationNeural Information Processing Systems (NeurIPS), 2021
Jingyu Zhao
Yanwen Fang
Guodong Li
135
29
0
22 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Topic Scene Graph Generation by Attention Distillation from CaptionIEEE International Conference on Computer Vision (ICCV), 2021
Wenbin Wang
R. Wang
X. Chen
DiffM
213
16
0
12 Oct 2021
Recurrent Attention Models with Object-centric Capsule Representation
  for Multi-object Recognition
Recurrent Attention Models with Object-centric Capsule Representation for Multi-object Recognition
Hossein Adeli
Seoyoung Ahn
G. Zelinsky
OCL
225
3
0
11 Oct 2021
Counterfactual Samples Synthesizing and Training for Robust Visual
  Question Answering
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
Long Chen
Yuhang Zheng
Yulei Niu
Hanwang Zhang
Jun Xiao
AAMLOOD
279
47
0
03 Oct 2021
Previous
123456789
Next