ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.05594
  4. Cited By
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks
  for Image Captioning
v1v2 (latest)

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
ArXiv (abs)PDFHTML

Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"

50 / 440 papers shown
Learning Personalized Page Content Ranking Using Customer Representation
Learning Personalized Page Content Ranking Using Customer Representation
Xin Shen
Yan Zhao
Sujan Perera
Yujia Liu
Jinyun Yan
Mitchell Goodman
BDL
164
9
0
09 May 2023
From Association to Generation: Text-only Captioning by Unsupervised
  Cross-modal Mapping
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal MappingInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Junyan Wang
Ming Yan
Yi Zhang
Jitao Sang
CLIPVLM
294
17
0
26 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
Co-attention Propagation Network for Zero-Shot Video Object SegmentationIEEE Transactions on Image Processing (IEEE TIP), 2023
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
278
15
0
08 Apr 2023
SARGAN: Spatial Attention-based Residuals for Facial Expression
  Manipulation
SARGAN: Spatial Attention-based Residuals for Facial Expression Manipulation
Arbish Akram
Nazar Khan
GANCVBM
209
12
0
30 Mar 2023
Multi-scale Hierarchical Vision Transformer with Cascaded Attention
  Decoding for Medical Image Segmentation
Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image SegmentationInternational Conference on Medical Imaging with Deep Learning (MIDL), 2023
Md Mostafijur Rahman
R. Marculescu
MedImViT
194
85
0
29 Mar 2023
SiamTHN: Siamese Target Highlight Network for Visual Tracking
SiamTHN: Siamese Target Highlight Network for Visual Tracking
Jiahao Bao
Kaiqiang Chen
Xian Sun
Liangjin Zhao
Wenhui Diao
Ming Yan
145
22
0
22 Mar 2023
Learning Combinatorial Prompts for Universal Controllable Image
  Captioning
Learning Combinatorial Prompts for Universal Controllable Image CaptioningInternational Journal of Computer Vision (IJCV), 2023
Zhen Wang
Jun Xiao
Yueting Zhuang
Fei Gao
Jian Shao
Long Chen
198
11
0
11 Mar 2023
Distilled Reverse Attention Network for Open-world Compositional
  Zero-Shot Learning
Distilled Reverse Attention Network for Open-world Compositional Zero-Shot LearningIEEE International Conference on Computer Vision (ICCV), 2023
Yun Yvonna Li
Zhe Liu
Saurav Jha
Sally Cripps
Weitong Chen
BDLCoGe
201
21
0
01 Mar 2023
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
Axi Niu
Pei Wang
Yu Zhu
Jinqiu Sun
Qingsen Yan
Yanning Zhang
SupR
167
9
0
28 Feb 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
232
42
0
23 Feb 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image
  Captioning
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
189
4
0
08 Feb 2023
Multimodality Representation Learning: A Survey on Evolution,
  Pretraining and Its Applications
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
330
52
0
01 Feb 2023
From English to More Languages: Parameter-Efficient Model Reprogramming
  for Cross-Lingual Speech Recognition
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chao-Han Huck Yang
Yue Liu
Yu Zhang
Nanxin Chen
Rohit Prabhavalkar
Tara N. Sainath
Trevor Strohman
178
32
0
19 Jan 2023
Collaborative Perception in Autonomous Driving: Methods, Datasets and
  Challenges
Collaborative Perception in Autonomous Driving: Methods, Datasets and ChallengesIEEE Intelligent Transportation Systems Magazine (ITS), 2023
Yushan Han
Hui Zhang
Huifang Li
Yi Jin
Congyan Lang
Yidong Li
314
191
0
16 Jan 2023
Noise-aware Learning from Web-crawled Image-Text Data for Image
  Captioning
Noise-aware Learning from Web-crawled Image-Text Data for Image CaptioningIEEE International Conference on Computer Vision (ICCV), 2022
Woohyun Kang
Jonghwan Mun
Sungjun Lee
Byungseok Roh
VLM
245
27
0
27 Dec 2022
CAT: Learning to Collaborate Channel and Spatial Attention from
  Multi-Information Fusion
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information FusionIET Computer Vision (ICV), 2022
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
198
5
0
13 Dec 2022
Semiconductor Defect Pattern Classification by
  Self-Proliferation-and-Attention Neural Network
Semiconductor Defect Pattern Classification by Self-Proliferation-and-Attention Neural NetworkIEEE transactions on semiconductor manufacturing (IEEE TSM), 2022
Yuanfu Yang
Min Sun
162
6
0
01 Dec 2022
ExpNet: A unified network for Expert-Level Classification
ExpNet: A unified network for Expert-Level Classification
Junde Wu
Huihui Fang
Yehui Yang
Yu Zhang
Haoyi Xiong
Huazhu Fu
Yanwu Xu
228
1
0
29 Nov 2022
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class
  Information
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class InformationPattern Recognition and Machine Intelligence (PRMI), 2022
Binoy Saha
Sukhendu Das
179
0
0
27 Nov 2022
A Novel Center-based Deep Contrastive Metric Learning Method for the
  Detection of Polymicrogyria in Pediatric Brain MRI
A Novel Center-based Deep Contrastive Metric Learning Method for the Detection of Polymicrogyria in Pediatric Brain MRI
Lingfeng Zhang
N. Abdeen
Jochen Lang
129
3
0
22 Nov 2022
AdaTriplet-RA: Domain Matching via Adaptive Triplet and Reinforced
  Attention for Unsupervised Domain Adaptation
AdaTriplet-RA: Domain Matching via Adaptive Triplet and Reinforced Attention for Unsupervised Domain AdaptationSignal processing. Image communication (SPIC), 2022
Xinyao Shu
Shiyang Yan
Zhenyu Lu
Xinshao Wang
Yuan Xie
213
2
0
16 Nov 2022
PKCAM: Previous Knowledge Channel Attention Module
PKCAM: Previous Knowledge Channel Attention Module
Eslam Mohamed Bakr
Ahmad El-Sallab
M. Rashwan
116
1
0
14 Nov 2022
OSIC: A New One-Stage Image Captioner Coined
OSIC: A New One-Stage Image Captioner CoinedInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Bo Wang
Zhao Zhang
Ming Zhao
Xiaojie Jin
Mingliang Xu
Meng Wang
VLM
218
6
0
04 Nov 2022
Text-Only Training for Image Captioning using Noise-Injected CLIP
Text-Only Training for Image Captioning using Noise-Injected CLIPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
David Nukrai
Ron Mokady
Amir Globerson
VLMCLIP
299
124
0
01 Nov 2022
Handwashing Action Detection System for an Autonomous Social Robot
Handwashing Action Detection System for an Autonomous Social RobotIEEE Region 10 Conference (TENCON), 2022
Sreejith Sasidharan
P. Prabha
Devasena Pasupuleti
Anand M. Das
Chaitanya Kapoor
Gayathri Manikutty
Praveen Pankajakshan
Bhavani R. Rao
69
3
0
27 Oct 2022
Towards Improving Workers' Safety and Progress Monitoring of
  Construction Sites Through Construction Site Understanding
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding
Mahdi Bonyani
Maryam Soleymani
132
0
0
27 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image
  Captioning
Prophet Attention: Predicting Attention with Future Attention for Image CaptioningNeural Information Processing Systems (NeurIPS), 2022
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
231
50
0
19 Oct 2022
Hierarchical and Progressive Image Matting
Hierarchical and Progressive Image Matting
Yu Qiao
Yuhao Liu
Ziqi Wei
Yuxin Wang
Qiang Cai
Guofeng Zhang
Xingbin Yang
160
14
0
13 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic
  Segmentation
DCANet: Differential Convolution Attention Network for RGB-D Semantic SegmentationPattern Recognition (Pattern Recogn.), 2022
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
187
27
0
13 Oct 2022
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient
  Object Detection
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object DetectionIEEE Transactions on Image Processing (IEEE TIP), 2022
Runmin Cong
Qin Lin
Chen Zhang
Chongyi Li
Xiaochun Cao
Qingming Huang
Yao-Min Zhao
ObjD
186
167
0
06 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Vision+X: A Survey on Multimodal Learning in the Light of DataIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
354
38
0
05 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image
  Captioning
Learning to Collocate Visual-Linguistic Neural Modules for Image CaptioningInternational Journal of Computer Vision (IJCV), 2022
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
273
10
0
04 Oct 2022
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image
  Retrieval
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image RetrievalEuropean Conference on Computer Vision (ECCV), 2022
Yang Shen
Xuhao Sun
Xiu-Shen Wei
Qing-Yuan Jiang
Jian Yang
267
27
0
28 Sep 2022
A Spatial-channel-temporal-fused Attention for Spiking Neural Networks
A Spatial-channel-temporal-fused Attention for Spiking Neural NetworksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Wuque Cai
Hongze Sun
Rui Liu
Yan Cui
Jun Wang
Yang Xia
D. Yao
Daqing Guo
292
24
0
22 Sep 2022
Scale Attention for Learning Deep Face Representation: A Study Against
  Visual Scale Variation
Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation
Hailin Shi
Hang Du
Yibo Hu
Jun Wang
Dan Zeng
Ting Yao
CVBM
202
0
0
19 Sep 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic
  Segmentation
SegNeXt: Rethinking Convolutional Attention Design for Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2022
Meng-Hao Guo
Chenggang Lu
Qibin Hou
Zheng Liu
Ming-Ming Cheng
Shiyong Hu
SSegViTVLM
321
989
0
18 Sep 2022
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods
  and Results
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results
Ruicheng Feng
Chongyi Li
Shangchen Zhou
W. Sun
Qingpeng Zhu
Jun Jiang
Qingyu Yang
Chen Change Loy
Liang Feng
217
23
0
15 Sep 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview
  Pedestrian Detection with Attention
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with AttentionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
133
7
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate
  Semantic Attention Refinement
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention RefinementACM Multimedia (ACM MM), 2022
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
277
47
0
18 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
115
0
0
09 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly
  Supervised Scene Graph Generation
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph GenerationACM Multimedia (ACM MM), 2022
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
196
30
0
03 Aug 2022
NICEST: Noisy Label Correction and Training for Robust Scene Graph
  Generation
NICEST: Noisy Label Correction and Training for Robust Scene Graph GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Lin Li
Jun Xiao
Hanrong Shi
Hanwang Zhang
Yi Yang
Wen Liu
Long Chen
205
30
0
27 Jul 2022
Rethinking the Reference-based Distinctive Image Captioning
Rethinking the Reference-based Distinctive Image CaptioningACM Multimedia (ACM MM), 2022
Yangjun Mao
Long Chen
Zhihong Jiang
Dong Zhang
Zhimeng Zhang
Jian Shao
Jun Xiao
DiffM
225
23
0
22 Jul 2022
Correspondence Matters for Video Referring Expression Comprehension
Correspondence Matters for Video Referring Expression ComprehensionACM Multimedia (ACM MM), 2022
Meng Cao
Ji Jiang
Long Chen
Yuexian Zou
VOS
308
21
0
21 Jul 2022
Explicit Image Caption Editing
Explicit Image Caption EditingEuropean Conference on Computer Vision (ECCV), 2022
Zhen Wang
Long Chen
Wenbo Ma
G. Han
Yulei Niu
Jian Shao
Jun Xiao
179
14
0
20 Jul 2022
Dynamic Prototype Mask for Occluded Person Re-Identification
Dynamic Prototype Mask for Occluded Person Re-IdentificationACM Multimedia (ACM MM), 2022
Lei Tan
Pingyang Dai
Rongrong Ji
Yongjian Wu
151
87
0
19 Jul 2022
Rethinking Data Augmentation for Robust Visual Question Answering
Rethinking Data Augmentation for Robust Visual Question AnsweringEuropean Conference on Computer Vision (ECCV), 2022
Long Chen
Yuhang Zheng
Jun Xiao
OOD
197
51
0
18 Jul 2022
Continuous Facial Motion Deblurring
Continuous Facial Motion DeblurringIEEE Access (IEEE Access), 2022
Tae Bok Lee
Sujy Han
Y. S. Heo
231
1
0
14 Jul 2022
Are metrics measuring what they should? An evaluation of image
  captioning task metrics
Are metrics measuring what they should? An evaluation of image captioning task metricsSignal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
219
9
0
04 Jul 2022
Trichomonas Vaginalis Segmentation in Microscope Images
Trichomonas Vaginalis Segmentation in Microscope ImagesInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Lin Li
Jingyi Liu
Shuo Wang
Xun Wang
Tian-Zhu Xiang
141
10
0
03 Jul 2022
Previous
123456789
Next