1611.05594
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
17 November 2016
Long Chen
Hanwang Zhang
Jun Xiao
Liqiang Nie
Jian Shao
Wei Liu
Tat-Seng Chua
Papers citing "SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning"
50 / 440 papers shown
Learning Personalized Page Content Ranking Using Customer Representation
Xin Shen
Yan Zhao
Sujan Perera
Yujia Liu
Jinyun Yan
Mitchell Goodman
BDL
164
9
0
09 May 2023
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Junyan Wang
Ming Yan
Yi Zhang
Jitao Sang
CLIP
VLM
294
17
0
26 Apr 2023
Co-attention Propagation Network for Zero-Shot Video Object Segmentation
IEEE Transactions on Image Processing (IEEE TIP), 2023
Gensheng Pei
Yazhou Yao
Fumin Shen
Daniel Huang
Xing-Rui Huang
Hengtao Shen
VOS
278
15
0
08 Apr 2023
SARGAN: Spatial Attention-based Residuals for Facial Expression Manipulation
Arbish Akram
Nazar Khan
GAN
CVBM
209
12
0
30 Mar 2023
Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
International Conference on Medical Imaging with Deep Learning (MIDL), 2023
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
194
85
0
29 Mar 2023
SiamTHN: Siamese Target Highlight Network for Visual Tracking
Jiahao Bao
Kaiqiang Chen
Xian Sun
Liangjin Zhao
Wenhui Diao
Ming Yan
145
22
0
22 Mar 2023
Learning Combinatorial Prompts for Universal Controllable Image Captioning
International Journal of Computer Vision (IJCV), 2023
Zhen Wang
Jun Xiao
Yueting Zhuang
Fei Gao
Jian Shao
Long Chen
198
11
0
11 Mar 2023
Distilled Reverse Attention Network for Open-world Compositional Zero-Shot Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Yun Yvonna Li
Zhe Liu
Saurav Jha
Sally Cripps
Weitong Chen
BDL
CoGe
201
21
0
01 Mar 2023
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
Axi Niu
Pei Wang
Yu Zhu
Jinqiu Sun
Qingsen Yan
Yanning Zhang
SupR
167
9
0
28 Feb 2023
Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
Jun Yang
Lizhi Bai
Yaoru Sun
Chunqi Tian
Maoyu Mao
Guorun Wang
SSeg
232
42
0
23 Feb 2023
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Mozhgan Pourkeshavarz
Shahabedin Nabavi
Mohsen Moghaddam
M. Shamsfard
189
4
0
08 Feb 2023
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
330
52
0
01 Feb 2023
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chao-Han Huck Yang
Yue Liu
Yu Zhang
Nanxin Chen
Rohit Prabhavalkar
Tara N. Sainath
Trevor Strohman
178
32
0
19 Jan 2023
Collaborative Perception in Autonomous Driving: Methods, Datasets and Challenges
IEEE Intelligent Transportation Systems Magazine (ITS), 2023
Yushan Han
Hui Zhang
Huifang Li
Yi Jin
Congyan Lang
Yidong Li
314
191
0
16 Jan 2023
Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning
IEEE International Conference on Computer Vision (ICCV), 2022
Woohyun Kang
Jonghwan Mun
Sungjun Lee
Byungseok Roh
VLM
245
27
0
27 Dec 2022
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion
IET Computer Vision (ICV), 2022
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
198
5
0
13 Dec 2022
Semiconductor Defect Pattern Classification by Self-Proliferation-and-Attention Neural Network
IEEE Transactions on Semiconductor Manufacturing (IEEE TSM), 2022
Yuanfu Yang
Min Sun
162
6
0
01 Dec 2022
ExpNet: A unified network for Expert-Level Classification
Junde Wu
Huihui Fang
Yehui Yang
Yu Zhang
Haoyi Xiong
Huazhu Fu
Yanwu Xu
228
1
0
29 Nov 2022
Conditioning Covert Geo-Location (CGL) Detection on Semantic Class Information
Pattern Recognition and Machine Intelligence (PRMI), 2022
Binoy Saha
Sukhendu Das
179
0
0
27 Nov 2022
A Novel Center-based Deep Contrastive Metric Learning Method for the Detection of Polymicrogyria in Pediatric Brain MRI
Lingfeng Zhang
N. Abdeen
Jochen Lang
129
3
0
22 Nov 2022
AdaTriplet-RA: Domain Matching via Adaptive Triplet and Reinforced Attention for Unsupervised Domain Adaptation
Signal Processing: Image Communication (SPIC), 2022
Xinyao Shu
Shiyang Yan
Zhenyu Lu
Xinshao Wang
Yuan Xie
213
2
0
16 Nov 2022
PKCAM: Previous Knowledge Channel Attention Module
Eslam Mohamed Bakr
Ahmad El-Sallab
M. Rashwan
116
1
0
14 Nov 2022
OSIC: A New One-Stage Image Captioner Coined
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Bo Wang
Zhao Zhang
Ming Zhao
Xiaojie Jin
Mingliang Xu
Meng Wang
VLM
218
6
0
04 Nov 2022
Text-Only Training for Image Captioning using Noise-Injected CLIP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
David Nukrai
Ron Mokady
Amir Globerson
VLM
CLIP
299
124
0
01 Nov 2022
Handwashing Action Detection System for an Autonomous Social Robot
IEEE Region 10 Conference (TENCON), 2022
Sreejith Sasidharan
P. Prabha
Devasena Pasupuleti
Anand M. Das
Chaitanya Kapoor
Gayathri Manikutty
Praveen Pankajakshan
Bhavani R. Rao
69
3
0
27 Oct 2022
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding
Mahdi Bonyani
Maryam Soleymani
132
0
0
27 Oct 2022
Prophet Attention: Predicting Attention with Future Attention for Image Captioning
Neural Information Processing Systems (NeurIPS), 2022
Fenglin Liu
Xuancheng Ren
Xian Wu
Wei Fan
Yuexian Zou
Xu Sun
231
50
0
19 Oct 2022
Hierarchical and Progressive Image Matting
Yu Qiao
Yuhao Liu
Ziqi Wei
Yuxin Wang
Qiang Cai
Guofeng Zhang
Xingbin Yang
160
14
0
13 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Pattern Recognition (Pattern Recogn.), 2022
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
187
27
0
13 Oct 2022
CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection
IEEE Transactions on Image Processing (IEEE TIP), 2022
Runmin Cong
Qin Lin
Chen Zhang
Chongyi Li
Xiaochun Cao
Qingming Huang
Yao-Min Zhao
ObjD
186
167
0
06 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ye Zhu
Yuehua Wu
Andrii Zadaianchuk
Yan Yan
354
38
0
05 Oct 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
International Journal of Computer Vision (IJCV), 2022
Xu Yang
Hanwang Zhang
Chongyang Gao
Jianfei Cai
MLLM
273
10
0
04 Oct 2022
SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval
European Conference on Computer Vision (ECCV), 2022
Yang Shen
Xuhao Sun
Xiu-Shen Wei
Qing-Yuan Jiang
Jian Yang
267
27
0
28 Sep 2022
A Spatial-channel-temporal-fused Attention for Spiking Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Wuque Cai
Hongze Sun
Rui Liu
Yan Cui
Jun Wang
Yang Xia
D. Yao
Daqing Guo
292
24
0
22 Sep 2022
Scale Attention for Learning Deep Face Representation: A Study Against Visual Scale Variation
Hailin Shi
Hang Du
Yibo Hu
Jun Wang
Dan Zeng
Ting Yao
CVBM
202
0
0
19 Sep 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2022
Meng-Hao Guo
Chenggang Lu
Qibin Hou
Zheng Liu
Ming-Ming Cheng
Shiyong Hu
SSeg
ViT
VLM
321
989
0
18 Sep 2022
MIPI 2022 Challenge on Under-Display Camera Image Restoration: Methods and Results
Ruicheng Feng
Chongyi Li
Shangchen Zhou
W. Sun
Qingpeng Zhu
Jun Jiang
Qingyu Yang
Chen Change Loy
Liang Feng
217
23
0
15 Sep 2022
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jinwoo Hwang
Philipp Benz
Tae-Hoon Kim
ViT
133
7
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
ACM Multimedia (ACM MM), 2022
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
277
47
0
18 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
115
0
0
09 Aug 2022
Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation
ACM Multimedia (ACM MM), 2022
Xingchen Li
Long Chen
Wenbo Ma
Yi Yang
Jun Xiao
196
30
0
03 Aug 2022
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Lin Li
Jun Xiao
Hanrong Shi
Hanwang Zhang
Yi Yang
Wen Liu
Long Chen
205
30
0
27 Jul 2022
Rethinking the Reference-based Distinctive Image Captioning
ACM Multimedia (ACM MM), 2022
Yangjun Mao
Long Chen
Zhihong Jiang
Dong Zhang
Zhimeng Zhang
Jian Shao
Jun Xiao
DiffM
225
23
0
22 Jul 2022
Correspondence Matters for Video Referring Expression Comprehension
ACM Multimedia (ACM MM), 2022
Meng Cao
Ji Jiang
Long Chen
Yuexian Zou
VOS
308
21
0
21 Jul 2022
Explicit Image Caption Editing
European Conference on Computer Vision (ECCV), 2022
Zhen Wang
Long Chen
Wenbo Ma
G. Han
Yulei Niu
Jian Shao
Jun Xiao
179
14
0
20 Jul 2022
Dynamic Prototype Mask for Occluded Person Re-Identification
ACM Multimedia (ACM MM), 2022
Lei Tan
Pingyang Dai
Rongrong Ji
Yongjian Wu
151
87
0
19 Jul 2022
Rethinking Data Augmentation for Robust Visual Question Answering
European Conference on Computer Vision (ECCV), 2022
Long Chen
Yuhang Zheng
Jun Xiao
OOD
197
51
0
18 Jul 2022
Continuous Facial Motion Deblurring
IEEE Access (IEEE Access), 2022
Tae Bok Lee
Sujy Han
Y. S. Heo
231
1
0
14 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Signal Processing: Image Communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
219
9
0
04 Jul 2022
Trichomonas Vaginalis Segmentation in Microscope Images
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Lin Li
Jingyi Liu
Shuo Wang
Xun Wang
Tian-Zhu Xiang
141
10
0
03 Jul 2022