Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.14556
Cited By
On the Integration of Self-Attention and Convolution
29 November 2021
Xuran Pan
Chunjiang Ge
Rui Lu
S. Song
Guanfu Chen
Zeyi Huang
Gao Huang
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the Integration of Self-Attention and Convolution"
50 / 62 papers shown
Title
Contrastive Learning for Continuous Touch-Based Authentication
Mengyu Qiao
Yunpeng Zhai
Yang Wang
AAML
32
0
0
24 Apr 2025
YOLO-RS: Remote Sensing Enhanced Crop Detection Methods
Linlin Xiao
Zhang Tiancong
Yutong Jia
Xinyu Nie
Mengyao Wang
Xiaohang Shao
25
0
0
15 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
62
0
0
03 Apr 2025
SCHNet: SAM Marries CLIP for Human Parsing
Kunliang Liu
Jianming Wang
Rize Jin
Wonjun Hwang
Tae-Sun Chung
VLM
66
0
0
28 Mar 2025
Improved YOLOv7x-Based Defect Detection Algorithm for Power Equipment
Jin Hou
Hao Tang
54
0
0
25 Feb 2025
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Yunxiang Fu
Meng Lou
Yizhou Yu
112
1
0
16 Dec 2024
HorGait: A Hybrid Model for Accurate Gait Recognition in LiDAR Point Cloud Planar Projections
Jiaxing Hao
Yanxi Wang
Zhigang Chang
Hongmin Gao
Zihao Cheng
Chen Wu
Xin Zhao
Peiye Fang
Rachmat Muwardi
ViT
21
0
0
11 Oct 2024
Studying the Effects of Self-Attention on SAR Automatic Target Recognition
Jacob Fein-Ashley
R. Kannan
Viktor Prasanna
23
0
0
31 Aug 2024
LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation
Trung Dang
Huy Hoang Nguyen
A. Tiulpin
Mamba
27
3
0
26 Aug 2024
CROCODILE: Causality aids RObustness via COntrastive DIsentangled LEarning
Gianluca Carloni
Sotirios A. Tsaftaris
Sara Colantonio
OOD
21
1
0
09 Aug 2024
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Yuheng Shi
Minjing Dong
Chang Xu
Mamba
35
32
0
23 May 2024
HSViT: Horizontally Scalable Vision Transformer
Chenhao Xu
Chang-Tsun Li
Chee Peng Lim
Douglas Creighton
ViT
24
1
0
08 Apr 2024
Structured Initialization for Attention in Vision Transformers
Jianqiao Zheng
Xueqian Li
Simon Lucey
ViT
19
1
0
01 Apr 2024
CAMixerSR: Only Details Need More "Attention"
Yan Wang
Yi Liu
Shijie Zhao
Junlin Li
Li Zhang
SupR
41
17
0
29 Feb 2024
YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection
Xiaoyu Tang
Xingming Chen
Jintao Cheng
Jin Wu
Rui Fan
Chengxi Zhang
Zebo Zhou
21
4
0
20 Feb 2024
Convolutional Initialization for Data-Efficient Vision Transformers
Jianqiao Zheng
Xueqian Li
Simon Lucey
23
2
0
23 Jan 2024
Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection
Lian Huang
Chi-Man Pun
11
4
0
11 Jan 2024
Deformable Audio Transformer for Audio Event Detection
Wentao Zhu
25
0
0
24 Dec 2023
GSVA: Generalized Segmentation via Multimodal Large Language Models
Zhuofan Xia
Dongchen Han
Yizeng Han
Xuran Pan
Shiji Song
Gao Huang
VLM
23
54
0
15 Dec 2023
Transferring Modality-Aware Pedestrian Attentive Learning for Visible-Infrared Person Re-identification
Yuwei Guo
Wenhao Zhang
Licheng Jiao
Shuang Wang
Shuo Wang
Fang Liu
25
0
0
12 Dec 2023
Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection
Chun Bao
Jie Cao
Yaqian Ning
Tianhua Zhao
Zhijun Li
Zechen Wang
Li Zhang
Qun Hao
17
4
0
15 Nov 2023
OrthoNets: Orthogonal Channel Attention Networks
Hadi Salman
Caleb Parks
Matthew Swan
John Gauch
18
9
0
06 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
31
35
0
30 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Li Liu
Ming-Ming Cheng
SSL
19
2
0
08 Oct 2023
Reinforcement Learning-based Mixture of Vision Transformers for Video Violence Recognition
Hamid Reza Mohammadi
Ehsan Nazerfard
Tahereh Firoozi
ViT
10
2
0
04 Oct 2023
Logical Bias Learning for Object Relation Prediction
Xinyu Zhou
Zihan Ji
Anna Zhu
CVBM
14
0
0
01 Oct 2023
Interpret Vision Transformers as ConvNets with Dynamic Convolutions
Chong Zhou
Chen Change Loy
Bo Dai
ViT
25
1
0
19 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
19
22
0
04 Sep 2023
Class-level Structural Relation Modelling and Smoothing for Visual Representation Learning
Zitan Chen
Zhuang Qi
Xiao Cao
Xiangxian Li
Xiangxu Meng
Lei Meng
13
8
0
08 Aug 2023
FLatten Transformer: Vision Transformer using Focused Linear Attention
Dongchen Han
Xuran Pan
Yizeng Han
Shiji Song
Gao Huang
23
152
0
01 Aug 2023
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer
Guanyu Xu
Jiawei Hao
Li Shen
Han Hu
Yong Luo
Hui Lin
J. Shen
16
15
0
01 Aug 2023
Open-Set Domain Adaptation with Visual-Language Foundation Models
Qing Yu
Go Irie
Kiyoharu Aizawa
VLM
24
6
0
30 Jul 2023
Modularizing while Training: A New Paradigm for Modularizing DNN Models
Binhang Qi
Hailong Sun
Hongyu Zhang
Ruobing Zhao
Xiang Gao
MoMe
19
3
0
15 Jun 2023
Dual Path Transformer with Partition Attention
Zhengkai Jiang
Liang Liu
Jiangning Zhang
Yabiao Wang
Mingang Chen
Chengjie Wang
ViT
31
2
0
24 May 2023
Chest X-ray Image Classification: A Causal Perspective
Weizhi Nie
Chen Zhang
Dan Song
Lina Zhao
Yunru Bai
Keliang Xie
Anan Liu
CML
14
9
0
20 May 2023
MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning
Md. Adnan Arefeen
Zhouyu Li
M. Y. S. Uddin
Anupam Das
17
0
0
13 May 2023
CompletionFormer: Depth Completion with Convolutions and Vision Transformers
Youming Zhang
Xianda Guo
Poggi Matteo
Zheng Zhu
Guan Huang
S. Mattoccia
MDE
ViT
21
95
0
25 Apr 2023
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention
Xuran Pan
Tianzhu Ye
Zhuofan Xia
S. Song
Gao Huang
ViT
19
53
0
09 Apr 2023
Xformer: Hybrid X-Shaped Transformer for Image Denoising
Jiale Zhang
Yulun Zhang
Jinjin Gu
Jiahua Dong
L. Kong
Xiaokang Yang
ViT
13
28
0
11 Mar 2023
Underwater target detection based on improved YOLOv7
Bing Li
B. Liu
Haiming Liu
Shuofeng Li
Nizhuan Wang
11
72
0
14 Feb 2023
EIT: Enhanced Interactive Transformer
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
13
2
0
20 Dec 2022
Dunhuang murals contour generation network based on convolution and self-attention fusion
Bao-Yu Liu
Fengjie He
Shiqiang Du
Kaiwu Zhang
Jianhua Wang
3DPC
26
6
0
02 Dec 2022
Contrastive Language-Image Pre-Training with Knowledge Graphs
Xuran Pan
Tianzhu Ye
Dongchen Han
S. Song
Gao Huang
VLM
CLIP
16
42
0
17 Oct 2022
UGformer for Robust Left Atrium and Scar Segmentation Across Scanners
Tianyi Liu
Size Hou
Jiayu Zhu
Zilong Zhao
Haochuan Jiang
MedIm
6
2
0
11 Oct 2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
47
105
0
30 Sep 2022
Switchable Self-attention Module
Shan Zhong
Wushao Wen
Jinghui Qin
16
7
0
13 Sep 2022
ClusTR: Exploring Efficient Self-attention via Clustering for Vision Transformers
Yutong Xie
Jianpeng Zhang
Yong-quan Xia
A. Hengel
Qi Wu
23
6
0
28 Aug 2022
Adaptive Perception Transformer for Temporal Action Localization
Yizheng Ouyang
Tianjin Zhang
Weibo Gu
Hongfa Wang
21
3
0
25 Aug 2022
EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Xiang Yu
Zhe Geng
Xiaohua Huang
Qinglu Wang
Daiyin Zhu
12
5
0
03 Aug 2022
Robust RGB-D Fusion for Saliency Detection
Zongwei Wu
Shriarulmozhivarman Gobichettipalayam
Brahim Tamadazte
Guillaume Allibert
D. Paudel
C. Demonceaux
16
26
0
02 Aug 2022
1
2
Next