ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.13621
  4. Cited By
Exploring Self-attention for Image Recognition

Exploring Self-attention for Image Recognition

28 April 2020
Hengshuang Zhao
Jiaya Jia
V. Koltun
    SSL
ArXivPDFHTML

Papers citing "Exploring Self-attention for Image Recognition"

50 / 316 papers shown
Title
Dunhuang murals contour generation network based on convolution and
  self-attention fusion
Dunhuang murals contour generation network based on convolution and self-attention fusion
Bao-Yu Liu
Fengjie He
Shiqiang Du
Kaiwu Zhang
Jianhua Wang
3DPC
40
6
0
02 Dec 2022
Lightweight Structure-Aware Attention for Visual Understanding
Lightweight Structure-Aware Attention for Visual Understanding
Heeseung Kwon
F. M. Castro
M. Marín-Jiménez
N. Guil
Alahari Karteek
26
2
0
29 Nov 2022
EHSNet: End-to-End Holistic Learning Network for Large-Size Remote
  Sensing Image Semantic Segmentation
EHSNet: End-to-End Holistic Learning Network for Large-Size Remote Sensing Image Semantic Segmentation
Wei-Neng Chen
Yansheng Li
Bo Dang
Yongjun Zhang
22
3
0
21 Nov 2022
PKCAM: Previous Knowledge Channel Attention Module
PKCAM: Previous Knowledge Channel Attention Module
Eslam Mohamed Bakr
Ahmad El-Sallab
M. Rashwan
18
1
0
14 Nov 2022
Dual Complementary Dynamic Convolution for Image Recognition
Dual Complementary Dynamic Convolution for Image Recognition
Longbin Yan
Yunxiao Qin
Shumin Liu
Jie Chen
10
0
0
11 Nov 2022
Computer Vision on X-ray Data in Industrial Production and Security
  Applications: A Comprehensive Survey
Computer Vision on X-ray Data in Industrial Production and Security Applications: A Comprehensive Survey
M. Rafiei
Jenni Raitoharju
Alexandros Iosifidis
19
22
0
10 Nov 2022
EEG-Fest: Few-shot based Attention Network for Driver's Vigilance
  Estimation with EEG Signals
EEG-Fest: Few-shot based Attention Network for Driver's Vigilance Estimation with EEG Signals
Ning Ding
Ce Zhang
A. Eskandarian
42
4
0
07 Nov 2022
A Deep Learning Approach to Generating Photospheric Vector Magnetograms
  of Solar Active Regions for SOHO/MDI Using SDO/HMI and BBSO Data
A Deep Learning Approach to Generating Photospheric Vector Magnetograms of Solar Active Regions for SOHO/MDI Using SDO/HMI and BBSO Data
Haodi Jiang
Qin Li
Zhihang Hu
Nian Liu
Yasser Abduallah
...
Genwei Zhang
Yan Xu
Wynne Hsu
J. T. Wang
Haimin Wang
32
6
0
04 Nov 2022
Studying inductive biases in image classification task
Studying inductive biases in image classification task
N. Arizumi
21
1
0
31 Oct 2022
Automatic Diagnosis of Myocarditis Disease in Cardiac MRI Modality using
  Deep Transformers and Explainable Artificial Intelligence
Automatic Diagnosis of Myocarditis Disease in Cardiac MRI Modality using Deep Transformers and Explainable Artificial Intelligence
M. Jafari
A. Shoeibi
Navid Ghassemi
Jónathan Heras
Saiguang Ling
...
Shuihua Wang
R. Alizadehsani
Juan M Gorriz
U. Acharya
Hamid Alinejad-Rokny
MedIm
20
11
0
26 Oct 2022
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
  Propagation in Transformers
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
Zhuo Huang
Zhiyou Zhao
Banghuai Li
Jungong Han
3DPC
ViT
27
55
0
23 Oct 2022
DCANet: Differential Convolution Attention Network for RGB-D Semantic
  Segmentation
DCANet: Differential Convolution Attention Network for RGB-D Semantic Segmentation
Lizhi Bai
Jun Yang
Chunqi Tian
Yaoru Sun
Maoyu Mao
Yanjun Xu
Weirong Xu
8
9
0
13 Oct 2022
Point Transformer V2: Grouped Vector Attention and Partition-based
  Pooling
Point Transformer V2: Grouped Vector Attention and Partition-based Pooling
Xiaoyang Wu
Yixing Lao
Li Jiang
Xihui Liu
Hengshuang Zhao
3DPC
ViT
21
367
0
11 Oct 2022
Neural Shape Deformation Priors
Neural Shape Deformation Priors
Jiapeng Tang
Lev Markhasin
Bi Wang
Justus Thies
Matthias Nießner
49
27
0
11 Oct 2022
FBNet: Feedback Network for Point Cloud Completion
FBNet: Feedback Network for Point Cloud Completion
Xuejun Yan
Hongyu Yan
Jingjing Wang
Hang Du
Zhihong Wu
Di Xie
Shiliang Pu
Li Lu
3DPC
20
28
0
08 Oct 2022
Accurate Image Restoration with Attention Retractable Transformer
Accurate Image Restoration with Attention Retractable Transformer
Jiale Zhang
Yulun Zhang
Jinjin Gu
Yongbing Zhang
L. Kong
X. Yuan
ViT
28
96
0
04 Oct 2022
Strong Instance Segmentation Pipeline for MMSports Challenge
Strong Instance Segmentation Pipeline for MMSports Challenge
Bo Yan
Fengliang Qi
Zhuang Li
Yadong Li
Hongbin Wang
22
2
0
28 Sep 2022
Dense-TNT: Efficient Vehicle Type Classification Neural Network Using
  Satellite Imagery
Dense-TNT: Efficient Vehicle Type Classification Neural Network Using Satellite Imagery
Ruikang Luo
Yaofeng Song
H. Zhao
Yicheng Zhang
Yi Zhang
Nanbin Zhao
Liping Huang
Rong Su
ViT
16
11
0
27 Sep 2022
Axially Expanded Windows for Local-Global Interaction in Vision
  Transformers
Axially Expanded Windows for Local-Global Interaction in Vision Transformers
Zhemin Zhang
Xun Gong
ViT
13
1
0
19 Sep 2022
Real-time 3D Single Object Tracking with Transformer
Real-time 3D Single Object Tracking with Transformer
Jiayao Shan
Sifan Zhou
Yubo Cui
Zheng Fang
ViT
20
50
0
02 Sep 2022
Swin-transformer-yolov5 For Real-time Wine Grape Bunch Detection
Swin-transformer-yolov5 For Real-time Wine Grape Bunch Detection
Shenglian Lu
Xiaoyu Liu
Zixaun He
Wenbo Liu
Xin Zhang
Manoj Karkee
10
38
0
30 Aug 2022
Conviformers: Convolutionally guided Vision Transformer
Conviformers: Convolutionally guided Vision Transformer
Mohit Vaishnav
Thomas Fel
I. F. Rodriguez
Thomas Serre
ViT
30
1
0
17 Aug 2022
A Vision Transformer-Based Approach to Bearing Fault Classification via
  Vibration Signals
A Vision Transformer-Based Approach to Bearing Fault Classification via Vibration Signals
Abid Hasan Zim
Aeyan Ashraf
Aquib Iqbal
Asad U. Malik
Minoru Kuribayashi
10
10
0
15 Aug 2022
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with
  Transformer
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer
Zhi-Chun Luo
Changqing Zhou
Liang Pan
Gongjie Zhang
Ti Liu
Yueru Luo
Haiyu Zhao
Ziwei Liu
Shijian Lu
3DPC
13
14
0
10 Aug 2022
Global Hierarchical Attention for 3D Point Cloud Analysis
Global Hierarchical Attention for 3D Point Cloud Analysis
Dan Jia
Alexander Hermans
Bastian Leibe
3DPC
21
0
0
07 Aug 2022
Jointformer: Single-Frame Lifting Transformer with Error Prediction and
  Refinement for 3D Human Pose Estimation
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation
Sebastian Lutz
R. Blythman
Koustav Ghosal
Matthew Moynihan
C. Simms
A. Smolic
ViT
21
15
0
07 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution
PointConvFormer: Revenge of the Point-based Convolution
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
23
30
0
04 Aug 2022
Action Quality Assessment using Transformers
Action Quality Assessment using Transformers
Abhay Iyer
Mohammad Alali
Hemanth Bodala
Sunit Vaidya
ViT
19
0
0
20 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
19
1
0
20 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
Vision Transformers: From Semantic Segmentation to Dense Prediction
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
24
7
0
19 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input
  Image
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Tsung-Yi Lin
Yichang Shih
R. Ramamoorthi
ViT
17
111
0
12 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive
  Spatiotemporal Sampling
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
11
1
0
12 Jul 2022
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation
  Learning
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
Ting Yao
Yingwei Pan
Yehao Li
Chong-Wah Ngo
Tao Mei
ViT
146
137
0
11 Jul 2022
Dual Vision Transformer
Dual Vision Transformer
Ting Yao
Yehao Li
Yingwei Pan
Yu Wang
Xiaoping Zhang
Tao Mei
ViT
141
75
0
11 Jul 2022
Attention and Self-Attention in Random Forests
Attention and Self-Attention in Random Forests
Lev V. Utkin
A. Konstantinov
32
3
0
09 Jul 2022
Softmax-free Linear Transformers
Softmax-free Linear Transformers
Jiachen Lu
Junge Zhang
Xiatian Zhu
Jianfeng Feng
Tao Xiang
Li Zhang
ViT
11
7
0
05 Jul 2022
Overview of Deep Learning-based CSI Feedback in Massive MIMO Systems
Overview of Deep Learning-based CSI Feedback in Massive MIMO Systems
Jiajia Guo
Chao-Kai Wen
Shi Jin
Geoffrey Ye Li
29
146
0
29 Jun 2022
The Third Place Solution for CVPR2022 AVA Accessibility Vision and
  Autonomy Challenge
The Third Place Solution for CVPR2022 AVA Accessibility Vision and Autonomy Challenge
Bo Yan
Leilei Cao
Zhuang Li
Hongbin Wang
24
0
0
28 Jun 2022
Bilateral Network with Channel Splitting Network and Transformer for
  Thermal Image Super-Resolution
Bilateral Network with Channel Splitting Network and Transformer for Thermal Image Super-Resolution
Bo Yan
Leilei Cao
Fengliang Qi
Hongbin Wang
ViT
12
1
0
24 Jun 2022
A novel adversarial learning strategy for medical image classification
A novel adversarial learning strategy for medical image classification
Zong Fan
Xiaohui Zhang
Jacob A. Gasienica
Jennifer Potts
S. Ruan
W. Thorstad
Hiram Gay
Pengfei Song
Xiaowei Wang
Hua Li
GAN
MedIm
16
5
0
23 Jun 2022
Vicinity Vision Transformer
Vicinity Vision Transformer
Weixuan Sun
Zhen Qin
Huiyuan Deng
Jianyuan Wang
Yi Zhang
Kaihao Zhang
Nick Barnes
Stan Birchfield
Lingpeng Kong
Yiran Zhong
ViT
34
31
0
21 Jun 2022
Positional Label for Self-Supervised Vision Transformer
Positional Label for Self-Supervised Vision Transformer
Zhemin Zhang
Xun Gong
ViT
MDE
12
6
0
10 Jun 2022
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang
Kaihao Zhang
Wenhan Luo
Changsheng Li
Guoren Wang
CVBM
29
17
0
08 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
31
82
0
01 Jun 2022
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for
  Edge Devices
Efficient Multi-Purpose Cross-Attention Based Image Alignment Block for Edge Devices
Bahri Batuhan Bilecen
Alparslan Fisne
Mustafa Ayazoglu
18
2
0
01 Jun 2022
WaveMix: A Resource-efficient Neural Network for Image Analysis
WaveMix: A Resource-efficient Neural Network for Image Analysis
Pranav Jeevan
Kavitha Viswanathan
S. AnanduA
A. Sethi
15
20
0
28 May 2022
VNT-Net: Rotational Invariant Vector Neuron Transformers
VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling
Andrei Sharf
3DPC
24
1
0
19 May 2022
Transformers in 3D Point Clouds: A Survey
Transformers in 3D Point Clouds: A Survey
Dening Lu
Qian Xie
Mingqiang Wei
Kyle Gao
Linlin Xu
Jonathan Li
3DPC
ViT
32
49
0
16 May 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
Wenqiang Zhang
Zilong Huang
Guozhong Luo
Tao Chen
Xinggang Wang
Wenyu Liu
Gang Yu
Chunhua Shen
ViT
22
198
0
12 Apr 2022
HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud
HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud
Zhixing Hou
Yan Yan
Chengzhong Xu
Hui Kong
ViT
6
23
0
12 Apr 2022
Previous
1234567
Next