Exploring Self-attention for Image Recognition


28 April 2020
Hengshuang Zhao
Jiaya Jia
V. Koltun
    SSL

Papers citing "Exploring Self-attention for Image Recognition"

50 / 319 papers shown
Title
Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation
Hanz Cuevas-Velasquez
Antonio Javier Gallego
Robert B. Fisher
3DPC
63
1
0
30 Oct 2021
SOFT: Softmax-free Transformer with Linear Complexity
Jiachen Lu
Jinghan Yao
Junge Zhang
Martin Danelljan
Hang Xu
Weiguo Gao
Chunjing Xu
Thomas B. Schon
Li Zhang
165
175
0
22 Oct 2021
AIR-Nets: An Attention-Based Framework for Locally Conditioned Implicit Representations
Simon Giebenhain
Bastian Goldlücke
3DPC
77
16
0
22 Oct 2021
Multi-Stream Attention Learning for Monocular Vehicle Velocity and Inter-Vehicle Distance Estimation
Kuan-Chih Huang
Yu-Kai Huang
Winston H. Hsu
50
4
0
22 Oct 2021
Finding Strong Gravitational Lenses Through Self-Attention
H. Thuruthipilly
A. Zadrożny
Agnieszka Pollo
Marek Biesiada
94
6
0
18 Oct 2021
Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation
Patrick Ruhkamp
Daoyi Gao
Hanzhi Chen
Nassir Navab
Benjamin Busam
ViT MDE
145
37
0
15 Oct 2021
Development and testing of an image transformer for explainable autonomous driving systems
Jiqian Dong
Sikai Chen
Shuya Zong
Tiantian Chen
Mohammad Miralinaghi
Samuel Labi
ViT
95
23
0
11 Oct 2021
Attentive Walk-Aggregating Graph Neural Networks
M. F. Demirel
Shengchao Liu
Siddhant Garg
Zhenmei Shi
Yingyu Liang
153
11
0
06 Oct 2021
LEA-Net: Layer-wise External Attention Network for Efficient Color Anomaly Detection
Ryoya Katafuchi
T. Tokunaga
106
3
0
12 Sep 2021
Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification
Zhongxing Ma
Yifan Zhao
Jia Li
ViT
93
57
0
08 Sep 2021
Impact of Attention on Adversarial Robustness of Image Classification Models
Prachi Agrawal
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
AAML
91
6
0
02 Sep 2021
Searching for Efficient Multi-Stage Vision Transformers
Yi-Lun Liao
S. Karaman
Vivienne Sze
ViT
82
19
0
01 Sep 2021
Learning Inner-Group Relations on Point Clouds
Haoxi Ran
Wei Zhuo
Jing Liu
Li Lu
3DPC
123
64
0
27 Aug 2021
Relational Embedding for Few-Shot Classification
Dahyun Kang
Heeseung Kwon
Juhong Min
Minsu Cho
131
202
0
22 Aug 2021
PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds
Jiayao Shan
Sifan Zhou
Zheng Fang
Yubo Cui
ViT
143
88
0
14 Aug 2021
PVT: Point-Voxel Transformer for Point Cloud Learning
Cheng Zhang
Haocheng Wan
Xinyi Shen
Zizhao Wu
3DPC ViT
153
94
0
13 Aug 2021
Transductive Few-Shot Classification on the Oblique Manifold
Guodong Qi
Huimin Yu
Zhaohui Lu
Shuzhao Li
86
46
0
09 Aug 2021
Unifying Global-Local Representations in Salient Object Detection with Transformer
Sucheng Ren
Qiang Wen
Nanxuan Zhao
Guoqiang Han
Shengfeng He
ViT
113
28
0
05 Aug 2021
Contextual Transformer Networks for Visual Recognition
Yehao Li
Ting Yao
Yingwei Pan
Tao Mei
ViT
133
518
0
26 Jul 2021
All the attention you need: Global-local, spatial-channel attention for image retrieval
Chull Hwan Song
Hye Joo Han
Yannis Avrithis
134
48
0
16 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip Torr
156
28
0
13 Jul 2021
Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context Terms
Chenglin Yang
Siyuan Qiao
Adam Kortylewski
Alan Yuille
164
4
0
12 Jul 2021
GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen
Peixia Li
Chuming Li
Baopu Li
Mengwei He
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
185
87
0
07 Jul 2021
Test-Time Personalization with a Transformer for Human Pose Estimation
Yizhuo Li
Miao Hao
Zonglin Di
N. B. Gundavarapu
Xiaolong Wang
ViT
163
50
0
05 Jul 2021
Learning Geometric Combinatorial Optimization Problems using Self-attention and Domain Knowledge
Jaeseung Lee
Woojin Choi
Jibum Kim
3DPC
54
0
0
05 Jul 2021
AutoFormer: Searching Transformers for Visual Recognition
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
ViT
135
286
0
01 Jul 2021
Early Convolutions Help Transformers See Better
Tete Xiao
Mannat Singh
Eric Mintun
Trevor Darrell
Piotr Dollár
Ross B. Girshick
178
802
0
28 Jun 2021
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
240
342
0
24 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
108
13
0
23 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
232
139
0
21 Jun 2021
Visual Correspondence Hallucination
Hugo Germain
Vincent Lepetit
Guillaume Bourmaud
99
11
0
17 Jun 2021
XCiT: Cross-Covariance Image Transformers
Alaaeldin El-Nouby
Hugo Touvron
Mathilde Caron
Piotr Bojanowski
Matthijs Douze
...
Ivan Laptev
Natalia Neverova
Gabriel Synnaeve
Jakob Verbeek
Edouard Grave
ViT
227
539
0
17 Jun 2021
Transformed CNNs: recasting pre-trained convolutional layers with self-attention
Stéphane d'Ascoli
Levent Sagun
Giulio Biroli
Ari S. Morcos
ViT
62
6
0
10 Jun 2021
On the Connection between Local Attention and Dynamic Depth-wise Convolution
Qi Han
Zejia Fan
Qi Dai
Lei-huan Sun
Ming-Ming Cheng
Jiaying Liu
Jingdong Wang
ViT
196
116
0
08 Jun 2021
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
151
9
0
05 Jun 2021
RegionViT: Regional-to-Local Attention for Vision Transformers
Chun-Fu Chen
Yikang Shen
Quanfu Fan
ViT
209
206
0
04 Jun 2021
Glance-and-Gaze Vision Transformer
Qihang Yu
Yingda Xia
Yutong Bai
Yongyi Lu
Alan Yuille
Wei Shen
ViT
108
77
0
04 Jun 2021
Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
Yechao Bai
Ziyuan Huang
Lyuyu Shen
Hongliang Guo
Marcelo H. Ang Jr
Daniela Rus
SSeg
47
4
0
03 Jun 2021
DLA-Net: Learning Dual Local Attention Features for Semantic Segmentation of Large-Scale Building Facade Point Clouds
Yanfei Su
Weiquan Liu
Zhimin Yuan
Ming Cheng
Zhihong Zhang
Xuelun Shen
Cheng-Yu Wang
3DPC
138
44
0
01 Jun 2021
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
Jiemin Fang
Lingxi Xie
Xinggang Wang
Xiaopeng Zhang
Wenyu Liu
Qi Tian
ViT
94
78
0
31 May 2021
KVT: k-NN Attention for Boosting Vision Transformers
Pichao Wang
Qingsong Wen
F. Wang
Ming Lin
Shuning Chang
Hao Li
Rong Jin
ViT
169
112
0
28 May 2021
Interpretable UAV Collision Avoidance using Deep Reinforcement Learning
Deep Thomas
Daniil Olshanskyi
Karter Krueger
Tichakorn Wongpiromsarn
Ali Jannesari
122
5
0
25 May 2021
Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection
Yongxiang Gu
Xiaolin Qin
Yuncong Peng
Lu Li
ViT
66
7
0
20 May 2021
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
Bing Li
Cheng Zheng
Silvio Giancola
Guohao Li
ViT 3DPC
120
46
0
10 May 2021
Graph Attention Networks with Positional Embeddings
Liheng Ma
Reihaneh Rabbany
Adriana Romero Soriano
GNN
119
21
0
09 May 2021
ResMLP: Feedforward networks for image classification with data-efficient training
Hugo Touvron
Piotr Bojanowski
Mathilde Caron
Matthieu Cord
Alaaeldin El-Nouby
...
Gautier Izacard
Armand Joulin
Gabriel Synnaeve
Jakob Verbeek
Edouard Grave
VLM
220
711
0
07 May 2021
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
Meng-Hao Guo
Zheng-Ning Liu
Tai-Jiang Mu
Shimin Hu
126
526
0
05 May 2021
Dual-Cross Central Difference Network for Face Anti-Spoofing
Zitong Yu
Yunxiao Qin
Hengshuang Zhao
Amritpal Singh
Guoying Zhao
CVBM
87
111
0
04 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Edouard Grave
Julien Mairal
Piotr Bojanowski
Armand Joulin
1.4K
6,761
0
29 Apr 2021
ConTNet: Why not use convolution and transformer at the same time?
Haotian Yan
Zhe Li
Weijian Li
Changhu Wang
Ming Wu
Chuang Zhang
ViT
134
80
0
27 Apr 2021