ResearchTrend.AI

Exploring Self-attention for Image Recognition
Hengshuang Zhao, Jiaya Jia, Vladlen Koltun
arXiv:2004.13621 · 28 April 2020 · SSL

Papers citing "Exploring Self-attention for Image Recognition" (50 of 316 shown)

1. Finding Strong Gravitational Lenses Through Self-Attention
   H. Thuruthipilly, A. Zadrożny, Agnieszka Pollo, Marek Biesiada · 6 citations · 18 Oct 2021
2. Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation
   Patrick Ruhkamp, Daoyi Gao, Hanzhi Chen, Nassir Navab, Benjamin Busam · ViT, MDE · 36 citations · 15 Oct 2021
3. Development and testing of an image transformer for explainable autonomous driving systems
   Jiqian Dong, Sikai Chen, Shuya Zong, Tiantian Chen, Mohammad Miralinaghi, S. Labi · ViT · 23 citations · 11 Oct 2021
4. Attentive Walk-Aggregating Graph Neural Networks
   M. F. Demirel, Shengchao Liu, Siddhant Garg, Zhenmei Shi, Yingyu Liang · 9 citations · 06 Oct 2021
5. LEA-Net: Layer-wise External Attention Network for Efficient Color Anomaly Detection
   Ryoya Katafuchi, T. Tokunaga · 3 citations · 12 Sep 2021
6. Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification
   Zhongxing Ma, Yifan Zhao, Jia Li · ViT · 53 citations · 08 Sep 2021
7. Impact of Attention on Adversarial Robustness of Image Classification Models
   Prachi Agrawal, Narinder Singh Punn, S. K. Sonbhadra, Sonali Agarwal · AAML · 6 citations · 02 Sep 2021
8. Searching for Efficient Multi-Stage Vision Transformers
   Yi-Lun Liao, S. Karaman, Vivienne Sze · ViT · 19 citations · 01 Sep 2021
9. Learning Inner-Group Relations on Point Clouds
   Haoxi Ran, Wei Zhuo, J. Liu, Li Lu · 3DPC · 59 citations · 27 Aug 2021
10. Relational Embedding for Few-Shot Classification
   Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho · 185 citations · 22 Aug 2021
11. PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds
   Jiayao Shan, Sifan Zhou, Zheng Fang, Yubo Cui · ViT · 79 citations · 14 Aug 2021
12. PVT: Point-Voxel Transformer for Point Cloud Learning
   Cheng Zhang, Haocheng Wan, Xinyi Shen, Zizhao Wu · 3DPC, ViT · 80 citations · 13 Aug 2021
13. Transductive Few-Shot Classification on the Oblique Manifold
   Guodong Qi, Huimin Yu, Zhaohui Lu, Shuzhao Li · 45 citations · 09 Aug 2021
14. Unifying Global-Local Representations in Salient Object Detection with Transformer
   Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, Shengfeng He · ViT · 25 citations · 05 Aug 2021
15. Contextual Transformer Networks for Visual Recognition
   Yehao Li, Ting Yao, Yingwei Pan, Tao Mei · ViT · 468 citations · 26 Jul 2021
16. All the attention you need: Global-local, spatial-channel attention for image retrieval
   Chull Hwan Song, Hye Joo Han, Yannis Avrithis · 39 citations · 16 Jul 2021
17. Visual Parser: Representing Part-whole Hierarchies with Transformers
   Shuyang Sun, Xiaoyu Yue, S. Bai, Philip H. S. Torr · 27 citations · 13 Jul 2021
18. Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context Terms
   Chenglin Yang, Siyuan Qiao, Adam Kortylewski, Alan Yuille · 4 citations · 12 Jul 2021
19. GLiT: Neural Architecture Search for Global and Local Image Transformer
   Boyu Chen, Peixia Li, Chuming Li, Baopu Li, Lei Bai, Chen Lin, Ming-hui Sun, Junjie Yan, Wanli Ouyang · ViT · 85 citations · 07 Jul 2021
20. Test-Time Personalization with a Transformer for Human Pose Estimation
   Yizhuo Li, Miao Hao, Zonglin Di, N. B. Gundavarapu, Xiaolong Wang · ViT · 45 citations · 05 Jul 2021
21. Learning Geometric Combinatorial Optimization Problems using Self-attention and Domain Knowledge
   Jaeseung Lee, Woojin Choi, Jibum Kim · 3DPC · 0 citations · 05 Jul 2021
22. AutoFormer: Searching Transformers for Visual Recognition
   Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling · ViT · 259 citations · 01 Jul 2021
23. Early Convolutions Help Transformers See Better
   Tete Xiao, Mannat Singh, Eric Mintun, Trevor Darrell, Piotr Dollár, Ross B. Girshick · 752 citations · 28 Jun 2021
24. VOLO: Vision Outlooker for Visual Recognition
   Li-xin Yuan, Qibin Hou, Zihang Jiang, Jiashi Feng, Shuicheng Yan · ViT · 313 citations · 24 Jun 2021
25. Probabilistic Attention for Interactive Segmentation
   Prasad Gabbur, Manjot Bilkhu, J. Movellan · 13 citations · 23 Jun 2021
26. TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
   Michael S. Ryoo, A. Piergiovanni, Anurag Arnab, Mostafa Dehghani, A. Angelova · ViT · 127 citations · 21 Jun 2021
27. Visual Correspondence Hallucination
   Hugo Germain, Vincent Lepetit, Guillaume Bourmaud · 11 citations · 17 Jun 2021
28. XCiT: Cross-Covariance Image Transformers
   Alaaeldin El-Nouby, Hugo Touvron, Mathilde Caron, Piotr Bojanowski, Matthijs Douze, ..., Ivan Laptev, Natalia Neverova, Gabriel Synnaeve, Jakob Verbeek, Hervé Jégou · ViT · 497 citations · 17 Jun 2021
29. Transformed CNNs: recasting pre-trained convolutional layers with self-attention
   Stéphane d'Ascoli, Levent Sagun, Giulio Biroli, Ari S. Morcos · ViT · 6 citations · 10 Jun 2021
30. On the Connection between Local Attention and Dynamic Depth-wise Convolution
   Qi Han, Zejia Fan, Qi Dai, Lei-huan Sun, Ming-Ming Cheng, Jiaying Liu, Jingdong Wang · ViT · 105 citations · 08 Jun 2021
31. Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
   Yihong Dong, Ying Peng, Muqiao Yang, Songtao Lu, Qingjiang Shi · 9 citations · 05 Jun 2021
32. RegionViT: Regional-to-Local Attention for Vision Transformers
   Chun-Fu Chen, Rameswar Panda, Quanfu Fan · ViT · 193 citations · 04 Jun 2021
33. Glance-and-Gaze Vision Transformer
   Qihang Yu, Yingda Xia, Yutong Bai, Yongyi Lu, Alan Yuille, Wei Shen · ViT · 74 citations · 04 Jun 2021
34. Multi-Scale Feature Aggregation by Cross-Scale Pixel-to-Region Relation Operation for Semantic Segmentation
   Yechao Bai, Ziyuan Huang, Lyuyu Shen, Hongliang Guo, Marcelo H. Ang Jr, Daniela Rus · SSeg · 4 citations · 03 Jun 2021
35. DLA-Net: Learning Dual Local Attention Features for Semantic Segmentation of Large-Scale Building Facade Point Clouds
   Yanfei Su, Weiquan Liu, Zhimin Yuan, Ming Cheng, Zhihong Zhang, Xuelun Shen, Cheng-Yu Wang · 3DPC · 38 citations · 01 Jun 2021
36. MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
   Jiemin Fang, Lingxi Xie, Xinggang Wang, Xiaopeng Zhang, Wenyu Liu, Qi Tian · ViT · 73 citations · 31 May 2021
37. KVT: k-NN Attention for Boosting Vision Transformers
   Pichao Wang, Xue Wang, F. Wang, Ming Lin, Shuning Chang, Hao Li, R. L. Jin · ViT · 105 citations · 28 May 2021
38. Interpretable UAV Collision Avoidance using Deep Reinforcement Learning
   Deep Thomas, Daniil Olshanskyi, Karter Krueger, Tichakorn Wongpiromsarn, Ali Jannesari · 5 citations · 25 May 2021
39. Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection
   Yongxiang Gu, Xiaolin Qin, Yuncong Peng, Lu Li · ViT · 6 citations · 20 May 2021
40. SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation
   Bing Li, Cheng Zheng, Silvio Giancola, Bernard Ghanem · ViT, 3DPC · 44 citations · 10 May 2021
41. Graph Attention Networks with Positional Embeddings
   Liheng Ma, Reihaneh Rabbany, Adriana Romero Soriano · GNN · 20 citations · 09 May 2021
42. ResMLP: Feedforward networks for image classification with data-efficient training
   Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, ..., Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Hervé Jégou · VLM · 655 citations · 07 May 2021
43. Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
   Meng-Hao Guo, Zheng-Ning Liu, Tai-Jiang Mu, Shimin Hu · 473 citations · 05 May 2021
44. Dual-Cross Central Difference Network for Face Anti-Spoofing
   Zitong Yu, Yunxiao Qin, Hengshuang Zhao, Xiaobai Li, Guoying Zhao · CVBM · 103 citations · 04 May 2021
45. Emerging Properties in Self-Supervised Vision Transformers
   Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin · 5,773 citations · 29 Apr 2021
46. ConTNet: Why not use convolution and transformer at the same time?
   Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang · ViT · 76 citations · 27 Apr 2021
47. If your data distribution shifts, use self-learning
   E. Rusak, Steffen Schneider, George Pachitariu, L. Eck, Peter V. Gehler, Oliver Bringmann, Wieland Brendel, Matthias Bethge · VLM, OOD, TTA · 29 citations · 27 Apr 2021
48. Visformer: The Vision-friendly Transformer
   Zhengsu Chen, Lingxi Xie, Jianwei Niu, Xuefeng Liu, Longhui Wei, Qi Tian · ViT · 209 citations · 26 Apr 2021
49. AttWalk: Attentive Cross-Walks for Deep Mesh Analysis
   Ran Ben Izhak, Alon Lahav, A. Tal · 3DV · 10 citations · 23 Apr 2021
50. Multiscale Vision Transformers
   Haoqi Fan, Bo Xiong, K. Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer · ViT · 1,221 citations · 22 Apr 2021