ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.13621
  4. Cited By
Exploring Self-attention for Image Recognition

Exploring Self-attention for Image Recognition

28 April 2020
Hengshuang Zhao
Jiaya Jia
V. Koltun
    SSL
ArXivPDFHTML

Papers citing "Exploring Self-attention for Image Recognition"

50 / 316 papers shown
Title
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
19
0
0
13 May 2025
Multimodal Fusion of Glucose Monitoring and Food Imagery for Caloric Content Prediction
Multimodal Fusion of Glucose Monitoring and Food Imagery for Caloric Content Prediction
Adarsh Kumar
13
0
0
13 May 2025
SignX: The Foundation Model for Sign Recognition
SignX: The Foundation Model for Sign Recognition
Sen Fang
Chunyu Sui
Hongwei Yi
C. Neidle
Dimitris N. Metaxas
SLR
35
0
0
22 Apr 2025
MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework
MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework
Zhenkai Qin
Feng Zhu
Huan Zeng
Xunyi Nong
26
0
0
18 Apr 2025
Forward Learning with Differential Privacy
Forward Learning with Differential Privacy
Mingqian Feng
Zeliang Zhang
Jinyang Jiang
Yijie Peng
Chenliang Xu
39
0
0
01 Apr 2025
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
Jutika Borah
H. Singh
MedIm
45
0
0
14 Mar 2025
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
64
0
0
29 Nov 2024
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
Shitong Shao
Zikai Zhou
Tian Ye
Lichen Bai
Zhiqiang Xu
Zeke Xie
DiffM
46
0
0
16 Nov 2024
Exploring contextual modeling with linear complexity for point cloud
  segmentation
Exploring contextual modeling with linear complexity for point cloud segmentation
Yong Xien Chng
Xuchong Qiu
Yizeng Han
Yifan Pu
Jiewei Cao
Gao Huang
Mamba
31
1
0
28 Oct 2024
Learning to rumble: Automated elephant call classification, detection
  and endpointing using deep architectures
Learning to rumble: Automated elephant call classification, detection and endpointing using deep architectures
Christiaan M. Geldenhuys
Thomas R. Niesler
19
0
0
15 Oct 2024
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural
  Networks with Convolutional ARMA Filters
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
Kovvuri Sai Gopal Reddy
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
Snehasis Mukherjee
16
1
0
08 Oct 2024
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video
  Synthesis
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
Shitong Shao
Zikai Zhou
Lichen Bai
Haoyi Xiong
Zeke Xie
VGen
51
1
0
05 Oct 2024
Feature Importance in Pedestrian Intention Prediction: A Context-Aware
  Review
Feature Importance in Pedestrian Intention Prediction: A Context-Aware Review
Mohsen Azarmi
Mahdi Rezaei
He Wang
Ali Arabian
34
1
0
11 Sep 2024
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For
  Efficient Medical Image Segmentation
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation
Shehan Perera
Yunus Erzurumlu
Deepak Gulati
Alper Yilmaz
ViT
MedIm
24
0
0
04 Sep 2024
Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for
  PostNL
Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL
Mohammad Reshadati
35
0
0
04 Sep 2024
Panoptic Perception for Autonomous Driving: A Survey
Panoptic Perception for Autonomous Driving: A Survey
Yunge Li
Lanyu Xu
32
2
0
27 Aug 2024
PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer
  Architecture
PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture
Qiang Zheng
Chao Zhang
Jian Sun
30
1
0
10 Aug 2024
Cross-Layer Feature Pyramid Transformer for Small Object Detection in
  Aerial Images
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
Zewen Du
Zhenjiang Hu
Guiyu Zhao
Ying Jin
Hongbin Ma
ViT
24
2
0
29 Jul 2024
Rethinking Attention Module Design for Point Cloud Analysis
Rethinking Attention Module Design for Point Cloud Analysis
Chengzhi Wu
Kaige Wang
Zeyun Zhong
Hao Fu
Junwei Zheng
Jiaming Zhang
Julius Pfrommer
Jürgen Beyerer
3DPC
46
1
0
27 Jul 2024
GMT: A Robust Global Association Model for Multi-Target Multi-Camera
  Tracking
GMT: A Robust Global Association Model for Multi-Target Multi-Camera Tracking
Huijie Fan
Tinghui Zhao
Qiang Wang
Baojie Fan
Yandong Tang
Lianqing Liu
40
2
0
01 Jul 2024
ATAC-Net: Zoomed view works better for Anomaly Detection
ATAC-Net: Zoomed view works better for Anomaly Detection
Shaurya Gupta
Neil Gautam
Anurag Malyala
21
0
0
20 Jun 2024
Neural Pose Representation Learning for Generating and Transferring
  Non-Rigid Object Poses
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
Seungwoo Yoo
Juil Koo
Kyeongmin Yeo
Minhyuk Sung
3DH
DRL
27
0
0
14 Jun 2024
A Multimodal Dangerous State Recognition and Early Warning System for
  Elderly with Intermittent Dementia
A Multimodal Dangerous State Recognition and Early Warning System for Elderly with Intermittent Dementia
Liyun Deng
Lei Jin
Guangcheng Wang
Quan Shi
Han Wang
11
0
0
30 May 2024
Towards Natural Machine Unlearning
Towards Natural Machine Unlearning
Zhengbao He
Tao Li
Xinwen Cheng
Zhehao Huang
Xiaolin Huang
MU
28
3
0
24 May 2024
Mesh Denoising Transformer
Mesh Denoising Transformer
Wenbo Zhao
Xianming Liu
Deming Zhai
Junjun Jiang
Xiangyang Ji
AI4CE
23
0
0
10 May 2024
UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
UnSegGNet: Unsupervised Image Segmentation using Graph Neural Networks
Kovvuri Sai
Bodduluri Saran
A. M. Adityaja
Saurabh J. Shigwan
Nitin Kumar
26
1
0
09 May 2024
CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks
CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks
Nick Nikzad
Yongsheng Gao
Jun Zhou
24
0
0
09 May 2024
AFter: Attention-based Fusion Router for RGBT Tracking
AFter: Attention-based Fusion Router for RGBT Tracking
Andong Lu
Wanyu Wang
Chenglong Li
Jin Tang
Bin Luo
52
5
0
04 May 2024
Neuromorphic Vision-based Motion Segmentation with Graph Transformer
  Neural Network
Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network
Yusra Alkendi
Rana Azzam
Sajid Javed
Lakmal Seneviratne
Yahya Zweiri
ViT
29
4
0
16 Apr 2024
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping
Hyeongjun Kwon
Jinhyun Jang
Jin-Hwa Kim
Kwonyoung Kim
Kwanghoon Sohn
35
1
0
01 Apr 2024
Surface Reconstruction from Point Clouds via Grid-based Intersection
  Prediction
Surface Reconstruction from Point Clouds via Grid-based Intersection Prediction
Hui Tian
Kai Xu
3DPC
3DV
31
1
0
21 Mar 2024
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for
  3D Image Registration
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
Abu Zahid Bin Aziz
Mokshagna Sai Teja Karanam
Tushar Kataria
Shireen Elhabian
ViT
MedIm
29
1
0
16 Mar 2024
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue
Zichao Dong
Bowen Pang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
3DPC
35
0
0
08 Mar 2024
ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals
  to Identify Epileptic Seizures
ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures
S. Rukhsar
Anil Kumar Tiwari
21
2
0
05 Mar 2024
Region-Transformer: Self-Attention Region Based Class-Agnostic Point
  Cloud Segmentation
Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation
Dipesh Gyawali
Jian Zhang
B. Karki
ViT
3DPC
19
0
0
03 Mar 2024
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Hongyu Sun
Yongcai Wang
Wang Chen
Haoran Deng
Deying Li
VPVLM
44
5
0
24 Feb 2024
PIP-Net: Pedestrian Intention Prediction in the Wild
PIP-Net: Pedestrian Intention Prediction in the Wild
Mohsen Azarmi
Mahdi Rezaei
He Wang
Sebastien Glaser
24
6
0
20 Feb 2024
PointMamba: A Simple State Space Model for Point Cloud Analysis
PointMamba: A Simple State Space Model for Point Cloud Analysis
Dingkang Liang
Xin Zhou
Wei Xu
Xingkui Zhu
Zhikang Zou
Xiaoqing Ye
Xinyu Wang
Xiang Bai
86
90
0
16 Feb 2024
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for
  Computer Vision: A survey
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
Haruna Yunusa
Shiyin Qin
Abdulrahman Hamman Adama Chukkol
Abdulganiyu Abdu Yusuf
Isah Bello
A. Lawan
ViT
30
13
0
05 Feb 2024
3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual
  Cascade Point Transformer Framework
3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework
Fan Zhang
Shuyi Mao
Qing Li
Xiaojiang Peng
3DPC
3DH
14
0
0
14 Jan 2024
Self-Attention and Hybrid Features for Replay and Deep-Fake Audio
  Detection
Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection
Lian Huang
Chi-Man Pun
15
4
0
11 Jan 2024
CoordGate: Efficiently Computing Spatially-Varying Convolutions in
  Convolutional Neural Networks
CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks
S. Howard
P. Norreys
Andreas Döpp
23
1
0
09 Jan 2024
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
Yiran Song
Qianyu Zhou
Xiangtai Li
Deng-Ping Fan
Xuequan Lu
Lizhuang Ma
VLM
30
14
0
04 Jan 2024
ROI-Aware Multiscale Cross-Attention Vision Transformer for Pest Image
  Identification
ROI-Aware Multiscale Cross-Attention Vision Transformer for Pest Image Identification
Ga-Eun Kim
Chang-Hwan Son
13
1
0
28 Dec 2023
Real-time Neural Network Inference on Extremely Weak Devices: Agile
  Offloading with Explainable AI
Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI
Kai Huang
Wei Gao
15
35
0
21 Dec 2023
Integrating Human Vision Perception in Vision Transformers for
  Classifying Waste Items
Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items
Akshat Shrivastava
Tapan K. Gandhi
19
1
0
19 Dec 2023
Semantic-Aware Transformation-Invariant RoI Align
Semantic-Aware Transformation-Invariant RoI Align
Guo-Ye Yang
George Kiyohiro Nakayama
Zikai Xiao
Tai-Jiang Mu
Xiaolei Huang
Shi-Min Hu
ObjD
21
0
0
15 Dec 2023
Multi-Granularity Framework for Unsupervised Representation Learning of
  Time Series
Multi-Granularity Framework for Unsupervised Representation Learning of Time Series
Chengyang Ye
Qiang Ma
AI4TS
8
0
0
12 Dec 2023
Large Language Models Meet Computer Vision: A Brief Survey
Large Language Models Meet Computer Vision: A Brief Survey
Raby Hamadi
LM&MA
21
4
0
28 Nov 2023
Plug-and-Play Feature Generation for Few-Shot Medical Image
  Classification
Plug-and-Play Feature Generation for Few-Shot Medical Image Classification
Qianyu Guo
Huifang Du
Xing Jia
Shuyong Gao
Yan Teng
Haofen Wang
Wenqiang Zhang
MedIm
VLM
19
0
0
14 Oct 2023
1234567
Next