ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.03348
  4. Cited By
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

7 June 2021
Yufei Xu
Qiming Zhang
Jing Zhang
Dacheng Tao
    ViT
ArXivPDFHTML

Papers citing "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias"

50 / 197 papers shown
Title
TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series
TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series
Xiaolei Qin
Di Wang
J. Zhang
Fengxiang Wang
Xin Su
Bo Du
Liangpei Zhang
AI4TS
18
0
0
13 May 2025
Attention IoU: Examining Biases in CelebA using Attention Maps
Attention IoU: Examining Biases in CelebA using Attention Maps
Aaron Serianni
Tyler Zhu
Olga Russakovsky
V. V. Ramaswamy
34
0
0
25 Mar 2025
Towards Long-Range ENSO Prediction with an Explainable Deep Learning Model
Towards Long-Range ENSO Prediction with an Explainable Deep Learning Model
Qi Chen
Yinghao Cui
Guobin Hong
Karumuri Ashok
Yuchun Pu
Xiaogu Zheng
Xuanze Zhang
Wei Zhong
Peng Zhan
Z. Wang
AI4Cl
35
0
0
25 Mar 2025
Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration
Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration
Yawei Li
Bin Ren
Jingyun Liang
Rakesh Ranjan
Mengyuan Liu
N. Sebe
Ming-Hsuan Yang
Luca Benini
56
0
0
22 Mar 2025
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
H. Wang
Kai Hu
Liangcai Gao
129
0
0
20 Mar 2025
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
53
0
0
19 Mar 2025
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
Anzhe Cheng
Chenzhong Yin
Yu Chang
Heng Ping
Shixuan Li
Shahin Nazarian
Paul Bogdan
SSeg
86
0
0
11 Mar 2025
Meta Learning not to Learn: Robustly Informing Meta-Learning under Nuisance-Varying Families
Louis McConnell
OOD
CML
42
0
0
06 Mar 2025
PARF-Net: integrating pixel-wise adaptive receptive fields into hybrid Transformer-CNN network for medical image segmentation
Xu Ma
Mengsheng Chen
Junhui Zhang
Lijuan Song
Fang Du
Zhenhua Yu
ViT
MedIm
31
0
0
06 Jan 2025
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for
  Medical Image Segmentation
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation
Yufeng Jiang
Zongxi Li
Xiangyan Chen
Haoran Xie
Jing Cai
Mamba
37
1
0
31 Oct 2024
HRPVT: High-Resolution Pyramid Vision Transformer for medium and
  small-scale human pose estimation
HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation
Zhoujie Xu
ViT
3DH
36
2
0
29 Oct 2024
PViT: Prior-augmented Vision Transformer for Out-of-distribution Detection
PViT: Prior-augmented Vision Transformer for Out-of-distribution Detection
Tianhao Zhang
Zhixiang Chen
Lyudmila Mihaylova
92
0
0
27 Oct 2024
Designing Concise ConvNets with Columnar Stages
Designing Concise ConvNets with Columnar Stages
Ashish Kumar
Jaesik Park
MQ
23
0
0
05 Oct 2024
How Effective is Pre-training of Large Masked Autoencoders for
  Downstream Earth Observation Tasks?
How Effective is Pre-training of Large Masked Autoencoders for Downstream Earth Observation Tasks?
Jose Sosa
Mohamed Aloulou
Danila Rukhovich
Rim Sleimi
Boonyarit Changaival
Anis Kacem
Djamila Aouada
35
0
0
27 Sep 2024
TBConvL-Net: A Hybrid Deep Learning Architecture for Robust Medical
  Image Segmentation
TBConvL-Net: A Hybrid Deep Learning Architecture for Robust Medical Image Segmentation
Shahzaib Iqbal
Tariq M. Khan
Syed S. Naqvi
Asim Naveed
Erik H. W. Meijering
MedIm
48
6
0
05 Sep 2024
AstroMAE: Redshift Prediction Using a Masked Autoencoder with a Novel
  Fine-Tuning Architecture
AstroMAE: Redshift Prediction Using a Masked Autoencoder with a Novel Fine-Tuning Architecture
Amirreza Dolatpour Fathkouhi
Geoffrey Charles Fox
21
1
0
03 Sep 2024
Towards Modality-agnostic Label-efficient Segmentation with
  Entropy-Regularized Distribution Alignment
Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
32
0
0
29 Aug 2024
SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through
  Residual Visual Mamba Layers and Shape Priors
SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors
Zhiqing Zhang
Tianyong Liu
Guojia Fan
Bin Li
Qianjin Feng
Shoujun Zhou
Mamba
29
1
0
28 Aug 2024
Disentangle and denoise: Tackling context misalignment for video moment
  retrieval
Disentangle and denoise: Tackling context misalignment for video moment retrieval
Kaijing Ma
Han Fang
Xianghao Zang
Chao Ban
Lanxiang Zhou
Zhongjiang He
Yongxiang Li
Hao Sun
Zerun Feng
Xingsong Hou
37
1
0
14 Aug 2024
Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream
  Machine Learning Services
Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services
Shaopeng Fu
Xuexue Sun
Ke Qing
Tianhang Zheng
Di Wang
AAML
MIACV
SILM
48
0
0
05 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved
  Denoising Training
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
...
Ziqiang Cao
Larry Head
Yue Zhang
Jielei Zhang
Huyang Sun
DiffM
23
5
0
01 Aug 2024
Depth-Wise Convolutions in Vision Transformers for Efficient Training on
  Small Datasets
Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets
Tianxiao Zhang
Wenju Xu
Bo Luo
Guanghui Wang
ViT
MDE
36
7
0
28 Jul 2024
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification
Omar S. El-Assiouti
Ghada Hamed
Dina Khattab
H. M. Ebied
27
1
0
10 Jul 2024
CTRL-F: Pairing Convolution with Transformer for Image Classification
  via Multi-Level Feature Cross-Attention and Representation Learning Fusion
CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion
Hosam S. El-Assiouti
Hadeer El-Saadawy
M. Al-Berry
M. Tolba
ViT
47
0
0
09 Jul 2024
PoseBench: Benchmarking the Robustness of Pose Estimation Models under
  Corruptions
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions
Sihan Ma
Jing Zhang
Qiong Cao
Dacheng Tao
24
2
0
20 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Bo Du
Dacheng Tao
Liangpei Zhang
59
23
0
17 Jun 2024
DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for
  Hyperspectral Image Classification
DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification
Jiamu Sheng
Jingyi Zhou
Jiong Wang
Peng Ye
Jiayuan Fan
40
12
0
11 Jun 2024
LookHere: Vision Transformers with Directed Attention Generalize and
  Extrapolate
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
34
2
0
22 May 2024
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for
  Remote Sensing Image Interpretation
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation
Wentao Jiang
Jing Zhang
Di Wang
Qiming Zhang
Zengmao Wang
Bo Du
29
5
0
16 May 2024
Promoting AI Equity in Science: Generalized Domain Prompt Learning for
  Accessible VLM Research
Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research
Qinglong Cao
Yuntian Chen
Lu Lu
Hao Sun
Zhenzhong Zeng
Xiaokang Yang
Dong-juan Zhang
VLM
24
1
0
14 May 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout
  Analysis
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
Tianci Bi
Xiaoyi Zhang
Zhizheng Zhang
Wenxuan Xie
Cuiling Lan
Yan Lu
Nanning Zheng
VLM
45
1
0
13 May 2024
Self-supervised visual learning in the low-data regime: a comparative
  evaluation
Self-supervised visual learning in the low-data regime: a comparative evaluation
Sotirios Konstantakos
Despina Ioanna Chalkiadaki
Ioannis Mademlis
Yuki M. Asano
E. Gavves
Georgios Th. Papadopoulos
29
6
0
26 Apr 2024
Federated Learning with Only Positive Labels by Exploring Label
  Correlations
Federated Learning with Only Positive Labels by Exploring Label Correlations
Xuming An
Dui Wang
Li Shen
Yong Luo
Han Hu
Bo Du
Yonggang Wen
Dacheng Tao
FedML
20
0
0
24 Apr 2024
Change Guiding Network: Incorporating Change Prior to Guide Change
  Detection in Remote Sensing Imagery
Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery
Chengxi Han
Chen Wu
Haonan Guo
Meiqi Hu
Jiepan Li
Hongruixuan Chen
45
62
0
14 Apr 2024
HANet: A Hierarchical Attention Network for Change Detection With
  Bitemporal Very-High-Resolution Remote Sensing Images
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images
Chengxi Han
Chen Wu
Haonan Guo
Meiqi Hu
Hongruixuan Chen
23
87
0
14 Apr 2024
Pneumonia App: a mobile application for efficient pediatric pneumonia
  diagnosis using explainable convolutional neural networks (CNN)
Pneumonia App: a mobile application for efficient pediatric pneumonia diagnosis using explainable convolutional neural networks (CNN)
Jiaming Deng
Zhenglin Chen
Minjiang Chen
Lulu Xu
Jiaqi Yang
Zhendong Luo
Peiwu Qin
46
2
0
31 Mar 2024
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation
Jijie He
Wenwu Yang
3DH
35
2
0
29 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
26
86
0
26 Mar 2024
MTP: Advancing Remote Sensing Foundation Model via Multi-Task
  Pretraining
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining
Di Wang
Jing Zhang
Minqiang Xu
Lin Liu
Dongsheng Wang
...
Chengxi Han
Haonan Guo
Bo Du
Dacheng Tao
L. Zhang
31
44
0
20 Mar 2024
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
Yuxuan Li
Xiang Li
Yimain Dai
Qibin Hou
Li Liu
Yongxiang Liu
Ming-Ming Cheng
Jian Yang
34
31
0
18 Mar 2024
What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised
  Generic Event Boundary Detection
What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection
Sourabh Vasant Gothe
Vibhav Agarwal
Sourav Ghosh
Jayesh Rajkumar Vachhani
Pranay Kashyap
Barath Raj Kandur
20
2
0
15 Feb 2024
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for
  Computer Vision: A survey
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
Haruna Yunusa
Shiyin Qin
Abdulrahman Hamman Adama Chukkol
Abdulganiyu Abdu Yusuf
Isah Bello
A. Lawan
ViT
22
13
0
05 Feb 2024
Topology-Informed Graph Transformer
Topology-Informed Graph Transformer
Yuncheol Choi
Sun Woo Park
Minho Lee
Youngho Woo
21
3
0
03 Feb 2024
CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration
CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration
Daniele Cattaneo
Abhinav Valada
29
6
0
31 Jan 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
36
29
0
29 Jan 2024
Unraveling the Key Components of OOD Generalization via Diversification
Unraveling the Key Components of OOD Generalization via Diversification
Harold Benoit
Liangze Jiang
Andrei Atanov
Ouguzhan Fatih Kar
Mattia Rigotti
Amir Zamir
CML
29
2
0
26 Dec 2023
APTv2: Benchmarking Animal Pose Estimation and Tracking with a
  Large-scale Dataset and Beyond
APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond
Yuxiang Yang
Yingqi Deng
Yufei Xu
Jing Zhang
23
4
0
25 Dec 2023
Pre-trained Trojan Attacks for Visual Recognition
Pre-trained Trojan Attacks for Visual Recognition
Aishan Liu
Xinwei Zhang
Yisong Xiao
Yuguang Zhou
Siyuan Liang
Jiakai Wang
Xianglong Liu
Xiaochun Cao
Dacheng Tao
AAML
61
25
0
23 Dec 2023
ConDaFormer: Disassembled Transformer with Local Structure Enhancement
  for 3D Point Cloud Understanding
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
Lunhao Duan
Shanshan Zhao
Nan Xue
Mingming Gong
Gui-Song Xia
Dacheng Tao
ViT
16
18
0
18 Dec 2023
Domain Prompt Learning with Quaternion Networks
Domain Prompt Learning with Quaternion Networks
Qinglong Cao
Zhengqin Xu
Yuntian Chen
Chao Ma
Xiaokang Yang
VLM
29
10
0
12 Dec 2023
1234
Next