ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias


7 June 2021
Yufei Xu
Qiming Zhang
Jing Zhang
Dacheng Tao
    ViT

Papers citing "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias"

50 / 197 papers shown
Title
MARformer: An Efficient Metal Artifact Reduction Transformer for Dental CBCT Images
Yuxuan Shi
Jun Xu
Dinggang Shen
MedIm
25
0
0
16 Nov 2023
BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping
S. Sastry
Subash Khanal
A. Dhakal
Di Huang
Nathan Jacobs
40
9
0
29 Oct 2023
$\mathbb{VD}$-$\mathbb{GR}$: Boosting $\mathbb{V}$isual $\mathbb{D}$ialog with Cascaded Spatial-Temporal Multi-Modal $\mathbb{GR}$aphs
Adnen Abdessaied
Lei Shi
Andreas Bulling
3DH
19
3
0
25 Oct 2023
VcT: Visual change Transformer for Remote Sensing Image Change Detection
Bo Jiang
Zitian Wang
Xixi Wang
Ziyan Zhang
Lan Chen
Xiao Wang
Bin Luo
ViT
21
38
0
17 Oct 2023
Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation
Xueye Zheng
Yunhao Luo
Pengyuan Zhou
Lin Wang
27
12
0
11 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
29
4
0
10 Oct 2023
Logical Bias Learning for Object Relation Prediction
Xinyu Zhou
Zihan Ji
Anna Zhu
CVBM
16
0
0
01 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
21
16
0
30 Sep 2023
Multi-Dimensional Hyena for Spatial Inductive Bias
Itamar Zimerman
Lior Wolf
ViT
25
4
0
24 Sep 2023
RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias
Hao Cheng
Jinhao Duan
Hui Li
Lyutianyang Zhang
Jiahang Cao
Ping Wang
Jize Zhang
Kaidi Xu
Renjing Xu
AAML
27
3
0
23 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
19
24
0
04 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
26
20
0
27 Aug 2023
CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing
Jianwei Cui
David A. Araujo
Suman Saha
Md Faisal Kabir
BDL
36
0
0
25 Aug 2023
PVG: Progressive Vision Graph for Vision Recognition
Jiafu Wu
Jian Li
Jiangning Zhang
Boshen Zhang
M. Chi
Yabiao Wang
Chengjie Wang
ViT
15
12
0
01 Aug 2023
DINO-CXR: A self supervised method based on vision transformer for chest X-ray classification
M. Shakouri
F. Iranmanesh
M. Eftekhari
MedIm
4
4
0
01 Aug 2023
Towards Generalist Biomedical AI
Tao Tu
Shekoofeh Azizi
Danny Driess
M. Schaekermann
Mohamed Amin
...
Yossi Matias
K. Singhal
Peter R. Florence
Alan Karthikesalingam
Vivek Natarajan
LM&MA
MedIm
AI4MH
35
241
0
26 Jul 2023
ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution
Mingjin Zhang
Chi Zhang
Qiming Zhang
Jie-Ru Guo
Xinbo Gao
Jing Zhang
13
28
0
26 Jul 2023
Towards General Game Representations: Decomposing Games Pixels into Content and Style
C. Trivedi
Konstantinos Makantasis
Antonios Liapis
Georgios N. Yannakakis
OCL
27
3
0
20 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
23
17
0
03 Jul 2023
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion
Isha Rawal
Alexander Matyasko
Shantanu Jaiswal
Basura Fernando
Cheston Tan
21
1
0
15 Jun 2023
Semi-supervised Cell Recognition under Point Supervision
Zhongyi Shui
Yizhi Zhao
S. Zheng
Yunlong Zhang
Honglin Li
Shichuan Zhang
Xiaoxuan Yu
Chenglu Zhu
Lin Yang
23
1
0
14 Jun 2023
FasterViT: Fast Vision Transformers with Hierarchical Attention
Ali Hatamizadeh
Greg Heinrich
Hongxu Yin
Andrew Tao
J. Álvarez
Jan Kautz
Pavlo Molchanov
ViT
17
66
0
09 Jun 2023
Inflated 3D Convolution-Transformer for Weakly-supervised Carotid Stenosis Grading with Ultrasound Videos
Xinrui Zhou
Yuhao Huang
Wufeng Xue
Xin Yang
Yuxin Zou
Qilong Ying
Yuanji Zhang
Jia Liu
Jie Jessie Ren
Dong Ni
ViT
MedIm
28
4
0
05 Jun 2023
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting
Maoyuan Ye
Jing Zhang
Shanshan Zhao
Juhua Liu
Tongliang Liu
Bo Du
Dacheng Tao
33
2
0
31 May 2023
SAR-to-Optical Image Translation via Thermodynamics-inspired Network
Mingjin Zhang
Jiamin Xu
Chengyu He
Wenteng Shang
Yunsong Li
Xinbo Gao
DiffM
15
2
0
23 May 2023
VanillaNet: the Power of Minimalism in Deep Learning
Hanting Chen
Yunhe Wang
Jianyuan Guo
Dacheng Tao
VLM
34
85
0
22 May 2023
CB-HVTNet: A channel-boosted hybrid vision transformer network for lymphocyte assessment in histopathological images
Momina Liaqat Ali
Zunaira Rauf
Asifullah Khan
A. Sohail
Rafi Ullah
Jeonghwan Gwak
MedIm
ViT
34
2
0
16 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
16
71
0
09 May 2023
Scalable Mask Annotation for Video Text Spotting
Haibin He
Jing Zhang
Mengyang Xu
Juhua Liu
Bo Du
Dacheng Tao
90
14
0
02 May 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
25
91
0
20 Apr 2023
DCN-T: Dual Context Network with Transformer for Hyperspectral Image Classification
Di Wang
Jing Zhang
Bo Du
L. Zhang
Dacheng Tao
14
50
0
19 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
37
132
0
19 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
19
2
0
13 Apr 2023
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
Yunpeng Zhang
Zhengbiao Zhu
Dalong Du
3DPC
23
182
0
11 Apr 2023
A Billion-scale Foundation Model for Remote Sensing Images
Keumgang Cha
Junghoon Seo
Taekyung Lee
30
63
0
11 Apr 2023
Deep Image Matting: A Comprehensive Survey
Jizhizi Li
Jing Zhang
Dacheng Tao
VLM
33
14
0
10 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
24
39
0
07 Apr 2023
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention
Mingyu Ding
Yikang Shen
Lijie Fan
Zhenfang Chen
Z. Chen
Ping Luo
J. Tenenbaum
Chuang Gan
ViT
77
14
0
06 Apr 2023
What Affects Learned Equivariance in Deep Image Recognition Models?
Robert-Jan Bruintjes
Tomasz Motyka
J. C. V. Gemert
15
7
0
05 Apr 2023
Unsupervised Cross-domain Pulmonary Nodule Detection without Source Data
Rui Xu
Yong Luo
Bo Du
OOD
16
1
0
03 Apr 2023
GLT-T++: Global-Local Transformer for 3D Siamese Tracking with Ranking Loss
Jiahao Nie
Zhiwei He
Yuxiang Yang
Xudong Lv
Mingchen Gao
Jing Zhang
ViT
3DPC
34
7
0
01 Apr 2023
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision Transformers
Zijun Long
Zaiqiao Meng
Gerardo Aragon Camarasa
R. McCreadie
VLM
29
5
0
31 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
19
38
0
27 Mar 2023
Rethinking Dual-Domain Undersampled MRI reconstruction: domain-specific design from the perspective of the receptive field
Ziqi Gao
S. Kevin Zhou
OOD
20
1
0
19 Mar 2023
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He
Jianfei Cai
Jing Zhang
Dacheng Tao
Bohan Zhuang
VPVLM
14
50
0
15 Mar 2023
SGDA: Towards 3D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention
Rui Xu
Zhi Liu
Yong Luo
Han Hu
Li Shen
Bo Du
Kaiming Kuang
Jiancheng Yang
ViT
170
14
0
07 Mar 2023
DPA-P2PNet: Deformable Proposal-aware P2PNet for Accurate Point-based Cell Detection
Zhongyi Shui
S. Zheng
Chenglu Zhu
Shichuan Zhang
Xiaoxuan Yu
Honglin Li
Jingxiong Li
Pingyi Chen
L. Yang
3DPC
32
4
0
05 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
W. Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
32
2
0
01 Mar 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
81
8
0
21 Feb 2023
Efficiency 360: Efficient Vision Transformers
Badri N. Patro
Vijay Srinivas Agneeswaran
21
6
0
16 Feb 2023