ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.12956
  4. Cited By
Inception Transformer

Inception Transformer

25 May 2022
Chenyang Si
Weihao Yu
Pan Zhou
Yichen Zhou
Xinchao Wang
Shuicheng Yan
    ViT
ArXivPDFHTML

Papers citing "Inception Transformer"

50 / 72 papers shown
Title
CVVNet: A Cross-Vertical-View Network for Gait Recognition
CVVNet: A Cross-Vertical-View Network for Gait Recognition
X. Li
Wei Song
Yingda Huang
Wei Meng
Le Chang
CVBM
28
0
0
03 May 2025
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
Pin-Chi Pan
Soo-Chang Pei
54
0
0
28 Apr 2025
UniViTAR: Unified Vision Transformer with Native Resolution
UniViTAR: Unified Vision Transformer with Native Resolution
Limeng Qiao
Yiyang Gan
Bairui Wang
Jie Qin
Shuang Xu
Siqi Yang
Lin Ma
50
0
0
02 Apr 2025
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through
  Frequency-Based Adaptation
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation
S. Ly
Hien Nguyen
72
1
0
28 Nov 2024
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba
Xiaowen Ma
Zhenliang Ni
Xinghao Chen
Mamba
73
2
0
26 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
33
0
0
12 Nov 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
29
13
0
15 Oct 2024
Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy
Bojian Li
Bo Liu
Jinghua Yue
F. Zhou
Fugen Zhou
MedIm
MDE
45
2
0
12 Sep 2024
LowFormer: Hardware Efficient Design for Convolutional Transformer
  Backbones
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
Moritz Nottebaum
Matteo Dunnhofer
C. Micheloni
ViT
29
1
0
05 Sep 2024
AstroMAE: Redshift Prediction Using a Masked Autoencoder with a Novel
  Fine-Tuning Architecture
AstroMAE: Redshift Prediction Using a Masked Autoencoder with a Novel Fine-Tuning Architecture
Amirreza Dolatpour Fathkouhi
Geoffrey Charles Fox
21
1
0
03 Sep 2024
Towards the Spectral bias Alleviation by Normalizations in Coordinate
  Networks
Towards the Spectral bias Alleviation by Normalizations in Coordinate Networks
Zhicheng Cai
Hao Zhu
Qiu Shen
Xinran Wang
Xun Cao
27
0
0
25 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
26
3
0
16 Jul 2024
Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor
  Control
Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control
Dongyoon Hwang
ByungKun Lee
Hojoon Lee
Hyunseung Kim
Jaegul Choo
35
0
0
10 Jun 2024
Conv-INR: Convolutional Implicit Neural Representation for Multimodal
  Visual Signals
Conv-INR: Convolutional Implicit Neural Representation for Multimodal Visual Signals
Zhicheng Cai
28
0
0
06 Jun 2024
Encoding Semantic Priors into the Weights of Implicit Neural
  Representation
Encoding Semantic Priors into the Weights of Implicit Neural Representation
Zhicheng Cai
Qiu Shen
28
0
0
06 Jun 2024
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any
  Resolution
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
Wenzhuo Liu
Fei Zhu
Shijie Ma
Cheng-Lin Liu
18
4
0
28 May 2024
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for
  Vision Transformer
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
39
3
0
22 May 2024
Vision Transformer with Sparse Scan Prior
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
36
5
0
22 May 2024
Score-CDM: Score-Weighted Convolutional Diffusion Model for Multivariate
  Time Series Imputation
Score-CDM: Score-Weighted Convolutional Diffusion Model for Multivariate Time Series Imputation
Shunyang Zhang
Senzhang Wang
Hao Miao
Hao Chen
Changjun Fan
Jian Zhang
38
2
0
21 May 2024
MambaOut: Do We Really Need Mamba for Vision?
MambaOut: Do We Really Need Mamba for Vision?
Weihao Yu
Xinchao Wang
Mamba
39
47
0
13 May 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for
  Long Sequence Modelling: Methods, Applications, and Challenges
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
35
38
0
24 Apr 2024
MambaMixer: Efficient Selective State Space Models with Dual Token and
  Channel Selection
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz
Michele Santacatterina
Ramin Zabih
39
31
0
29 Mar 2024
ViTAR: Vision Transformer with Any Resolution
ViTAR: Vision Transformer with Any Resolution
Qihang Fan
Quanzeng You
Xiaotian Han
Yongfei Liu
Yunzhe Tao
Huaibo Huang
Ran He
Hongxia Yang
ViT
37
14
0
27 Mar 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and
  Time-Series Analysis
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
43
2
0
26 Mar 2024
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate
  Time series
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
51
50
0
22 Mar 2024
Spiking Wavelet Transformer
Spiking Wavelet Transformer
Yuetong Fang
Ziqing Wang
Lingfeng Zhang
Jiahang Cao
Honglei Chen
Renjing Xu
59
4
0
17 Mar 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature
  Interaction for Dense Predictions
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
26
45
0
12 Mar 2024
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic
  Architecture
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture
Fei Wang
Dan Guo
Kun Li
Zhun Zhong
Mengqing Wang
34
16
0
12 Mar 2024
GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross
  Appearance-Edge Learning
GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning
Yaning Zhang
Zitong Yu
Xiaobin Huang
Linlin Shen
Jianfeng Ren
22
5
0
03 Feb 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
34
29
0
29 Jan 2024
Leveraging Frequency Domain Learning in 3D Vessel Segmentation
Leveraging Frequency Domain Learning in 3D Vessel Segmentation
Xinyuan Wang
C. Pan
Hongming Dai
Gangming Zhao
Jinpeng Li
Xiao Zhang
Yizhou Yu
20
0
0
11 Jan 2024
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly
  Detection
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection
Jiangning Zhang
Xuhai Chen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
Ming-Hsuan Yang
Dacheng Tao
23
24
0
12 Dec 2023
Advancing Vision Transformers with Group-Mix Attention
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Li Yuan
Jiangliu Wang
Yibing Song
Ping Luo
112
16
0
26 Nov 2023
Scattering Vision Transformer: Spectral Mixing Matters
Scattering Vision Transformer: Spectral Mixing Matters
Badri N. Patro
Vijay Srinivas Agneeswaran
24
14
0
02 Nov 2023
Entropic Score metric: Decoupling Topology and Size in Training-free NAS
Entropic Score metric: Decoupling Topology and Size in Training-free NAS
Niccolò Cavagnero
Luc Robbiano
Francesca Pistilli
Barbara Caputo
Giuseppe Averta
16
2
0
06 Oct 2023
SeisT: A foundational deep learning model for earthquake monitoring
  tasks
SeisT: A foundational deep learning model for earthquake monitoring tasks
Sen Li
Xu Yang
Anye Cao
Changbin Wang
Yaoqi Liu
Yapeng Liu
Qiang Niu
20
3
0
02 Oct 2023
FreeU: Free Lunch in Diffusion U-Net
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
38
128
0
20 Sep 2023
RMT: Retentive Networks Meet Vision Transformers
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
30
73
0
20 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with
  CNN-Transformer Hybrid Framework
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
19
4
0
16 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor
  Formula for Image Dehazing
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
29
82
0
27 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
26
20
0
27 Aug 2023
EFormer: Enhanced Transformer towards Semantic-Contour Features of
  Foreground for Portraits Matting
EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
Zitao Wang
Qiguang Miao
Peipei Zhao
Yue Xi
ViT
30
2
0
24 Aug 2023
Local Distortion Aware Efficient Transformer Adaptation for Image
  Quality Assessment
Local Distortion Aware Efficient Transformer Adaptation for Image Quality Assessment
Kangmin Xu
Liang Liao
Jing Xiao
Chaofeng Chen
Haoning Wu
Qiong Yan
Weisi Lin
ViT
13
5
0
23 Aug 2023
Multi-Architecture Multi-Expert Diffusion Models
Multi-Architecture Multi-Expert Diffusion Models
Yunsung Lee
Jin-Young Kim
Hyojun Go
Myeongho Jeong
Shinhyeok Oh
Seungtaek Choi
DiffM
26
29
0
08 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
31
27
0
01 Jun 2023
Dual Path Transformer with Partition Attention
Dual Path Transformer with Partition Attention
Zhengkai Jiang
Liang Liu
Jiangning Zhang
Yabiao Wang
Mingang Chen
Chengjie Wang
ViT
31
2
0
24 May 2023
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image
  Segmentation
FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation
Wenxuan Wang
Jing Wang
C. L. P. Chen
Jianbo Jiao
Yuanxiu Cai
Shanshan Song
Jiangyun Li
MedIm
18
17
0
21 Apr 2023
SpectFormer: Frequency and Attention is what you need in a Vision
  Transformer
SpectFormer: Frequency and Attention is what you need in a Vision Transformer
Badri N. Patro
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
ViT
22
47
0
13 Apr 2023
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention
  and Residual Connection in Kernel Space
Dynamic Mobile-Former: Strengthening Dynamic Convolution with Attention and Residual Connection in Kernel Space
Seokju Yun
Youngmin Ro
ViT
11
2
0
13 Apr 2023
APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud
  Understanding
APPT : Asymmetric Parallel Point Transformer for 3D Point Cloud Understanding
Hengjia Li
Tu Zheng
Zhihao Chi
Zheng Yang
Wenxiao Wang
Boxi Wu
Binbin Lin
Deng Cai
3DPC
30
1
0
31 Mar 2023
12
Next