ResearchTrend.AI
Blending Anti-Aliasing into Vision Transformer

28 October 2021
Shengju Qian
Hao Shao
Yi Zhu
Mu Li
Jiaya Jia

Papers citing "Blending Anti-Aliasing into Vision Transformer"

19 / 19 papers shown
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
32
0
0
31 Mar 2025
Universal Functional Regression with Neural Operator Flows
Yaozhong Shi
Angela F. Gao
Zachary E. Ross
Kamyar Azizzadenesheli
31
3
0
03 Apr 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Stephanie Fu
Mark Hamilton
Laura E. Brandt
Axel Feldmann
Zhoutong Zhang
William T. Freeman
MDE
27
49
0
15 Mar 2024
When Semantic Segmentation Meets Frequency Aliasing
Linwei Chen
Lin Gu
Ying Fu
27
5
0
14 Mar 2024
Frequency-Adaptive Dilated Convolution for Semantic Segmentation
Linwei Chen
Lin Gu
Ying Fu
18
21
0
08 Mar 2024
Pre-trained Transformer-Enabled Strategies with Human-Guided Fine-Tuning for End-to-end Navigation of Autonomous Vehicles
Dong Hu
Chao Huang
Jingda Wu
Hongbo Gao
25
5
0
20 Feb 2024
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
11
8
0
22 Aug 2023
Optimizing PatchCore for Few/many-shot Anomaly Detection
Joao Santos
Triet Tran
Oliver Rippel
16
10
0
20 Jul 2023
ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Hao Shao
Letian Wang
Ruobing Chen
Steven L. Waslander
Hongsheng Li
Y. Liu
LRM
22
69
0
17 May 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
C. L. P. Chen
Mu Li
ViT
39
143
0
06 Feb 2023
What Makes for Good Tokenizers in Vision Transformer?
Shengju Qian
Yi Zhu
Wenbo Li
Mu Li
Jiaya Jia
ViT
29
13
0
21 Dec 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
28
194
0
28 Jul 2022
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
239
2,554
0
04 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
136
193
0
23 Apr 2021
On Aliased Resizing and Surprising Subtleties in GAN Evaluation
Gaurav Parmar
Richard Y. Zhang
Jun-Yan Zhu
EGVM
22
74
0
22 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
249
1,817
0
18 Aug 2016