ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.08810
  4. Cited By
Do Vision Transformers See Like Convolutional Neural Networks?

Do Vision Transformers See Like Convolutional Neural Networks?

19 August 2021
M. Raghu
Thomas Unterthiner
Simon Kornblith
Chiyuan Zhang
Alexey Dosovitskiy
    ViT
ArXivPDFHTML

Papers citing "Do Vision Transformers See Like Convolutional Neural Networks?"

50 / 440 papers shown
Title
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic
  Segmentation
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation
Soojin Jang
Jungmin Yun
Junehyoung Kwon
Eunju Lee
Youngbin Kim
38
3
0
24 Sep 2024
Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
Suayb S. Arslan
23
2
0
24 Sep 2024
Are Music Foundation Models Better at Singing Voice Deepfake Detection?
  Far-Better Fuse them with Speech Foundation Models
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Orchid Chetia Phukan
Sarthak Jain
Swarup Ranjan Behera
Arun Balaji Buduru
Rajesh Sharma
S. R Mahadeva Prasanna
26
0
0
21 Sep 2024
Agglomerative Token Clustering
Agglomerative Token Clustering
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
31
1
0
18 Sep 2024
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision
  Mamba and Transformer Networks
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
53
5
0
15 Sep 2024
Approximating Metric Magnitude of Point Sets
Approximating Metric Magnitude of Point Sets
R. Andreeva
James Ward
Primoz Skraba
Jie Gao
Rik Sarkar
21
1
0
06 Sep 2024
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For
  Efficient Medical Image Segmentation
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation
Shehan Perera
Yunus Erzurumlu
Deepak Gulati
Alper Yilmaz
ViT
MedIm
19
0
0
04 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks:
  Capturing Global Context and Spatial Relationships
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
32
7
0
27 Aug 2024
Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning
Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning
Alessio Devoto
Federico Alvetreti
Jary Pomponi
P. Lorenzo
Pasquale Minervini
Simone Scardapane
49
2
0
16 Aug 2024
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and
  Consistency-Enhanced MAE
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE
Yiying Yang
Fukun Yin
Jiayuan Fan
Xin Chen
Wanzhang Li
Gang Yu
VGen
44
0
0
10 Aug 2024
M2EF-NNs: Multimodal Multi-instance Evidence Fusion Neural Networks for
  Cancer Survival Prediction
M2EF-NNs: Multimodal Multi-instance Evidence Fusion Neural Networks for Cancer Survival Prediction
Hui Luo
Jiashuang Huang
Hengrong Ju
Tianyi Zhou
Weiping Ding
28
0
0
08 Aug 2024
A Survey on Cell Nuclei Instance Segmentation and Classification:
  Leveraging Context and Attention
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
49
1
0
26 Jul 2024
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
Liangyan Jiang
Chuang Zhu
Yanxu Chen
46
2
0
22 Jul 2024
FairViT: Fair Vision Transformer via Adaptive Masking
FairViT: Fair Vision Transformer via Adaptive Masking
Bowei Tian
Ruijie Du
Yanning Shen
19
1
0
20 Jul 2024
DuoFormer: Leveraging Hierarchical Visual Representations by Local and
  Global Attention
DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
Xiaoya Tang
Bodong Zhang
Beatrice S. Knudsen
Tolga Tasdizen
ViT
MedIm
40
1
0
18 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on
  Robustness
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
30
5
0
12 Jul 2024
Knowledge distillation to effectively attain both region-of-interest and
  global semantics from an image where multiple objects appear
Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear
Seonwhee Jin
18
0
0
11 Jul 2024
HAFormer: Unleashing the Power of Hierarchy-Aware Features for
  Lightweight Semantic Segmentation
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation
Guoan Xu
Wenjing Jia
Tao Wu
Ligeng Chen
Guangwei Gao
ViT
30
9
0
10 Jul 2024
Understanding Visual Feature Reliance through the Lens of Complexity
Understanding Visual Feature Reliance through the Lens of Complexity
Thomas Fel
Louis Bethune
Andrew Kyle Lampinen
Thomas Serre
Katherine Hermann
FAtt
CoGe
30
6
0
08 Jul 2024
Topological Persistence Guided Knowledge Distillation for Wearable
  Sensor Data
Topological Persistence Guided Knowledge Distillation for Wearable Sensor Data
Eun Som Jeon
Hongjun Choi
A. Shukla
Yuan Wang
Hyunglae Lee
M. Buman
P. Turaga
27
3
0
07 Jul 2024
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring
  Expression Segmentation
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
Sayan Nag
Koustava Goswami
Srikrishna Karanam
42
2
0
02 Jul 2024
How Does Overparameterization Affect Features?
How Does Overparameterization Affect Features?
Ahmet Cagri Duzgun
Samy Jelassi
Yuanzhi Li
23
0
0
01 Jul 2024
Structural Attention: Rethinking Transformer for Unpaired Medical Image
  Synthesis
Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis
Vu Minh Hieu Phan
Yutong Xie
Bowen Zhang
Yuankai Qi
Zhibin Liao
Antonios Perperidis
S. L. Phung
Johan W. Verjans
Minh Nguyen Nhat To
MedIm
37
4
0
27 Jun 2024
MD tree: a model-diagnostic tree grown on loss landscape
MD tree: a model-diagnostic tree grown on loss landscape
Yefan Zhou
Jianlong Chen
Qinxue Cao
Konstantin Schürholt
Yaoqing Yang
29
2
0
24 Jun 2024
Beyond the Doors of Perception: Vision Transformers Represent Relations
  Between Objects
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
Michael A. Lepori
Alexa R. Tartaglini
Wai Keen Vong
Thomas Serre
Brenden Lake
Ellie Pavlick
36
2
0
22 Jun 2024
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Lorenzo Basile
Santiago Acevedo
Luca Bortolussi
Fabio Anselmi
Alex Rodriguez
36
4
0
22 Jun 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
67
13
0
20 Jun 2024
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in
  Multimodal Large Language Model
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
Jiahao Huo
Yibo Yan
Boren Hu
Yutao Yue
Xuming Hu
LRM
MLLM
32
7
0
17 Jun 2024
OT-VP: Optimal Transport-guided Visual Prompting for Test-Time
  Adaptation
OT-VP: Optimal Transport-guided Visual Prompting for Test-Time Adaptation
Yunbei Zhang
Akshay Mehra
Jihun Hamm
VLM
34
2
0
12 Jun 2024
U-KAN Makes Strong Backbone for Medical Image Segmentation and
  Generation
U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
Chenxin Li
Xinyu Liu
W. J. Li
Cheng Wang
Hengyu Liu
Yifan Liu
Zhen Chen
Yixuan Yuan
MedIm
DiffM
SSeg
46
114
0
05 Jun 2024
Improving Object Detector Training on Synthetic Data by Starting With a
  Strong Baseline Methodology
Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology
Frank Ruis
Alma M. Liezenga
Friso G. Heslinga
Luca Ballan
Thijs A. Eker
Richard J. M. den Hollander
Martin C. van Leeuwen
Judith Dijk
Wyke Huizinga
31
4
0
30 May 2024
What Variables Affect Out-Of-Distribution Generalization in Pretrained
  Models?
What Variables Affect Out-Of-Distribution Generalization in Pretrained Models?
Md Yousuf Harun
Kyungbok Lee
Jhair Gallardo
Giri Krishnan
Christopher Kanan
31
2
0
23 May 2024
LookHere: Vision Transformers with Directed Attention Generalize and
  Extrapolate
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
36
2
0
22 May 2024
Scaling-laws for Large Time-series Models
Scaling-laws for Large Time-series Models
Thomas D. P. Edwards
James Alvey
Justin Alsing
Nam H. Nguyen
Benjamin Dan Wandelt
AI4TS
AIFin
31
7
0
22 May 2024
Weakly supervised alignment and registration of MR-CT for cervical
  cancer radiotherapy
Weakly supervised alignment and registration of MR-CT for cervical cancer radiotherapy
Jjahao Zhang
Yin Gu
Deyu Sun
Yuhua Gao
Ming Gao
Ming Cui
Teng Zhang
He Ma
29
0
0
21 May 2024
Data Science Principles for Interpretable and Explainable AI
Data Science Principles for Interpretable and Explainable AI
Kris Sankaran
FaML
38
0
0
17 May 2024
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual
  Backbone Training
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training
Yulin Wang
Yang Yue
Rui Lu
Yizeng Han
Shiji Song
Gao Huang
VLM
56
12
0
14 May 2024
Language-Enhanced Latent Representations for Out-of-Distribution
  Detection in Autonomous Driving
Language-Enhanced Latent Representations for Out-of-Distribution Detection in Autonomous Driving
Zhenjiang Mao
Dong-You Jhong
Ao Wang
Ivan Ruchkin
OODD
31
2
0
02 May 2024
Saliency Suppressed, Semantics Surfaced: Visual Transformations in
  Neural Networks and the Brain
Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain
Gustaw Opielka
Jessica Loke
Steven Scholte
21
0
0
29 Apr 2024
Attention-aware non-rigid image registration for accelerated MR imaging
Attention-aware non-rigid image registration for accelerated MR imaging
Aya Ghoul
Jiazhen Pan
Andreas Lingg
J. Kübler
Patrick Krumm
Kerstin Hammernik
Daniel Rueckert
S. Gatidis
Thomas Kustner
MedIm
32
7
0
26 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision
  Transformers
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Joey Tianyi Zhou
ViT
42
3
0
21 Apr 2024
HSViT: Horizontally Scalable Vision Transformer
HSViT: Horizontally Scalable Vision Transformer
Chenhao Xu
Chang-Tsun Li
Chee Peng Lim
Douglas Creighton
ViT
27
2
0
08 Apr 2024
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing
  Sparsity, Trained from Scratch on Small Datasets
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets
Dongjing Shan
guiqiang chen
ViT
37
0
0
07 Apr 2024
Vision Transformers in Domain Adaptation and Generalization: A Study of
  Robustness
Vision Transformers in Domain Adaptation and Generalization: A Study of Robustness
Shadi Alijani
Jamil Fayyad
H. Najjaran
OOD
27
9
0
05 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation
  Learning for Neural Radiance Fields
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakahrov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
37
12
0
01 Apr 2024
VideoDistill: Language-aware Vision Distillation for Video Question
  Answering
VideoDistill: Language-aware Vision Distillation for Video Question Answering
Bo Zou
Chao Yang
Yu Qiao
Chengbin Quan
Youjian Zhao
VGen
42
1
0
01 Apr 2024
Rethinking Information Loss in Medical Image Segmentation with
  Various-sized Targets
Rethinking Information Loss in Medical Image Segmentation with Various-sized Targets
Tianyi Liu
Zhaorui Tan
Kaizhu Huang
Haochuan Jiang
25
0
0
28 Mar 2024
A Personalized Video-Based Hand Taxonomy: Application for Individuals
  with Spinal Cord Injury
A Personalized Video-Based Hand Taxonomy: Application for Individuals with Spinal Cord Injury
Mehdy Dousty
David J. Fleet
José Zariffa
14
0
0
26 Mar 2024
AOCIL: Exemplar-free Analytic Online Class Incremental Learning with Low
  Time and Resource Consumption
AOCIL: Exemplar-free Analytic Online Class Incremental Learning with Low Time and Resource Consumption
Huiping Zhuang
Yuchen Liu
Run He
Kai Tong
Ziqian Zeng
Cen Chen
Yi Wang
Lap-Pui Chau
CLL
33
1
0
23 Mar 2024
Do not trust what you trust: Miscalibration in Semi-supervised Learning
Do not trust what you trust: Miscalibration in Semi-supervised Learning
Shambhavi Mishra
Balamurali Murugesan
Ismail Ben Ayed
M. Pedersoli
Jose Dolz
38
2
0
22 Mar 2024
Previous
123456789
Next