Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2012.12556
Cited By

A Survey on Visual Transformer

v1v2v3v4v5v6 (latest)

A Survey on Visual Transformer

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

23 December 2020

ArXiv (abs)PDF HTML

Papers citing "A Survey on Visual Transformer"

50 / 564 papers shown

FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning

FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning

199

0

0

30 Mar 2026

Temp-SCONE: A Novel Out-of-Distribution Detection and Domain Generalization Framework for Wild Data with Temporal Shift

Temp-SCONE: A Novel Out-of-Distribution Detection and Domain Generalization Framework for Wild Data with Temporal Shift

Aditi Naiknaware

Hajar Homayouni

161

1

0

04 Dec 2025

Unrolled Networks are Conditional Probability Flows in MRI Reconstruction

Unrolled Networks are Conditional Probability Flows in MRI Reconstruction

Chao Chen

360

0

0

02 Dec 2025

Benchmarking machine learning models for multi-class state recognition in double quantum dot data

Benchmarking machine learning models for multi-class state recognition in double quantum dot data

Valeria Díaz Moreno

Patrick J. Walsh

Justyna P. Zwolak

176

1

0

27 Nov 2025

Collaborative Learning with Multiple Foundation Models for Source-Free Domain Adaptation

Collaborative Learning with Multiple Foundation Models for Source-Free Domain Adaptation

314

0

0

24 Nov 2025

From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers

From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers

165

1

0

19 Nov 2025

Naga: Vedic Encoding for Deep State Space Models

Naga: Vedic Encoding for Deep State Space Models

Melanie Schaller

241

0

0

17 Nov 2025

Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization

Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization

246

0

0

15 Nov 2025

TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English

TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English

Salima Mdhaffar

66

0

0

13 Nov 2025

Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks

Densemarks: Learning Canonical Embeddings for Human Heads Images via Point Tracks

Dmitrii Pozdeev

Artem Sevastopolsky

297

0

0

04 Nov 2025

REASON: Probability map-guided dual-branch fusion framework for gastric content assessment

REASON: Probability map-guided dual-branch fusion framework for gastric content assessment

...

194

0

0

03 Nov 2025

CoMViT: An Efficient Vision Backbone for Supervised Classification in Medical Imaging

CoMViT: An Efficient Vision Backbone for Supervised Classification in Medical Imaging

Mohamed Saadeldin

121

0

0

31 Oct 2025

Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation

Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age EstimationComputer Vision and Image Understanding (CVIU), 2025

Salah Eddine Bekhouche

154

1

0

31 Oct 2025

Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems

Mixture-of-Transformers Learn Faster: A Theoretical Study on Classification Problems

213

0

0

30 Oct 2025

CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation

CLFSeg: A Fuzzy-Logic based Solution for Boundary Clarity and Uncertainty Reduction in Medical Image Segmentation

118

0

0

28 Oct 2025

VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models

VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models

Jesimon Barreto

William Robson Schwartz

165

0

0

23 Oct 2025

BrainPuzzle: Hybrid Physics and Data-Driven Reconstruction for Transcranial Ultrasound Tomography

BrainPuzzle: Hybrid Physics and Data-Driven Reconstruction for Transcranial Ultrasound Tomography

165

0

0

22 Oct 2025

ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification

ArmFormer: Lightweight Transformer Architecture for Real-Time Multi-Class Weapon Segmentation and Classification

Akhila Kambhatla

212

0

0

19 Oct 2025

Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection

Cross-Layer Feature Self-Attention Module for Multi-Scale Object Detection

194

0

0

16 Oct 2025

Minkowski-MambaNet: A Point Cloud Framework with Selective State Space Models for Forest Biomass Quantification

Minkowski-MambaNet: A Point Cloud Framework with Selective State Space Models for Forest Biomass Quantification

154

0

0

10 Oct 2025

Data driven approaches in nanophotonics: A review of AI-enabled metadevices

Data driven approaches in nanophotonics: A review of AI-enabled metadevices

Sawyer D. Campbell

Douglas H. Werner

252

4

0

30 Sep 2025

Causally Guided Gaussian Perturbations for Out-Of-Distribution Generalization in Medical Imaging

Causally Guided Gaussian Perturbations for Out-Of-Distribution Generalization in Medical Imaging

OOD OODD CML MedIm

251

0

0

30 Sep 2025

When MLLMs Meet Compression Distortion: A Coding Paradigm Tailored to MLLMs

When MLLMs Meet Compression Distortion: A Coding Paradigm Tailored to MLLMs

149

2

0

29 Sep 2025

OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving

OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving

241

3

0

24 Sep 2025

Lightweight Vision Transformer with Window and Spatial Attention for Food Image Classification

Lightweight Vision Transformer with Window and Spatial Attention for Food Image Classification

93

2

0

23 Sep 2025

Towards a Transparent and Interpretable AI Model for Medical Image Classifications

Towards a Transparent and Interpretable AI Model for Medical Image ClassificationsCognitive Neurodynamics (Cogn Neurodyn), 2025

164

0

0

20 Sep 2025

Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment

Learning Hyperspectral Images with Curated Text Prompts for Efficient Multimodal Alignment

Abhiroop Chatterjee

Susmita K. Ghosh

128

1

0

20 Sep 2025

Sequential Token Merging: Revisiting Hidden States

Sequential Token Merging: Revisiting Hidden States

178

0

0

19 Sep 2025

Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model

Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model

Longin Jan Latecki

263

0

0

19 Sep 2025

Which Direction to Choose? An Analysis on the Representation Power of Self-Supervised ViTs in Downstream Tasks

Which Direction to Choose? An Analysis on the Representation Power of Self-Supervised ViTs in Downstream Tasks

Yannis Kaltampanidis

Alexandros Doumanoglou

226

1

0

18 Sep 2025

A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts

A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts

George Correa de Araujo

196

0

0

17 Sep 2025

FusionMAE: large-scale pretrained model to optimize and simplify diagnostic and control of fusion plasma

FusionMAE: large-scale pretrained model to optimize and simplify diagnostic and control of fusion plasma

...

247

0

0

16 Sep 2025

Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images

Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images

504

0

0

12 Sep 2025

Dynamic Structural Recovery Parameters Enhance Prediction of Visual Outcomes After Macular Hole Surgery

Dynamic Structural Recovery Parameters Enhance Prediction of Visual Outcomes After Macular Hole Surgery

Louisa Sackewitz

Peter Charbel Issa

135

0

0

11 Sep 2025

E2E Learning Massive MIMO for Multimodal Semantic Non-Orthogonal Transmission and Fusion

E2E Learning Massive MIMO for Multimodal Semantic Non-Orthogonal Transmission and Fusion

242

0

0

09 Sep 2025

Comparative Analysis of Transformer Models in Disaster Tweet Classification for Public Safety

Comparative Analysis of Transformer Models in Disaster Tweet Classification for Public Safety

Sharif Noor Zisad

N. M. Istiak Chowdhury

273

1

0

04 Sep 2025

Multimodal Feature Fusion Network with Text Difference Enhancement for Remote Sensing Change Detection

Multimodal Feature Fusion Network with Text Difference Enhancement for Remote Sensing Change Detection

Hongsheng Zhang

C. L. Philip Chen

191

4

0

04 Sep 2025

SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization

SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization

127

0

0

27 Aug 2025

A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection

A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection

226

0

0

22 Aug 2025

A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives

A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives

146

2

0

20 Aug 2025

CuMoLoS-MAE: A Masked Autoencoder for Remote Sensing Data Reconstruction

CuMoLoS-MAE: A Masked Autoencoder for Remote Sensing Data Reconstruction

Nathanael Zhixin Wong

130

0

0

20 Aug 2025

Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction

Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction

Matteo Fusaglia

Françoise J. Siepel

128

0

0

15 Aug 2025

Edge General Intelligence Through World Models and Agentic AI: Fundamentals, Solutions, and Challenges

Edge General Intelligence Through World Models and Agentic AI: Fundamentals, Solutions, and Challenges

...

260

11

0

13 Aug 2025

Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology

Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology

Jonathan Williams Ramirez

Dina Zemlyanker

Lucas Jacob Deden Binder

Erendira Garcia Pallares

...

Derek H. Oakley

Bradley T. Hyman

Juan Eugenio Iglesias

131

0

0

13 Aug 2025

Aligning Effective Tokens with Video Anomaly in Large Language Models

Aligning Effective Tokens with Video Anomaly in Large Language Models

261

4

0

08 Aug 2025

Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models

Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models

Freida Barnatan

Emunah Goldstein

Yaákov Mandelbaum

133

1

0

05 Aug 2025

SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral

SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral

Zhaoxiang Zhang

238

2

0

04 Aug 2025

Multimodal Large Language Models for End-to-End Affective Computing: Benchmarking and Boosting with Generative Knowledge Prompting

Multimodal Large Language Models for End-to-End Affective Computing: Benchmarking and Boosting with Generative Knowledge Prompting

285

3

0

04 Aug 2025

Large AI Model-Enabled Secure Communications in Low-Altitude Wireless Networks: Concepts, Perspectives and Case Study

Large AI Model-Enabled Secure Communications in Low-Altitude Wireless Networks: Concepts, Perspectives and Case Study

191

1

0

01 Aug 2025

A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles

A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles

Farhad Pourpanah

Q. M. Jonathan Wu

177

1

0

29 Jul 2025

1 2 3 4...10 11 12

Page 1 of 12

Pageof 12