ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.10270
  4. Cited By
How to train your ViT? Data, Augmentation, and Regularization in Vision
  Transformers

How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers

18 June 2021
Andreas Steiner
Alexander Kolesnikov
Xiaohua Zhai
Ross Wightman
Jakob Uszkoreit
Lucas Beyer
    ViT
ArXivPDFHTML

Papers citing "How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers"

50 / 415 papers shown
Title
Histo-Miner: Deep Learning based Tissue Features Extraction Pipeline from H&E Whole Slide Images of Cutaneous Squamous Cell Carcinoma
Histo-Miner: Deep Learning based Tissue Features Extraction Pipeline from H&E Whole Slide Images of Cutaneous Squamous Cell Carcinoma
Lucas Sancéré
Carina Lorenz
Doris Helbig
Oana-Diana Persa
Sonja Dengler
...
Martim Laimer
Anne Fröhlich
Jennifer Landsberg
Johannes Brägelmann
Katarzyna Bozek
33
0
0
07 May 2025
Improving the Reproducibility of Deep Learning Software: An Initial Investigation through a Case Study Analysis
Improving the Reproducibility of Deep Learning Software: An Initial Investigation through a Case Study Analysis
Nikita Ravi
Abhinav Goel
James C. Davis
George K. Thiruvathukal
35
0
0
06 May 2025
Always Skip Attention
Always Skip Attention
Yiping Ji
Hemanth Saratchandran
Peyman Moghaddam
Simon Lucey
55
0
0
04 May 2025
Transformer-Empowered Actor-Critic Reinforcement Learning for Sequence-Aware Service Function Chain Partitioning
Transformer-Empowered Actor-Critic Reinforcement Learning for Sequence-Aware Service Function Chain Partitioning
Cyril Shih-Huan Hsu
Anestis Dalgkitsis
Chrysa Papagianni
Paola Grosso
12
0
0
26 Apr 2025
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
The effects of Hessian eigenvalue spectral density type on the applicability of Hessian analysis to generalization capability assessment of neural networks
Nikita Gabdullin
16
0
0
24 Apr 2025
LIFT+: Lightweight Fine-Tuning for Long-Tail Learning
LIFT+: Lightweight Fine-Tuning for Long-Tail Learning
Jiang-Xin Shi
Tong Wei
Yu-Feng Li
25
0
0
17 Apr 2025
The Impact of Model Zoo Size and Composition on Weight Space Learning
The Impact of Model Zoo Size and Composition on Weight Space Learning
Damian Falk
Konstantin Schurholt
Damian Borth
32
0
0
14 Apr 2025
FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
Hezhao Liu
Yang Lu
Mengke Li
Yiqun Zhang
Shreyank N Gowda
Chen Gong
Hanzi Wang
29
0
0
14 Apr 2025
A Model Zoo of Vision Transformers
A Model Zoo of Vision Transformers
Damian Falk
Léo Meynent
Florence Pfammatter
Konstantin Schurholt
Damian Borth
32
0
0
14 Apr 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
Niu Lian
Jun Li
Jinpeng Wang
Ruisheng Luo
Yaowei Wang
Shu-Tao Xia
Bin Chen
44
0
0
04 Apr 2025
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Junzhu Mao
Yang Shen
Jinyang Guo
Yazhou Yao
Xiansheng Hua
ViT
31
0
0
30 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
67
0
0
27 Mar 2025
Improving Food Image Recognition with Noisy Vision Transformer
Improving Food Image Recognition with Noisy Vision Transformer
Tonmoy Ghosh
Edward Sazonov
ViT
31
0
0
24 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
54
0
0
21 Mar 2025
Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation
Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation
Deyi Ji
Feng Zhao
Hongtao Lu
Feng Wu
Jieping Ye
68
2
0
11 Mar 2025
VORTEX: Challenging CNNs at Texture Recognition by using Vision Transformers with Orderless and Randomized Token Encodings
Leonardo F. S. Scabini
Kallil M. C. Zielinski
Emir Konuk
Ricardo T. Fares
L. C. Ribas
Kevin Smith
Odemir M. Bruno
ViT
41
0
0
09 Mar 2025
Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis
Yitao Zhu
Yuan Yin
Jiaming Li
Mengjie Xu
Zihao Zhao
Honglin Xiong
Sheng Wang
Qian Wang
MedIm
65
0
0
03 Mar 2025
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images
N. Shvetsov
T. Kilvaer
M. Tafavvoghi
Anders Sildnes
Kajsa Møllersen
Lill-ToveRasmussen Busund
L. A. Bongo
VLM
66
1
0
26 Feb 2025
Escaping The Big Data Paradigm in Self-Supervised Representation Learning
Escaping The Big Data Paradigm in Self-Supervised Representation Learning
Carlos Vélez García
Miguel Cazorla
Jorge Pomares
43
0
0
25 Feb 2025
Simpler Fast Vision Transformers with a Jumbo CLS Token
Simpler Fast Vision Transformers with a Jumbo CLS Token
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
67
0
0
24 Feb 2025
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations
Krishna Sri Ipsit Mantri
Carola-Bibiane Schönlieb
Bruno Ribeiro
Chaim Baskin
Moshe Eliasof
38
0
0
09 Feb 2025
A generalizable 3D framework and model for self-supervised learning in medical imaging
A generalizable 3D framework and model for self-supervised learning in medical imaging
Tony Xu
Sepehr Hosseini
Chris Anderson
Anthony Rinaldi
Rahul G. Krishnan
Anne L. Martel
Maged Goubran
MedIm
29
3
0
20 Jan 2025
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?
Wenxuan Li
Alan L. Yuille
Zongwei Zhou
MedIm
41
8
0
20 Jan 2025
A Room to Roam: Reset Prediction Based on Physical Object Placement for
  Redirected Walking
A Room to Roam: Reset Prediction Based on Physical Object Placement for Redirected Walking
Sulim Chun
Ho Jung Lee
In-Kwon Lee
28
0
0
23 Dec 2024
No More Adam: Learning Rate Scaling at Initialization is All You Need
No More Adam: Learning Rate Scaling at Initialization is All You Need
Minghao Xu
Lichuan Xiang
Xu Cai
Hongkai Wen
73
2
0
16 Dec 2024
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
Fiona Ryan
Ajay Bati
Sangmin Lee
Daniel Bolya
Judy Hoffman
James M. Rehg
90
2
0
12 Dec 2024
Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning
Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning
Z. Wang
C. J. Li
QiXiang Ye
Tong Zhang
MoE
67
1
0
03 Dec 2024
Dual-Representation Interaction Driven Image Quality Assessment with
  Restoration Assistance
Dual-Representation Interaction Driven Image Quality Assessment with Restoration Assistance
Jingtong Yue
Xin Lin
Zijiu Yang
Chao Ren
78
0
0
26 Nov 2024
Semantic Shield: Defending Vision-Language Models Against Backdooring
  and Poisoning via Fine-grained Knowledge Alignment
Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment
Alvi Md Ishmam
Christopher Thomas
AAML
110
3
0
23 Nov 2024
Federated Learning Client Pruning for Noisy Labels
Federated Learning Client Pruning for Noisy Labels
Mahdi Morafah
Hojin Chang
C. L. P. Chen
Bill Lin
30
0
0
11 Nov 2024
Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in
  Off-Road Environments
Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments
Deegan Atha
Xianmei Lei
Shehryar Khattak
Anna Sabel
Elle Miller
Aurelio Noca
Grace Lim
J. Edlund
Curtis Padgett
Patrick Spieler
29
2
0
10 Nov 2024
Feature Fusion Transferability Aware Transformer for Unsupervised Domain
  Adaptation
Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation
Xiaowei Yu
Zhe Huang
Zao Zhang
ViT
21
1
0
10 Nov 2024
Learning Where to Edit Vision Transformers
Learning Where to Edit Vision Transformers
Yunqiao Yang
Long-Kai Huang
Shengzhuang Chen
Kede Ma
Ying Wei
KELM
23
1
0
04 Nov 2024
Efficient Adaptation of Pre-trained Vision Transformer via Householder
  Transformation
Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
Wei Dong
Yuan Sun
Yiting Yang
Xing Zhang
Zhijun Lin
Qingsen Yan
H. Zhang
Peng Wang
Yang Yang
Hengtao Shen
26
0
0
30 Oct 2024
Rethinking Softmax: Self-Attention with Polynomial Activations
Rethinking Softmax: Self-Attention with Polynomial Activations
Hemanth Saratchandran
Jianqiao Zheng
Yiping Ji
Wenbo Zhang
Simon Lucey
16
3
0
24 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a
  resource-limited Context
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
20
0
0
23 Oct 2024
Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to
  Boost Semi-Supervised Facial Expression Recognition
Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression Recognition
Jie Song
Mengqiao He
Jinhua Feng
B. S.
18
0
0
23 Oct 2024
DiRecNetV2: A Transformer-Enhanced Network for Aerial Disaster
  Recognition
DiRecNetV2: A Transformer-Enhanced Network for Aerial Disaster Recognition
Demetris Shianios
Panayiotis Kolios
Christos Kyrkou
21
3
0
17 Oct 2024
The Ingredients for Robotic Diffusion Transformers
The Ingredients for Robotic Diffusion Transformers
Sudeep Dasari
Oier Mees
Sebastian Zhao
M. K. Srirama
Sergey Levine
46
19
0
14 Oct 2024
Enhancing Performance of Point Cloud Completion Networks with Consistency Loss
Enhancing Performance of Point Cloud Completion Networks with Consistency Loss
Christofel Rio Goenawan
Kevin Tirta Wijaya
Seung-Hyun Kong
3DPC
56
1
0
09 Oct 2024
MatMamba: A Matryoshka State Space Model
MatMamba: A Matryoshka State Space Model
Abhinav Shukla
Sai H. Vemprala
Aditya Kusupati
Ashish Kapoor
Mamba
28
0
0
09 Oct 2024
Foldable SuperNets: Scalable Merging of Transformers with Different
  Initializations and Tasks
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks
Edan Kinderman
Itay Hubara
Haggai Maron
Daniel Soudry
MoMe
45
0
0
02 Oct 2024
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
Yao Ni
Shan Zhang
Piotr Koniusz
55
2
0
25 Sep 2024
Hyperbolic Image-and-Pointcloud Contrastive Learning for 3D
  Classification
Hyperbolic Image-and-Pointcloud Contrastive Learning for 3D Classification
Naiwen Hu
Haozhe Cheng
Yifan Xie
Pengcheng Shi
Jihua Zhu
3DPC
24
0
0
24 Sep 2024
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection
Shizhou Zhang
Dexuan Kong
Yinghui Xing
Yue Lu
Lingyan Ran
Guoqiang Liang
Hexu Wang
Yanning Zhang
25
5
0
19 Sep 2024
Agglomerative Token Clustering
Agglomerative Token Clustering
Joakim Bruslund Haurum
Sergio Escalera
Graham W. Taylor
T. Moeslund
24
1
0
18 Sep 2024
Token Turing Machines are Efficient Vision Models
Token Turing Machines are Efficient Vision Models
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiravathukal
James C. Davis
Yung-Hsiang Lu
80
0
0
11 Sep 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
29
0
0
10 Sep 2024
Input Space Mode Connectivity in Deep Neural Networks
Input Space Mode Connectivity in Deep Neural Networks
Jakub Vrabel
Ori Shem-Ur
Yaron Oz
David Krueger
40
1
0
09 Sep 2024
Weight Conditioning for Smooth Optimization of Neural Networks
Weight Conditioning for Smooth Optimization of Neural Networks
Hemanth Saratchandran
Thomas X. Wang
Simon Lucey
33
0
0
05 Sep 2024
123456789
Next