ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.07990
  4. Cited By
Patch-level Representation Learning for Self-supervised Vision
  Transformers

Patch-level Representation Learning for Self-supervised Vision Transformers

16 June 2022
Sukmin Yun
Hankook Lee
Jaehyung Kim
Jinwoo Shin
    ViT
ArXivPDFHTML

Papers citing "Patch-level Representation Learning for Self-supervised Vision Transformers"

50 / 50 papers shown
Title
AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images
AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images
Yihang Liu
Lianghua He
Y. Wen
Longzhen Yang
Hongzhou Chen
MedIm
29
0
0
15 Apr 2025
Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
Seil Kang
Jinyeong Kim
Junhyeok Kim
Seong Jae Hwang
VLM
85
2
0
08 Mar 2025
ACE: Anatomically Consistent Embeddings in Composition and Decomposition
ACE: Anatomically Consistent Embeddings in Composition and Decomposition
Ziyu Zhou
Haozhe Luo
M. Taher
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
MedIm
34
0
0
20 Jan 2025
A Review of Transformer-Based Models for Computer Vision Tasks:
  Capturing Global Context and Spatial Relationships
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
30
7
0
27 Aug 2024
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Unsqueeze [CLS] Bottleneck to Learn Rich Representations
Qing Su
Shihao Ji
24
0
0
24 Jul 2024
In-Context Learning Improves Compositional Understanding of
  Vision-Language Models
In-Context Learning Improves Compositional Understanding of Vision-Language Models
Matteo Nulli
Anesa Ibrahimi
Avik Pal
Hoshe Lee
Ivona Najdenkoska
VLM
CoGe
30
0
0
22 Jul 2024
Temporal Representation Learning for Stock Similarities and Its
  Applications in Investment Management
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoon-Jeong Hwang
Stefan Zohren
Yongjae Lee
AIFin
29
1
0
18 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic
  Segmentation
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
33
0
0
17 Jul 2024
Multi-Grained Contrast for Data-Efficient Unsupervised Representation
  Learning
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning
Chengchao Shen
Jianzhong Chen
Jianxin Wang
SSL
20
0
0
02 Jul 2024
Liveness Detection in Computer Vision: Transformer-based Self-Supervised
  Learning for Face Anti-Spoofing
Liveness Detection in Computer Vision: Transformer-based Self-Supervised Learning for Face Anti-Spoofing
Arman Keresh
Pakizar Shamoi
33
5
0
19 Jun 2024
Semantic Graph Consistency: Going Beyond Patches for Regularizing
  Self-Supervised Vision Transformers
Semantic Graph Consistency: Going Beyond Patches for Regularizing Self-Supervised Vision Transformers
Chaitanya Devaguptapu
Sumukh K. Aithal
Shrinivas Ramasubramanian
Moyuru Yamada
Manohar Kaul
ViT
18
0
0
18 Jun 2024
DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive
  Architecture
DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture
Shentong Mo
Sukmin Yun
35
3
0
28 May 2024
I$^3$Net: Inter-Intra-slice Interpolation Network for Medical Slice
  Synthesis
I3^33Net: Inter-Intra-slice Interpolation Network for Medical Slice Synthesis
Haofei Song
Xintian Mao
Jing Yu
Qingli Li
Yan Wang
OOD
43
1
0
05 May 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT
  Descriptors
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Saksham Suri
Matthew Walmer
Kamal Gupta
Abhinav Shrivastava
33
4
0
21 Mar 2024
WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT
  Denoising
WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising
Haoyu Zhao
Yuliang Gu
Zhou Zhao
Bo Du
Yongchao Xu
Rui Yu
OOD
19
3
0
18 Mar 2024
Decoupled Contrastive Learning for Long-Tailed Recognition
Decoupled Contrastive Learning for Long-Tailed Recognition
Shiyu Xuan
Shiliang Zhang
36
5
0
10 Mar 2024
Leveraging Self-Supervised Instance Contrastive Learning for Radar
  Object Detection
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection
Colin Decourt
R. V. Rullen
D. Salle
Thomas Oberlin
SSL
22
0
0
13 Feb 2024
Learning Anatomically Consistent Embedding for Chest Radiography
Learning Anatomically Consistent Embedding for Chest Radiography
Ziyu Zhou
Haozhe Luo
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
SSL
9
5
0
01 Dec 2023
Understanding Self-Supervised Features for Learning Unsupervised
  Instance Segmentation
Understanding Self-Supervised Features for Learning Unsupervised Instance Segmentation
Paul Engstler
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
SSL
20
3
0
24 Nov 2023
Event Camera Data Dense Pre-training
Event Camera Data Dense Pre-training
Yan Yang
Liyuan Pan
Liu Liu
25
4
0
20 Nov 2023
Patch-Wise Self-Supervised Visual Representation Learning: A
  Fine-Grained Approach
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach
Ali Javidani
Mohammad Amin Sadeghi
Babak Nadjar Araabi
22
0
0
28 Oct 2023
CrIBo: Self-Supervised Learning via Cross-Image Object-Level
  Bootstrapping
CrIBo: Self-Supervised Learning via Cross-Image Object-Level Bootstrapping
Tim Lebailly
Thomas Stegmüller
Behzad Bozorgtabar
Jean-Philippe Thiran
Tinne Tuytelaars
SSL
45
6
0
11 Oct 2023
Attention De-sparsification Matters: Inducing Diversity in Digital
  Pathology Representation Learning
Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning
S. Kapse
Srijan Das
Jingwei Zhang
Rajarsi R. Gupta
Joel H. Saltz
Dimitris Samaras
Prateek Prasanna
25
9
0
12 Sep 2023
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped
  Positions
DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tong Wang
Zhaoxiang Zhang
20
19
0
07 Sep 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
18
2
0
23 Aug 2023
Time Does Tell: Self-Supervised Time-Tuning of Dense Image
  Representations
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations
Mohammadreza Salehi
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
VOS
30
19
0
22 Aug 2023
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and
  Bootstrapped Self-training
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training
Silky Singh
Shripad Deshmukh
Mausoom Sarkar
Balaji Krishnamurthy
16
7
0
22 Aug 2023
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Unsupervised Camouflaged Object Segmentation as Domain Adaptation
Yi Zhang
Chengyi Wu
26
3
0
08 Aug 2023
ASCON: Anatomy-aware Supervised Contrastive Learning Framework for
  Low-dose CT Denoising
ASCON: Anatomy-aware Supervised Contrastive Learning Framework for Low-dose CT Denoising
Zhihao Chen
Qinhan Gao
Yi Zhang
Hongming Shan
21
16
0
23 Jul 2023
Patch-Level Contrasting without Patch Correspondence for Accurate and
  Dense Contrastive Representation Learning
Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning
Shaofeng Zhang
Feng Zhu
Rui Zhao
Junchi Yan
16
16
0
23 Jun 2023
FLSL: Feature-level Self-supervised Learning
FLSL: Feature-level Self-supervised Learning
Qing Su
Anton Netchaev
Hai Helen Li
Shihao Ji
19
4
0
09 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and
  Outlook of Recent Work
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
21
0
0
02 Jun 2023
Model-Contrastive Federated Domain Adaptation
Model-Contrastive Federated Domain Adaptation
Chang’an Yi
Haotian Chen
Yonghui Xu
Yifan Zhang
MedIm
FedML
17
0
0
07 May 2023
Detecting Novelties with Empty Classes
Detecting Novelties with Empty Classes
Svenja Uhlemeyer
Julian Lienen
Eyke Hüllermeier
Hanno Gottschalk
13
1
0
30 Apr 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image
  Segmentation
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation
Yousef Yeganeh
Azade Farshad
Peter Weinberger
Seyed-Ahmad Ahmadi
Ehsan Adeli
Nassir Navab
ViT
MedIm
15
0
0
28 Apr 2023
A Cookbook of Self-Supervised Learning
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
31
272
0
24 Apr 2023
Self-Supervised Learning from Non-Object Centric Images with a Geometric
  Transformation Sensitive Architecture
Self-Supervised Learning from Non-Object Centric Images with a Geometric Transformation Sensitive Architecture
Taeho Kim
Jong-Min Lee
18
0
0
17 Apr 2023
Leveraging Hidden Positives for Unsupervised Semantic Segmentation
Leveraging Hidden Positives for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
ViT
26
32
0
27 Mar 2023
CrOC: Cross-View Online Clustering for Dense Visual Representation
  Learning
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning
Thomas Stegmüller
Tim Lebailly
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
24
15
0
23 Mar 2023
Anatomical Invariance Modeling and Semantic Alignment for
  Self-supervised Learning in 3D Medical Image Analysis
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis
Yankai Jiang
Ming Sun
Heng Guo
Xiaoyu Bai
K. Yan
Le Lu
Minfeng Xu
MedIm
24
20
0
11 Feb 2023
Masked Autoencoding Does Not Help Natural Language Supervision at Scale
Masked Autoencoding Does Not Help Natural Language Supervision at Scale
Floris Weers
Vaishaal Shankar
Angelos Katharopoulos
Yinfei Yang
Tom Gunter
CLIP
13
4
0
19 Jan 2023
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
6
9
0
05 Dec 2022
CLIP-FLow: Contrastive Learning by semi-supervised Iterative Pseudo
  labeling for Optical Flow Estimation
CLIP-FLow: Contrastive Learning by semi-supervised Iterative Pseudo labeling for Optical Flow Estimation
Zhiqi Zhang
Nitin Bansal
Changjiang Cai
Pan Ji
Qingan Yan
Xiangyu Xu
Yi Tian Xu
30
5
0
25 Oct 2022
Perceptual Grouping in Contrastive Vision-Language Models
Perceptual Grouping in Contrastive Vision-Language Models
Kanchana Ranasinghe
Brandon McKinzie
S. S. Ravi
Yinfei Yang
Alexander Toshev
Jonathon Shlens
VLM
19
51
0
18 Oct 2022
Spatial Entropy as an Inductive Bias for Vision Transformers
Spatial Entropy as an Inductive Bias for Vision Transformers
E. Peruzzo
E. Sangineto
Yahui Liu
Marco De Nadai
Wei Bi
Bruno Lepri
N. Sebe
ViT
MDE
26
1
0
09 Jun 2022
Continual Barlow Twins: continual self-supervised learning for remote
  sensing semantic segmentation
Continual Barlow Twins: continual self-supervised learning for remote sensing semantic segmentation
V. Marsocci
Simone Scardapane
CLL
19
25
0
23 May 2022
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
292
5,761
0
29 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,604
0
24 Feb 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021
Improved Baselines with Momentum Contrastive Learning
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
238
3,359
0
09 Mar 2020
1