Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.14279
Cited By
Localizing Objects with Self-Supervised Transformers and no Labels
29 September 2021
Oriane Siméoni
Gilles Puy
Huy V. Vo
Simon Roburin
Spyros Gidaris
Andrei Bursuc
P. Pérez
Renaud Marlet
Jean Ponce
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Localizing Objects with Self-Supervised Transformers and no Labels"
20 / 20 papers shown
Title
UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
T. Lentsch
Holger Caesar
D. Gavrila
3DPC
62
7
0
20 Feb 2025
Explaining the Impact of Training on Vision Models via Activation Clustering
Ahcène Boubekki
Samuel G. Fadel
Sebastian Mair
84
0
0
29 Nov 2024
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
Chanyoung Kim
Dayun Ju
Woojung Han
Ming-Hsuan Yang
Seong Jae Hwang
VLM
VOS
63
0
0
26 Nov 2024
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
Dylan Li
Gyungin Shin
12
3
0
27 Sep 2024
Historical Printed Ornaments: Dataset and Tasks
Sayan Kumar Chaki
Z. S. Baltaci
Elliot Vincent
Remi Emonet
Fabienne Vial-Bonacci
Christelle Bahier-Porte
Mathieu Aubry
Thierry Fournel
22
0
0
16 Aug 2024
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim
Zhuoxiao Chen
Mahsa Baktashmotlagh
Zhi Chen
Xin Yu
Zi Huang
Yadan Luo
VLM
ObjD
52
1
0
21 Jun 2024
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
25
1
0
04 Apr 2024
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
Jiayun Luo
Siddhesh Khandelwal
Leonid Sigal
Boyang Albert Li
MLLM
VLM
16
7
0
28 Nov 2023
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
Jinjin Xu
Liwu Xu
Yuzhe Yang
Xiang Li
Fanyi Wang
Yanchun Xie
Yi-Jie Huang
Yaqian Li
MoE
MLLM
VLM
8
12
0
09 Nov 2023
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Shashanka Venkataramanan
Mamshad Nayeem Rizve
João Carreira
Yuki M. Asano
Yannis Avrithis
SSL
12
18
0
12 Oct 2023
Vision Transformers Need Registers
Zilong Chen
Maxime Oquab
Julien Mairal
Huaping Liu
ViT
15
308
0
28 Sep 2023
SEMPART: Self-supervised Multi-resolution Partitioning of Image Semantics
Sriram Ravindran
Debraj Basu
13
2
0
20 Sep 2023
Optical Flow boosts Unsupervised Localization and Segmentation
Xinyu Zhang
Abdeslam Boularias
18
5
0
25 Jul 2023
Mitigating Bias: Enhancing Image Classification by Improving Model Explanations
Raha Ahmadi
Mohammad Javad Rajabi
Mohammad Khalooiem
Mohammad Sabokrou
8
0
0
04 Jul 2023
Removing supervision in semantic segmentation with local-global matching and area balancing
Simone Rossetti
Nico Sama
F. Pirri
VLM
17
0
0
30 Mar 2023
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
20
158
0
26 Jan 2023
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
Andrea Vedaldi
11
158
0
16 May 2022
Deep ViT Features as Dense Visual Descriptors
Shirzad Amir
Yossi Gandelsman
Shai Bagon
Tali Dekel
MDE
ViT
17
271
0
10 Dec 2021
Intriguing Properties of Vision Transformers
Muzammal Naseer
Kanchana Ranasinghe
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
ViT
240
512
0
21 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
1