Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.10716
Cited By
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
19 October 2022
Philippe Weinzaepfel
Vincent Leroy
Thomas Lucas
Romain Brégier
Yohann Cabon
Vaibhav Arora
L. Antsfeld
Boris Chidlovskii
G. Csurka
Jérôme Revaud
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion"
50 / 55 papers shown
Title
When Dance Video Archives Challenge Computer Vision
P. Colantoni
Rafique Ahmed
Prashant Ghimire
Damien Muselet
A. Trémeau
3DH
21
0
0
12 May 2025
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
62
0
0
01 May 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
29
0
0
22 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
41
0
0
10 Apr 2025
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon
Heeseong Shin
Eunbeen Hong
Hyunwook Choi
Hansang Cho
Daun Jeong
Seungryong Kim
22
0
0
07 Apr 2025
Dexterous Manipulation through Imitation Learning: A Survey
Shan An
Ziyu Meng
Chao Tang
Y. Zhou
Tengyu Liu
...
Yao Mu
Ran Song
Wei Zhang
Zeng-Guang Hou
H. Zhang
40
0
0
04 Apr 2025
Speedy MASt3R
Jingxing Li
Yongjae Lee
Abhay Kumar Yadav
Cheng-Fang Peng
Rama Chellappa
Deliang Fan
3DGS
61
0
0
13 Mar 2025
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression
Thibaut Loiseau
Guillaume Bourmaud
Vincent Lepetit
62
0
0
10 Mar 2025
MUSt3R: Multi-view Network for Stereo 3D Reconstruction
Yohann Cabon
Lucas Stoffl
L. Antsfeld
G. Csurka
Boris Chidlovskii
Jérôme Revaud
Vincent Leroy
3DGS
3DV
53
2
0
03 Mar 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
103
4
0
11 Feb 2025
MATCHA:Towards Matching Anything
Fei Xue
Sven Elflein
Laura Leal-Taixe
Qunjie Zhou
45
0
0
28 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
66
16
0
23 Jan 2025
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Z. Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
121
1
0
18 Dec 2024
Efficient Object-centric Representation Learning with Pre-trained Geometric Prior
Phúc H. Lê Khắc
Graham Healy
A. Smeaton
OCL
66
0
0
16 Dec 2024
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Yue Chen
Xingyu Chen
Anpei Chen
Gerard Pons-Moll
Yuliang Xiu
3DGS
83
3
0
12 Dec 2024
Cross-View Completion Models are Zero-shot Correspondence Estimators
Honggyu An
J. Kim
Seonghoon Park
Jaewoo Jung
Jisang Han
Sunghwan Hong
Seungryong Kim
3DV
73
3
0
12 Dec 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
111
3
0
26 Nov 2024
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Xuweiyi Chen
Markus Marks
Zezhou Cheng
66
0
0
25 Nov 2024
Generating 3D-Consistent Videos from Unposed Internet Photos
Gene Chou
Kai Zhang
Sai Bi
Hao Tan
Zexiang Xu
Fujun Luan
Bharath Hariharan
Noah Snavely
3DGS
VGen
75
3
0
20 Nov 2024
Extreme Rotation Estimation in the Wild
Hana Bezalel
Dotan Ankri
Ruojin Cai
Hadar Averbuch-Elor
20
2
0
11 Nov 2024
3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction
Jongmin Lee
Minsu Cho
34
1
0
01 Nov 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
Zhiyuan Min
Yawei Luo
Jianwen Sun
Yi Yang
3DGS
36
0
0
30 Oct 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan
Jian Zhang
Wenyan Cong
Peihao Wang
Renjie Li
...
Z. Wang
Danfei Xu
B. Ivanovic
Marco Pavone
Yue Wang
3DV
39
11
0
24 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Stephen Hausler
Peyman Moghadam
SSL
ViT
24
2
0
09 Oct 2024
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang
Charles Herrmann
Junhwa Hur
Varun Jampani
Trevor Darrell
Forrester Cole
Deqing Sun
Ming Yang
VGen
79
69
0
04 Oct 2024
Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks
Sierra Bonilla
Chiara Di Vece
Rema Daher
Xinwei Ju
Danail Stoyanov
Francisco Vasconcelos
Sophia Bano
3DV
29
1
0
29 Aug 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
Alex N. Wang
Christopher Hoang
Yuwen Xiong
Yann LeCun
Mengye Ren
64
0
0
20 Aug 2024
Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image
Xinlin Ren
Chenjie Cao
Yanwei Fu
Xiangyang Xue
29
2
0
04 Aug 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
41
23
0
29 Jul 2024
Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry
Boris Chidlovskii
L. Antsfeld
MDE
ViT
27
1
0
16 Jun 2024
Neural Isometries: Taming Transformations for Equivariant ML
Thomas W. Mitchel
Michael Taylor
Vincent Sitzmann
21
0
0
29 May 2024
SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow
Yihan Wang
Lahav Lipson
Jia Deng
32
36
0
23 May 2024
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
Jian Liu
Wei Sun
Hui Yang
Zhiwen Zeng
Chongpei Liu
Jin Zheng
Xingyu Liu
Hossein Rahmani
N. Sebe
Ajmal Saeed Mian
31
15
0
13 May 2024
Playing to Vision Foundation Model's Strengths in Stereo Matching
Chuangwei Liu
Qijun Chen
Rui Fan
25
12
0
09 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakahrov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
32
12
0
01 Apr 2024
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
27
143
0
28 Dec 2023
DUSt3R: Geometric 3D Vision Made Easy
Shuzhe Wang
Vincent Leroy
Yohann Cabon
Boris Chidlovskii
Jérôme Revaud
3DGS
26
317
0
21 Dec 2023
Low-shot Object Learning with Mutual Exclusivity Bias
Anh Thai
Ahmad Humayun
Stefan Stojanov
Zixuan Huang
Bikram Boote
James M. Rehg
27
2
0
06 Dec 2023
Learning from One Continuous Video Stream
João Carreira
Michael King
Viorica Patraucean
Dilara Gokay
Catalin Ionescu
...
Joseph Heyward
Carl Doersch
Y. Aytar
Dima Damen
Andrew Zisserman
CLL
16
4
0
01 Dec 2023
MFOS: Model-Free & One-Shot Object Pose Estimation
Jongmin Lee
Yohann Cabon
Romain Brégier
Sungjoo Yoo
Jérôme Revaud
ViT
19
6
0
03 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows
Vincent Leroy
Jérôme Revaud
Thomas Lucas
Philippe Weinzaepfel
ViT
27
2
0
01 Oct 2023
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon
G. Bono
L. Antsfeld
Boris Chidlovskii
Zhi Zheng
Christian Wolf
3DV
19
9
0
28 Sep 2023
M
3
^{3}
3
3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
14
1
0
26 Sep 2023
SACReg: Scene-Agnostic Coordinate Regression for Visual Localization
Jérôme Revaud
Yohann Cabon
Romain Brégier
Jongmin Lee
Philippe Weinzaepfel
22
10
0
21 Jul 2023
MIMIC: Masked Image Modeling with Image Correspondences
Kalyani Marathe
Mahtab Bigverdi
Nishat Khan
Tuhin Kundu
Patrick Howe
Sharan Ranjit S
Anand Bhattad
Aniruddha Kembhavi
Linda G. Shapiro
Ranjay Krishna
12
0
0
27 Jun 2023
Audiovisual Masked Autoencoders
Mariana-Iuliana Georgescu
Eduardo Fonseca
Radu Tudor Ionescu
Mario Lucic
Cordelia Schmid
Anurag Arnab
SSL
22
43
0
09 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
6
9
0
05 Dec 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
15
79
0
18 Nov 2022
Weak Augmentation Guided Relational Self-Supervised Learning
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Changshui Zhang
Xiaogang Wang
Chang Xu
9
4
0
16 Mar 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
1
2
Next