
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression

10 March 2025
Thibaut Loiseau
Guillaume Bourmaud
Vincent Lepetit
Abstract

Pre-training techniques have greatly advanced computer vision, with CroCo's cross-view completion approach yielding impressive results in tasks like 3D reconstruction and pose regression. However, this method requires substantial overlap between training pairs, limiting its effectiveness. We introduce Alligat0R, a novel pre-training approach that reformulates cross-view learning as a co-visibility segmentation task. Our method predicts whether each pixel in one image is co-visible in the second image, occluded, or outside the field of view (FOV), enabling the use of image pairs with any degree of overlap and providing interpretable predictions. To support this, we present Cub3, a large-scale dataset with 2.5 million image pairs and dense co-visibility annotations derived from the nuScenes dataset. This dataset includes diverse scenarios with varying degrees of overlap. The experiments show that Alligat0R significantly outperforms CroCo in relative pose regression, especially in scenarios with limited overlap. Alligat0R and Cub3 will be made publicly available.
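
To make the three-way labelling concrete, here is a minimal sketch of how per-pixel co-visibility labels could be computed for one image pair. It is not the authors' annotation pipeline, which the abstract does not describe. It assumes a pinhole camera with shared intrinsics `K`, dense depth maps for both images, and a known relative pose `(R, t)` mapping camera-1 coordinates to camera-2 coordinates. The function name, class indices, and the `rel_tol` depth tolerance are all hypothetical.

```python
import numpy as np

# Hypothetical class indices for the three labels named in the abstract.
COVISIBLE, OCCLUDED, OUTSIDE_FOV = 0, 1, 2


def covisibility_labels(depth1, depth2, K, R, t, rel_tol=0.05):
    """Label each pixel of image 1 relative to image 2.

    Assumptions (not from the paper): pinhole model, shared intrinsics K,
    metric depth maps, and X2 = R @ X1 + t.
    """
    h1, w1 = depth1.shape
    h2, w2 = depth2.shape

    # Homogeneous pixel coordinates (u, v, 1) for every pixel of image 1.
    u, v = np.meshgrid(np.arange(w1), np.arange(h1))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).astype(float)

    # Back-project to 3D in camera 1, then move into the camera-2 frame.
    pts1 = (pix @ np.linalg.inv(K).T) * depth1.reshape(-1, 1)
    pts2 = pts1 @ R.T + t
    z = pts2[:, 2]

    # Default: outside the field of view (also covers points behind camera 2).
    labels = np.full(h1 * w1, OUTSIDE_FOV, dtype=np.uint8)

    # Project into image 2; guard the division for points behind the camera.
    in_front = z > 1e-6
    safe_z = np.where(in_front, z, 1.0)
    proj = pts2 @ K.T
    u2 = np.round(proj[:, 0] / safe_z).astype(int)
    v2 = np.round(proj[:, 1] / safe_z).astype(int)
    inside = in_front & (u2 >= 0) & (u2 < w2) & (v2 >= 0) & (v2 < h2)

    # Inside the frame: co-visible if the depth seen by camera 2 agrees with
    # the reprojected depth, otherwise something closer occludes the point.
    idx = np.flatnonzero(inside)
    seen = depth2[v2[idx], u2[idx]]
    agrees = np.abs(seen - z[idx]) <= rel_tol * z[idx]
    labels[idx] = np.where(agrees, COVISIBLE, OCCLUDED)

    return labels.reshape(h1, w1)
```

A label map of this kind would serve as the segmentation target for the first image. Because every pixel receives a label even when nothing is shared between the views, pairs with little or no overlap remain usable as training examples, which is the property the abstract highlights.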

@article{loiseau2025_2503.07561,
  title={Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression},
  author={Thibaut Loiseau and Guillaume Bourmaud and Vincent Lepetit},
  journal={arXiv preprint arXiv:2503.07561},
  year={2025}
}