Hypersolid: Emergent Vision Representations via Short-Range Repulsion
Esteban Rodríguez-Betancourt
Edgar Casasola-Murillo
Main:8 Pages
16 Figures
Bibliography:1 Pages
4 Tables
Appendix:8 Pages
Abstract
A recurring challenge in self-supervised learning is preventing representation collapse. Existing solutions typically rely on global regularization, such as maximizing distances, decorrelating dimensions or enforcing certain distributions. We instead reinterpret representation learning as a discrete packing problem, where preserving information simplifies to maintaining injectivity. We operationalize this in Hypersolid, a method using short-range hard-ball repulsion to prevent local collisions. This constraint results in a high-separation geometric regime that preserves augmentation diversity, excelling on fine-grained and low-resolution classification tasks.
View on arXivComments on this paper
