ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.12025
34
0

Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture

15 October 2024
Sajad Movahedi
Antonio Orvieto
Seyed-Mohsen Moosavi-Dezfooli
    AAML
    AI4CE
ArXivPDFHTML
Abstract

In this paper, we propose the geometric invariance hypothesis (GIH)\textit{geometric invariance hypothesis (GIH)}geometric invariance hypothesis (GIH), which argues that the input space curvature of a neural network remains invariant under transformation in certain architecture-dependent directions during training. We investigate a simple, non-linear binary classification problem residing on a plane in a high dimensional space and observe that\unicodex2014\unicode{x2014}\unicodex2014unlike MPLs\unicodex2014\unicode{x2014}\unicodex2014ResNets fail to generalize depending on the orientation of the plane. Motivated by this example, we define a neural network's average geometry\textbf{average geometry}average geometry and average geometry evolution\textbf{average geometry evolution}average geometry evolution as compact architecture-dependent\textit{architecture-dependent}architecture-dependent summaries of the model's input-output geometry and its evolution during training. By investigating the average geometry evolution at initialization, we discover that the geometry of a neural network evolves according to the data covariance projected onto its average geometry. This means that the geometry only changes in a subset of the input space when the average geometry is low-rank, such as in ResNets. This causes an architecture-dependent invariance property in the input space curvature, which we dub GIH. Finally, we present extensive experimental results to observe the consequences of GIH and how it relates to generalization in neural networks.

View on arXiv
@article{movahedi2025_2410.12025,
  title={ Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture },
  author={ Sajad Movahedi and Antonio Orvieto and Seyed-Mohsen Moosavi-Dezfooli },
  journal={arXiv preprint arXiv:2410.12025},
  year={ 2025 }
}
Comments on this paper