Naturally Computed Scale Invariance in the Residual Stream of ResNet18

22 April 2025

Abstract

An important capacity in visual object recognition is invariance to image-altering variables which leave the identity of objects unchanged, such as lighting, rotation, and scale. How do neural networks achieve this? Prior mechanistic interpretability research has illuminated some invariance-building circuitry in InceptionV1, but the results are limited and networks with different architectures have remained largely unexplored. This work investigates ResNet18 with a particular focus on its residual stream, an architectural component which InceptionV1 lacks. We observe that many convolutional channels in intermediate blocks exhibit scale invariant properties, computed by the element-wise residual summation of scale equivariant representations: the block input's smaller-scale copy with the block pre-sum output's larger-scale copy. Through subsequent ablation experiments, we attempt to causally link these neural properties with scale-robust object recognition behavior. Our tentative findings suggest how the residual stream computes scale invariance and its possible role in behavior. Code is available at:this https URL

View on arXiv

@article{longon2025_2504.16290,
  title={ Naturally Computed Scale Invariance in the Residual Stream of ResNet18 },
  author={ André Longon },
  journal={arXiv preprint arXiv:2504.16290},
  year={ 2025 }
}

Comments on this paper