Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention

2 July 2025

Jiawei Gu

Ziyue Qiao

Zechao Li

OODD

ArXiv (abs)PDF HTML

Main:8 Pages

8 Figures

Bibliography:2 Pages

21 Tables

Appendix:14 Pages

Abstract

Out-of-Distribution (OOD) detection is critical for safely deploying deep models in open-world environments, where inputs may lie outside the training distribution. During inference on a model trained exclusively with In-Distribution (ID) data, we observe a salient gradient phenomenon: around an ID sample, the local gradient directions for "enhancing" that sample's predicted class remain relatively consistent, whereas OOD samples--unseen in training--exhibit disorganized or conflicting gradient directions in the same neighborhood. Motivated by this observation, we propose an inference-stage technique to short-circuit those feature coordinates that spurious gradients exploit to inflate OOD confidence, while leaving ID classification largely intact. To circumvent the expense of recomputing the logits after this gradient short-circuit, we further introduce a local first-order approximation that accurately captures the post-modification outputs without a second forward pass. Experiments on standard OOD benchmarks show our approach yields substantial improvements. Moreover, the method is lightweight and requires minimal changes to the standard inference pipeline, offering a practical path toward robust OOD detection in real-world applications.

View on arXiv

@article{gu2025_2507.01417,
  title={ Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention },
  author={ Jiawei Gu and Ziyue Qiao and Zechao Li },
  journal={arXiv preprint arXiv:2507.01417},
  year={ 2025 }
}

Comments on this paper