
Title |
|---|
Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks. International Conference on Learning Representations (ICLR), 2025 |
Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness. International Conference on Learning Representations (ICLR), 2024 |