
Title |
|---|
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines. Annual Meeting of the Association for Computational Linguistics (ACL), 2024 |
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models. International Conference on Machine Learning (ICML), 2024 |
An Adversarial Example for Direct Logit Attribution: Memory Management in gelu-4l. BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023 |
Discovering the Compositional Structure of Vector Representations with Role Learning Networks. BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2019 |