
Title |
|---|
![]() Attention as a HypernetworkInternational Conference on Learning Representations (ICLR), 2024 |
![]() Neural Pfaffians: Solving Many Many-Electron Schrödinger EquationsNeural Information Processing Systems (NeurIPS), 2024 Nicholas Gao Stephan Günnemann |
![]() Improving Transformers with Dynamically Composable Multi-Head AttentionInternational Conference on Machine Learning (ICML), 2024 |
![]() You Only Cache Once: Decoder-Decoder Architectures for Language ModelsNeural Information Processing Systems (NeurIPS), 2024 |