Looped ReLU MLPs May Be All You Need as Practical Programmable ComputersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024 |
Dense Associative Memory Through the Lens of Random FeaturesNeural Information Processing Systems (NeurIPS), 2024 |
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient DescentInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024 |
Fundamental Limitations on Subquadratic Alternatives to TransformersInternational Conference on Learning Representations (ICLR), 2024 |
The Closeness of In-Context Learning and Weight Shifting for Softmax
RegressionNeural Information Processing Systems (NeurIPS), 2023 |
Open-Ended Multi-Modal Relational Reasoning for Video Question AnsweringIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2020 |