
Title |
|---|
![]() Lines of Thought in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024 |
![]() Out-of-distribution generalization via composition: a lens through induction heads in TransformersProceedings of the National Academy of Sciences of the United States of America (PNAS), 2024 |
![]() An Information-Theoretic Analysis of In-Context LearningInternational Conference on Machine Learning (ICML), 2024 |
![]() Characterizing Large Language Model Geometry Helps Solve Toxicity
Detection and GenerationInternational Conference on Machine Learning (ICML), 2023 |