2404.02258
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
2 April 2024
David Raposo
Sam Ritter
Blake A. Richards
Timothy Lillicrap
Peter C. Humphreys
Adam Santoro
MoE
Papers citing "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
1 / 51 papers shown
Title
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
Kevin Slagle
32
3
0
22 Apr 2024