ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.02258
  4. Cited By
Mixture-of-Depths: Dynamically allocating compute in transformer-based
  language models

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

2 April 2024
David Raposo
Sam Ritter
Blake A. Richards
Timothy Lillicrap
Peter C. Humphreys
Adam Santoro
    MoE
ArXivPDFHTML

Papers citing "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

1 / 51 papers shown
Title
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
SpaceByte: Towards Deleting Tokenization from Large Language Modeling
Kevin Slagle
32
3
0
22 Apr 2024
Previous
12