2402.01771
BlackMamba: Mixture of Experts for State-Space Models
1 February 2024
Quentin G. Anthony, Yury Tokpanov, Paolo Glorioso, Beren Millidge
Papers citing "BlackMamba: Mixture of Experts for State-Space Models"
4 / 4 papers shown
MambaLRP: Explaining Selective State Space Sequence Models
F. Jafari, G. Montavon, Klaus-Robert Müller, Oliver Eberle
Mamba
47
9
0
11 Jun 2024
Zoology: Measuring and Improving Recall in Efficient Language Models
Simran Arora, Sabri Eyuboglu, Aman Timalsina, Isys Johnson, Michael Poli, James Zou, Atri Rudra, Christopher Ré
56
65
0
08 Dec 2023
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, ..., Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy
AIMat
245
1,977
0
31 Dec 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi, M. Patwary, Raul Puri, P. LeGresley, Jared Casper, Bryan Catanzaro
MoE
243
1,791
0
17 Sep 2019