Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.00027
Cited By
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
31 December 2020
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
Charles Foster
Jason Phang
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Pile: An 800GB Dataset of Diverse Text for Language Modeling"
Title
No papers