2204.02483
Considerations for Multilingual Wikipedia Research
5 April 2022
Isaac Johnson
Emily A. Lescak
Papers citing "Considerations for Multilingual Wikipedia Research"
2 / 2 papers shown
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Krishna Srinivasan, K. Raman, Jiecao Chen, Michael Bendersky, Marc Najork
VLM
208
310
0
02 Mar 2021
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, ..., Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy
AIMat
253
1,989
0
31 Dec 2020