Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.14780
Cited By
Training and Evaluation of a Multilingual Tokenizer for GPT-SW3
28 April 2023
Felix Stollenwerk
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training and Evaluation of a Multilingual Tokenizer for GPT-SW3"
2 / 2 papers shown
Title
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
Phillip Rust
Jonas Pfeiffer
Ivan Vulić
Sebastian Ruder
Iryna Gurevych
69
235
0
31 Dec 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,791
0
17 Sep 2019
1