Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.06192
Cited By
Revisiting BFloat16 Training
13 October 2020
Pedram Zamirai
Jian Zhang
Christopher R. Aberger
Christopher De Sa
FedML
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting BFloat16 Training"
3 / 3 papers shown
Title
Stochastic Rounding for LLM Training: Theory and Practice
Kaan Ozkara
Tao Yu
Youngsuk Park
36
0
0
27 Feb 2025
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,817
0
17 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,956
0
20 Apr 2018
1