
Asynchronous Local-SGD Training for Language Modeling
Bo Liu
Rachita Chhaparia
Arthur Douillard
Satyen Kale
Andrei A. Rusu
Jiajun Shen
Arthur Szlam
MarcÁurelio Ranzato
Papers citing "Asynchronous Local-SGD Training for Language Modeling"
5 / 5 papers shown
Title |
---|