Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.11959
Cited By
Is Attention All What You Need? -- An Empirical Investigation on Convolution-Based Active Memory and Self-Attention
27 December 2019
Thomas D. Dowdell
Hongyu Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is Attention All What You Need? -- An Empirical Investigation on Convolution-Based Active Memory and Self-Attention"
1 / 1 papers shown
Title
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,817
0
17 Sep 2019
1