Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.00356
Cited By
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization
29 June 2024
Hongjun Choi
Jayaraman J. Thiagarajan
Ruben Glatt
Shusen Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization"
2 / 2 papers shown
Title
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,817
0
17 Sep 2019
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,886
0
15 Sep 2016
1