Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization

29 June 2024

Papers citing "Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization"

Title
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism | M. Shoeybi, M. Patwary, Raul Puri, P. LeGresley, Jared Casper, Bryan Catanzaro | MoE | 243 | 1,817 | 0 | 17 Sep 2019
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima | N. Keskar, Dheevatsa Mudigere, J. Nocedal, M. Smelyanskiy, P. T. P. Tang | ODL | 273 | 2,886 | 0 | 15 Sep 2016