Universal scaling laws in the gradient descent training of neural networks

Papers citing "Universal scaling laws in the gradient descent training of neural networks"