When Expressivity Meets Trainability: Fewer than n Neurons Can Work. Neural Information Processing Systems (NeurIPS), 2022 |
On the Omnipresence of Spurious Local Minima in Certain Neural Network Training Problems. Constructive Approximation (Constr. Approx.), 2022 |
Critical Point-Finding Methods Reveal Gradient-Flat Regions of Deep Network Losses. Neural Computation (Neural Comput.), 2020 |