Quadratic Suffices for Over-parametrization via Matrix Chernoff Bound
v1v2 (latest)

Quadratic Suffices for Over-parametrization via Matrix Chernoff Bound

Papers citing "Quadratic Suffices for Over-parametrization via Matrix Chernoff Bound"

50 / 75 papers shown
Title
Evaluating the design space of diffusion-based generative models
Evaluating the design space of diffusion-based generative modelsNeural Information Processing Systems (NeurIPS), 2024
215
14
0
18 Jun 2024
Six Lectures on Linearized Neural Networks
Six Lectures on Linearized Neural NetworksJournal of Statistical Mechanics: Theory and Experiment (J. Stat. Mech.), 2023
224
16
0
25 Aug 2023
How to Protect Copyright Data in Optimization of Large Language Models?
How to Protect Copyright Data in Optimization of Large Language Models?AAAI Conference on Artificial Intelligence (AAAI), 2023
136
36
0
23 Aug 2023
Memory capacity of two layer neural networks with smooth activations
Memory capacity of two layer neural networks with smooth activationsSIAM Journal on Mathematics of Data Science (SIMODS), 2023
212
6
0
03 Aug 2023
A Sublinear Adversarial Training Algorithm
A Sublinear Adversarial Training AlgorithmInternational Conference on Learning Representations (ICLR), 2022
131
26
0
10 Aug 2022
Federated Adversarial Learning: A Framework with Convergence Analysis
Federated Adversarial Learning: A Framework with Convergence AnalysisInternational Conference on Machine Learning (ICML), 2022
158
28
0
07 Aug 2022
Implicit Bias of MSE Gradient Optimization in Underparameterized Neural
  Networks
Implicit Bias of MSE Gradient Optimization in Underparameterized Neural NetworksInternational Conference on Learning Representations (ICLR), 2022
142
13
0
12 Jan 2022
Does Preprocessing Help Training Over-parameterized Neural Networks?
Does Preprocessing Help Training Over-parameterized Neural Networks?Neural Information Processing Systems (NeurIPS), 2021
170
50
0
09 Oct 2021
Early-stopped neural networks are consistent
Early-stopped neural networks are consistentNeural Information Processing Systems (NeurIPS), 2021
166
45
0
10 Jun 2021
GIST: Distributed Training for Large-Scale Graph Convolutional Networks
GIST: Distributed Training for Large-Scale Graph Convolutional NetworksJournal of Applied and Computational Topology (JACT), 2021
168
11
0
20 Feb 2021
On the Proof of Global Convergence of Gradient Descent for Deep ReLU
  Networks with Linear Widths
On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear WidthsInternational Conference on Machine Learning (ICML), 2021
165
51
0
24 Jan 2021

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.