ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.01331
  4. Cited By
Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions
v1v2v3v4 (latest)

Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions

International Conference on Machine Learning (ICML), 2022
2 February 2022
Aaron Mishkin
Arda Sahiner
Mert Pilanci
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions"

50 / 52 papers shown
A Recovery Guarantee for Sparse Neural Networks
A Recovery Guarantee for Sparse Neural Networks
Sara Fridovich-Keil
Mert Pilanci
116
1
0
24 Sep 2025
Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network
Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network
Ryoya Fukasaku
Y. Kabata
Akifumi Okuno
234
0
0
25 Aug 2025
Sliced-Wasserstein Distance-based Data Selection
Sliced-Wasserstein Distance-based Data Selection
Julien Pallage
Antoine Lesage-Landry
283
1
0
17 Apr 2025
Generative Feature Training of Thin 2-Layer Networks
Generative Feature Training of Thin 2-Layer Networks
J. Hertrich
Sebastian Neumayer
GAN
500
3
0
11 Nov 2024
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex
  Neural Networks
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural NetworksNeural Information Processing Systems (NeurIPS), 2024
Miria Feng
Zachary Frangella
Mert Pilanci
BDL
465
4
0
02 Nov 2024
Convex Distillation: Efficient Compression of Deep Networks via Convex
  Optimization
Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization
Prateek Varshney
Mert Pilanci
465
0
0
09 Oct 2024
Randomized Geometric Algebra Methods for Convex Neural Networks
Randomized Geometric Algebra Methods for Convex Neural Networks
Yifei Wang
Sungyoon Kim
Paul Chu
Indu Subramaniam
Mert Pilanci
AAML
365
3
0
04 Jun 2024
Learning with Norm Constrained, Over-parameterized, Two-layer Neural
  Networks
Learning with Norm Constrained, Over-parameterized, Two-layer Neural Networks
Fanghui Liu
L. Dadi
Volkan Cevher
482
10
0
29 Apr 2024
The Real Tropical Geometry of Neural Networks
The Real Tropical Geometry of Neural Networks
Marie-Charlotte Brandenburg
Georg Loho
Guido Montúfar
453
20
0
18 Mar 2024
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in
  Polynomial Time
Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial TimeInternational Conference on Machine Learning (ICML), 2024
Sungyoon Kim
Mert Pilanci
586
10
0
06 Feb 2024
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models
Fangzhao Zhang
Mert Pilanci
AI4CE
520
40
0
04 Feb 2024
Analyzing Neural Network-Based Generative Diffusion Models through
  Convex Optimization
Analyzing Neural Network-Based Generative Diffusion Models through Convex Optimization
Fangzhao Zhang
Mert Pilanci
DiffM
363
5
0
03 Feb 2024
Fixing the NTK: From Neural Network Linearizations to Exact Convex
  Programs
Fixing the NTK: From Neural Network Linearizations to Exact Convex ProgramsNeural Information Processing Systems (NeurIPS), 2023
Rajat Vadiraj Dwaraknath
Tolga Ergen
Mert Pilanci
337
0
0
26 Sep 2023
On the Global Convergence of Natural Actor-Critic with Two-layer Neural
  Network Parametrization
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization
Mudit Gaur
Amrit Singh Bedi
Di-di Wang
Vaneet Aggarwal
296
8
0
18 Jun 2023
Optimal Sets and Solution Paths of ReLU Networks
Optimal Sets and Solution Paths of ReLU NetworksInternational Conference on Machine Learning (ICML), 2023
Aaron Mishkin
Mert Pilanci
385
7
0
31 May 2023
Variation Spaces for Multi-Output Neural Networks: Insights on
  Multi-Task Learning and Network Compression
Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network CompressionJournal of machine learning research (JMLR), 2023
Joseph Shenouda
Rahul Parhi
Kangwook Lee
Robert D. Nowak
388
21
0
25 May 2023
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks
  with Soft-Thresholding
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks with Soft-ThresholdingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Chunyan Xiong
Meng Lu
Xiaotong Yu
JIAN-PENG Cao
Zhong Chen
D. Guo
X. Qu
MLT
380
3
0
14 Apr 2023
Training a Two Layer ReLU Network Analytically
Training a Two Layer ReLU Network AnalyticallyItalian National Conference on Sensors (INS), 2023
Adrian Barbu
317
9
0
06 Apr 2023
Efficient displacement convex optimization with particle gradient
  descent
Efficient displacement convex optimization with particle gradient descentInternational Conference on Machine Learning (ICML), 2023
Hadi Daneshmand
Jason D. Lee
Chi Jin
321
6
0
09 Feb 2023
Convexifying Transformers: Improving optimization and understanding of
  transformer networks
Convexifying Transformers: Improving optimization and understanding of transformer networks
Tolga Ergen
Behnam Neyshabur
Harsh Mehta
MLT
259
15
0
20 Nov 2022
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural
  Network Parametrization
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network ParametrizationInternational Conference on Machine Learning (ICML), 2022
Mudit Gaur
Vaneet Aggarwal
Mridul Agarwal
MLT
456
3
0
14 Nov 2022
PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized
  Deep Neural Networks
PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized Deep Neural Networks
Liu Yang
Jifan Zhang
Joseph Shenouda
Dimitris Papailiopoulos
Kangwook Lee
Robert D. Nowak
436
2
0
06 Oct 2022
Unraveling Attention via Convex Duality: Analysis and Interpretations of
  Vision Transformers
Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision TransformersInternational Conference on Machine Learning (ICML), 2022
Arda Sahiner
Tolga Ergen
Batu Mehmet Ozturkler
John M. Pauly
Morteza Mardani
Mert Pilanci
352
36
0
17 May 2022
Efficient Global Optimization of Two-Layer ReLU Networks: Quadratic-Time Algorithms and Adversarial Training
Efficient Global Optimization of Two-Layer ReLU Networks: Quadratic-Time Algorithms and Adversarial TrainingSIAM Journal on Mathematics of Data Science (SIMODS), 2022
Yatong Bai
Tanmay Gautam
Somayeh Sojoudi
AAML
381
19
0
06 Jan 2022
Parallel Deep Neural Networks Have Zero Duality Gap
Parallel Deep Neural Networks Have Zero Duality Gap
Yifei Wang
Tolga Ergen
Mert Pilanci
478
12
0
13 Oct 2021
Global Optimality Beyond Two Layers: Training Deep ReLU Networks via
  Convex Programs
Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex ProgramsInternational Conference on Machine Learning (ICML), 2021
Tolga Ergen
Mert Pilanci
OffRLMLT
315
35
0
11 Oct 2021
Hidden Convexity of Wasserstein GANs: Interpretable Generative Models
  with Closed-Form Solutions
Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions
Arda Sahiner
Tolga Ergen
Batu Mehmet Ozturkler
Burak Bartan
John M. Pauly
Morteza Mardani
Mert Pilanci
GAN
420
22
0
12 Jul 2021
Demystifying Batch Normalization in ReLU Networks: Equivalent Convex
  Optimization Models and Implicit Regularization
Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit RegularizationInternational Conference on Learning Representations (ICLR), 2021
Tolga Ergen
Arda Sahiner
Batu Mehmet Ozturkler
John M. Pauly
Morteza Mardani
Mert Pilanci
461
33
0
02 Mar 2021
On the Reproducibility of Neural Network Predictions
On the Reproducibility of Neural Network Predictions
Srinadh Bhojanapalli
Kimberly Wilber
Andreas Veit
A. S. Rawat
Seungyeon Kim
A. Menon
Sanjiv Kumar
345
42
0
05 Feb 2021
Vector-output ReLU Neural Network Problems are Copositive Programs:
  Convex Analysis of Two Layer Networks and Polynomial-time Algorithms
Vector-output ReLU Neural Network Problems are Copositive Programs: Convex Analysis of Two Layer Networks and Polynomial-time AlgorithmsInternational Conference on Learning Representations (ICLR), 2020
Arda Sahiner
Tolga Ergen
John M. Pauly
Mert Pilanci
MLT
568
45
0
24 Dec 2020
Convex Regularization Behind Neural Reconstruction
Convex Regularization Behind Neural ReconstructionInternational Conference on Learning Representations (ICLR), 2020
Arda Sahiner
Morteza Mardani
Batu Mehmet Ozturkler
Mert Pilanci
John M. Pauly
271
25
0
09 Dec 2020
Nonparametric Learning of Two-Layer ReLU Residual Units
Nonparametric Learning of Two-Layer ReLU Residual Units
Zhunxuan Wang
Linyun He
Chunchuan Lyu
Shay B. Cohen
MLTOffRL
556
1
0
17 Aug 2020
The Hidden Convex Optimization Landscape of Two-Layer ReLU Neural
  Networks: an Exact Characterization of the Optimal Solutions
The Hidden Convex Optimization Landscape of Two-Layer ReLU Neural Networks: an Exact Characterization of the Optimal SolutionsInternational Conference on Learning Representations (ICLR), 2020
Yifei Wang
Jonathan Lacotte
Mert Pilanci
MLT
431
30
0
10 Jun 2020
The Curious Case of Convex Neural Networks
The Curious Case of Convex Neural Networks
S. Sivaprasad
Ankur Singh
Naresh Manwani
Vineet Gandhi
482
33
0
09 Jun 2020
A Brief Prehistory of Double Descent
A Brief Prehistory of Double DescentProceedings of the National Academy of Sciences of the United States of America (PNAS), 2020
Marco Loog
T. Viering
A. Mey
Jesse H. Krijthe
David Tax
186
76
0
07 Apr 2020
Neural Networks are Convex Regularizers: Exact Polynomial-time Convex
  Optimization Formulations for Two-layer Networks
Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-layer NetworksInternational Conference on Machine Learning (ICML), 2020
Mert Pilanci
Tolga Ergen
393
140
0
24 Feb 2020
Revealing the Structure of Deep Neural Networks via Convex Duality
Revealing the Structure of Deep Neural Networks via Convex DualityInternational Conference on Machine Learning (ICML), 2020
Tolga Ergen
Mert Pilanci
MLT
527
77
0
22 Feb 2020
Deep Double Descent: Where Bigger Models and More Data Hurt
Deep Double Descent: Where Bigger Models and More Data HurtInternational Conference on Learning Representations (ICLR), 2019
Preetum Nakkiran
Gal Kaplun
Yamini Bansal
Tristan Yang
Boaz Barak
Ilya Sutskever
562
1,091
0
04 Dec 2019
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning LibraryNeural Information Processing Systems (NeurIPS), 2019
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
1.1K
50,986
0
03 Dec 2019
Kernel and Rich Regimes in Overparametrized ModelsAnnual Conference Computational Learning Theory (COLT), 2019
Blake E. Woodworth
Suriya Gunasekar
Pedro H. P. Savarese
E. Moroshko
Itay Golan
Jason D. Lee
Daniel Soudry
Nathan Srebro
524
404
0
13 Jun 2019
Decoupling Gating from Linearity
Decoupling Gating from Linearity
Jonathan Fiat
Eran Malach
Shai Shalev-Shwartz
271
31
0
12 Jun 2019
Greedy Layerwise Learning Can Scale to ImageNet
Greedy Layerwise Learning Can Scale to ImageNet
Eugene Belilovsky
Michael Eickenberg
Edouard Oyallon
605
206
0
29 Dec 2018
Reconciling modern machine learning practice and the bias-variance
  trade-off
Reconciling modern machine learning practice and the bias-variance trade-off
M. Belkin
Daniel J. Hsu
Siyuan Ma
Soumik Mandal
1.0K
1,959
0
28 Dec 2018
Deep Reinforcement Learning that Matters
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
769
2,255
0
19 Sep 2017
Geometry of Optimization and Implicit Regularization in Deep Learning
Geometry of Optimization and Implicit Regularization in Deep Learning
Behnam Neyshabur
Ryota Tomioka
Ruslan Salakhutdinov
Nathan Srebro
AI4CE
247
140
0
08 May 2017
Input Convex Neural Networks
Input Convex Neural Networks
Brandon Amos
Lei Xu
J. Zico Kolter
1.1K
779
0
22 Sep 2016
TensorFlow: A system for large-scale machine learning
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhiwen Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNNAI4CE
1.2K
19,463
0
27 May 2016
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor
  Decomposition
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor Decomposition
Rong Ge
Furong Huang
Chi Jin
Yang Yuan
553
1,121
0
06 Mar 2015
Breaking the Curse of Dimensionality with Convex Neural Networks
Breaking the Curse of Dimensionality with Convex Neural NetworksJournal of machine learning research (JMLR), 2014
Francis R. Bach
616
782
0
30 Dec 2014
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic OptimizationInternational Conference on Learning Representations (ICLR), 2014
Diederik P. Kingma
Jimmy Ba
ODL
5.0K
164,701
0
22 Dec 2014
12
Next
Page 1 of 2