ResearchTrend.AI

Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks

AAAI Conference on Artificial Intelligence (AAAI), 2020
16 December 2020
Xiangyu Chang, Yingcong Li, Samet Oymak, Christos Thrampoulidis
Papers citing "Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks"

40 papers shown
One-Bit Quantization for Random Features Models
D. Akhtiamov, Reza Ghane, B. Hassibi
17 Oct 2025

Information-Theoretic Criteria for Knowledge Distillation in Multimodal Learning
Rongrong Xie, Yizhou Xu, Guido Sanguinetti
15 Oct 2025

Optimal Regularization for Performative Learning
Edwige Cyffers, Alireza Mirrokni, Marco Mondelli
14 Oct 2025

High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei, Filip Kovačević, Francesco Locatello, Marco Mondelli
09 Oct 2025

Optimal Implicit Bias in Linear Regression
K. N. Varma, Babak Hassibi
20 Jun 2025

MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He, Siqi Zeng, Yuzheng Hu, Rui Yang, Tong Zhang, Han Zhao
16 May 2025

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro, Steven Abreu, Jonathan Timcheck, Philipp Stratmann, Andreas Wild, S. Shrestha
03 Feb 2025

Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
Simone Bombari, Marco Mondelli
03 Feb 2025

Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou, Suvrit Sra, A. Yurtsever
27 Jan 2025

Beyond adaptive gradient: Fast-Controlled Minibatch Algorithm for large-scale optimization
Corrado Coppola, Lorenzo Papa, Irene Amerini, L. Palagi
24 Nov 2024

High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
International Conference on Learning Representations (ICLR), 2024
M. E. Ildiz, Halil Alperen Gozeten, Ege Onur Taga, Marco Mondelli, Samet Oymak
24 Oct 2024

Precise asymptotics of reweighted least-squares algorithms for linear diagonal networks
Chiraag Kaushik, Justin Romberg, Vidya Muthukumar
04 Jun 2024

Occam Gradient Descent
B. N. Kausik
30 May 2024

Class-wise Activation Unravelling the Engima of Deep Double Descent
Yufei Gu
13 May 2024

Masks, Signs, And Learning Rate Rewinding
Advait Gadhikar, R. Burkholz
29 Feb 2024

Understanding the Role of Optimization in Double Descent
Chris Yuhao Liu, Jeffrey Flanigan
06 Dec 2023

Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics
Soo Min Kwon, Zekai Zhang, Dogyoon Song, Laura Balzano, Qing Qu
08 Nov 2023

Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space
Yufei Gu, Xiaoqing Zheng, T. Aste
20 Oct 2023

The Quest of Finding the Antidote to Sparse Double Descent
Victor Quétu, Marta Milovanović
31 Aug 2023

DSD²: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
AAAI Conference on Artificial Intelligence (AAAI), 2023
Victor Quétu, Enzo Tartaglione
02 Mar 2023

Can we avoid Double Descent in Deep Neural Networks?
IEEE International Conference on Image Processing (ICIP), 2023
Victor Quétu, Enzo Tartaglione
26 Feb 2023

Precise Asymptotic Analysis of Deep Random Feature Models
Annual Conference on Computational Learning Theory (COLT), 2023
David Bosch, Ashkan Panahi, B. Hassibi
13 Feb 2023

Strong inductive biases provably prevent harmless interpolation
International Conference on Learning Representations (ICLR), 2023
Michael Aerni, Marco Milanta, Konstantin Donhauser, Fanny Yang
18 Jan 2023

Why Random Pruning Is All We Need to Start Sparse
International Conference on Machine Learning (ICML), 2022
Advait Gadhikar, Sohom Mukherjee, R. Burkholz
05 Oct 2022

Deep Double Descent via Smooth Interpolation
Matteo Gamba, Erik Englesson, Mårten Björkman, Hossein Azizpour
21 Sep 2022

Overparameterization from Computational Constraints
Neural Information Processing Systems (NeurIPS), 2022
Sanjam Garg, S. Jha, Saeed Mahloujifar, Mohammad Mahmoody, Mingyuan Wang
27 Aug 2022

Sparse Double Descent: Where Network Pruning Aggravates Overfitting
International Conference on Machine Learning (ICML), 2022
Zhengqi He, Zeke Xie, Quanzhi Zhu, Zengchang Qin
17 Jun 2022

Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis
Neural Information Processing Systems (NeurIPS), 2022
Wuyang Chen, Wei-Ping Huang, Xinyu Gong, Boris Hanin, Zinan Lin
11 May 2022

Random Features Model with General Convex Regularization: A Fine Grained Analysis with Precise Asymptotic Learning Curves
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
David Bosch, Ashkan Panahi, Ayça Özçelikkale, Devdatt Dubhash
06 Apr 2022

Provable and Efficient Continual Representation Learning
Yingcong Li, Mingchen Li, M. Salman Asif, Samet Oymak
03 Mar 2022

Towards Sample-efficient Overparameterized Meta-learning
Neural Information Processing Systems (NeurIPS), 2022
Yue Sun, Adhyyan Narang, Halil Ibrahim Gulluk, Samet Oymak, Maryam Fazel
16 Jan 2022

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning
Yehuda Dar, Vidya Muthukumar, Richard G. Baraniuk
06 Sep 2021

How much pre-training is enough to discover a good subnetwork?
Cameron R. Wolfe, Fangshuo Liao, Qihan Wang, Junhyung Lyle Kim, Anastasios Kyrillidis
31 Jul 2021

Spectral Pruning for Recurrent Neural Networks
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Takashi Furuya, Kazuma Suetake, K. Taniguchi, Hiroyuki Kusumoto, Ryuji Saiin, Tomohiro Daimon
23 May 2021

Generalization Guarantees for Neural Architecture Search with Train-Validation Split
International Conference on Machine Learning (ICML), 2021
Samet Oymak, Mingchen Li, Mahdi Soltanolkotabi
29 Apr 2021

Lottery Jackpots Exist in Pre-trained Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Yuxin Zhang, Mingbao Lin, Yan Wang, Jiayi Ji, Rongrong Ji
18 Apr 2021

Label-Imbalanced and Group-Sensitive Classification under Overparameterization
Neural Information Processing Systems (NeurIPS), 2021
Ganesh Ramachandra Kini, Orestis Paraskevas, Samet Oymak, Christos Thrampoulidis
02 Mar 2021

Distilling Double Descent
Andrew Cotter, A. Menon, Harikrishna Narasimhan, A. S. Rawat, Sashank J. Reddi, Yichen Zhou
13 Feb 2021

Binary Classification of Gaussian Mixtures: Abundance of Support Vectors, Benign Overfitting and Regularization
SIAM Journal on Mathematics of Data Science (SIMODS), 2020
Ke Wang, Christos Thrampoulidis
18 Nov 2020

Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Neural Information Processing Systems (NeurIPS), 2020
Ankit Pensia, Shashank Rajput, Alliot Nagle, Harit Vishwakarma, Dimitris Papailiopoulos
14 Jun 2020