v1v2 (latest)

SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization

27 February 2024

Vahab Mirrokni

Papers citing "SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization"

36 / 36 papers shown

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

Denis Kuznedelev

Dan Alistarh

335

03 Aug 2023

PDP: Parameter-free Differentiable Pruning is All You NeedNeural Information Processing Systems (NeurIPS), 2023

Minsik Cho

Saurabh N. Adya

Devang Naik

VLM

250

18 May 2023

Fast as CHITA: Neural Network Pruning with Combinatorial OptimizationInternational Conference on Machine Learning (ICML), 2023

295

28 Feb 2023

SparseGPT: Massive Language Models Can Be Accurately Pruned in One-ShotInternational Conference on Machine Learning (ICML), 2023

Elias Frantar

Dan Alistarh

VLM

587

1,046

02 Jan 2023

Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training?IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

A. Vanderschueren

Christophe De Vleeschouwer

154

02 Dec 2022

MegaBlocks: Efficient Sparse Training with Mixture-of-ExpertsConference on Machine Learning and Systems (MLSys), 2022

Deepak Narayanan

206

160

29 Nov 2022

CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision ModelsNeural Information Processing Systems (NeurIPS), 2022

Denis Kuznedelev

Eldar Kurtic

Elias Frantar

Dan Alistarh

VLM ViT

174

14 Oct 2022

Sequential Attention for Feature SelectionInternational Conference on Learning Representations (ICLR), 2022

Vahab Mirrokni

342

29 Sep 2022

Hardness and Algorithms for Robust and Sparse OptimizationInternational Conference on Machine Learning (ICML), 2022

Eric Price

Sandeep Silwal

Samson Zhou

234

29 Jun 2022

Sparse-Group Log-Sum Penalized Graphical Model Learning For Time SeriesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Jitendra Tugnait

212

29 Apr 2022

Iterative Hard Thresholding with Adaptive Regularization: Sparser Solutions Without Sacrificing RuntimeInternational Conference on Machine Learning (ICML), 2022

Kyriakos Axiotis

M. Sviridenko

131

11 Apr 2022

Data-Efficient Structured Pruning via Submodular OptimizationNeural Information Processing Systems (NeurIPS), 2022

Marwa El Halabi

Suraj Srinivas

Damien Scieur

407

09 Mar 2022

OptG: Optimizing Gradient-driven Criteria in Network Sparsity

471

30 Jan 2022

Powerpropagation: A sparsity inducing weight reparameterisation

Jonathan Richard Schwarz

Siddhant M. Jayakumar

Razvan Pascanu

P. Latham

Yee Whye Teh

383

01 Oct 2021

S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN AccelerationInternational Symposium on High-Performance Computer Architecture (HPCA), 2021

Zhi-Gang Liu

P. Whatmough

Yuhao Zhu

Matthew Mattina

197

102

16 Jul 2021

AC/DC: Alternating Compressed/DeCompressed Training of Deep Neural Networks

Dan Alistarh

596

23 Jun 2021

Sparse Training via Boosting Pruning Plasticity with NeuroregenerationNeural Information Processing Systems (NeurIPS), 2021

Lu Yin

Decebal Constantin Mocanu

332

133

19 Jun 2021

The Fine-Grained Hardness of Sparse Linear Regression

A. Gupte

Vinod Vaikuntanathan

195

06 Jun 2021

Operation-Aware Soft Channel Pruning using Differentiable MasksInternational Conference on Machine Learning (ICML), 2020

Minsoo Kang

Bohyung Han

AAML

197

160

08 Jul 2020

Sparse Convex Optimization via Adaptively Regularized Hard Thresholding

Kyriakos Axiotis

M. Sviridenko

264

25 Jun 2020

Movement Pruning: Adaptive Sparsity by Fine-Tuning

Victor Sanh

Thomas Wolf

Alexander M. Rush

386

557

15 May 2020

Winning the Lottery with Continuous SparsificationNeural Information Processing Systems (NeurIPS), 2019

Pedro H. P. Savarese

Hugo Silva

Michael Maire

383

150

10 Dec 2019

Rigging the Lottery: Making All Tickets WinnersInternational Conference on Machine Learning (ICML), 2019

537

686

25 Nov 2019

Implicit Regularization for Optimal Sparse RecoveryNeural Information Processing Systems (NeurIPS), 2019

Tomas Vaskevicius

Varun Kanade

Patrick Rebeschini

177

112

11 Sep 2019

Differentiable Mask for Pruning Convolutional and Recurrent NetworksCanadian Conference on Computer and Robot Vision (CRV), 2019

177

10 Sep 2019

Deep Learning Recommendation Model for Personalization and Recommendation Systems

...

253

844

31 May 2019

Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be PrunedAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

742

1,345

23 May 2019

DARTS: Differentiable Architecture Search

Hanxiao Liu

Karen Simonyan

Yiming Yang

788

4,746

24 Jun 2018

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

Jonathan Frankle

Michael Carbin

987

3,927

09 Mar 2018

Attribution Modeling Increases Efficiency of Bidding in Display Advertising

196

20 Jul 2017

Attention Is All You NeedNeural Information Processing Systems (NeurIPS), 2017

4.3K

162,388

12 Jun 2017

Restricted Strong Convexity Implies Weak Submodularity

275

162

02 Dec 2016

$Lasso, fractional norm and structured sparse estimation using a Hadamard product parametrization$

Lasso, fractional norm and structured sparse estimation using a Hadamard product parametrization

P. Hoff

390

31 Oct 2016

Learning Structured Sparsity in Deep Neural Networks

Yiran Chen

534

2,456

12 Aug 2016

Structured Pruning of Deep Convolutional Neural Networks

S. Anwar

Kyuyeon Hwang

Wonyong Sung

338

789

29 Dec 2015

Learning both Weights and Connections for Efficient Neural NetworksNeural Information Processing Systems (NeurIPS), 2015

Song Han

567

7,332

08 Jun 2015