ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.02824
  4. Cited By
Layer-Adaptive State Pruning for Deep State Space Models
v1v2v3 (latest)

Layer-Adaptive State Pruning for Deep State Space Models

Neural Information Processing Systems (NeurIPS), 2024
5 November 2024
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
ArXiv (abs)PDFHTML

Papers citing "Layer-Adaptive State Pruning for Deep State Space Models"

28 / 28 papers shown
Hankel Singular Value Regularization for Highly Compressible State Space Models
Hankel Singular Value Regularization for Highly Compressible State Space Models
Paul Schwerdtner
Jules Berman
Benjamin Peherstorfer
185
1
0
27 Oct 2025
A Deep State-Space Model Compression Method using Upper Bound on Output Error
A Deep State-Space Model Compression Method using Upper Bound on Output Error
Hiroki Sakamoto
Kazuhiro Sato
61
0
0
16 Oct 2025
Uncovering the Spectral Bias in Diagonal State Space Models
Uncovering the Spectral Bias in Diagonal State Space Models
Rubén Solozabal
Velibor Bojkovic
Hilal AlQuabeh
Kentaro Inui
Martin Takáč
109
1
0
28 Aug 2025
Compression Method for Deep Diagonal State Space Model Based on $H^2$ Optimal Reduction
Compression Method for Deep Diagonal State Space Model Based on H2H^2H2 Optimal ReductionIEEE Control Systems Letters (L-CSS), 2025
Hiroki Sakamoto
Kazuhiro Sato
144
3
0
14 Jul 2025
State-Free Inference of State-Space Models: The Transfer Function
  Approach
State-Free Inference of State-Space Models: The Transfer Function ApproachInternational Conference on Machine Learning (ICML), 2024
Rom N. Parnichkun
Stefano Massaroli
Alessandro Moro
Jimmy T.H. Smith
Ramin Hasani
...
Hajime Asama
Stefano Ermon
Taiji Suzuki
Atsushi Yamashita
Michael Poli
271
15
0
10 May 2024
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu
Tri Dao
Mamba
564
5,271
0
01 Dec 2023
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep
  Neural Networks
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural NetworksIEEE International Conference on Computer Vision (ICCV), 2023
Kaixin Xu
Zhe Wang
Xue Geng
Jie Lin
Ruibing Jin
Xiaoli Li
Weisi Lin
138
21
0
21 Aug 2023
Effectively Modeling Time Series with Simple Discrete State Spaces
Effectively Modeling Time Series with Simple Discrete State SpacesInternational Conference on Learning Representations (ICLR), 2023
Michael Zhang
Khaled Kamal Saab
Michael Poli
Tri Dao
Karan Goel
Christopher Ré
AI4TS
150
69
0
16 Mar 2023
On the Parameterization and Initialization of Diagonal State Space
  Models
On the Parameterization and Initialization of Diagonal State Space ModelsNeural Information Processing Systems (NeurIPS), 2022
Albert Gu
Ankit Gupta
Karan Goel
Christopher Ré
413
473
0
23 Jun 2022
Diagonal State Spaces are as Effective as Structured State Spaces
Diagonal State Spaces are as Effective as Structured State SpacesNeural Information Processing Systems (NeurIPS), 2022
Ankit Gupta
Albert Gu
Jonathan Berant
392
411
0
27 Mar 2022
It's Raw! Audio Generation with State-Space Models
It's Raw! Audio Generation with State-Space ModelsInternational Conference on Machine Learning (ICML), 2022
Karan Goel
Albert Gu
Chris Donahue
Christopher Ré
261
233
0
20 Feb 2022
Efficiently Modeling Long Sequences with Structured State Spaces
Efficiently Modeling Long Sequences with Structured State SpacesInternational Conference on Learning Representations (ICLR), 2021
Albert Gu
Karan Goel
Christopher Ré
1.0K
2,871
0
31 Oct 2021
Combining Recurrent, Convolutional, and Continuous-time Models with
  Linear State-Space Layers
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers
Albert Gu
Isys Johnson
Karan Goel
Khaled Kamal Saab
Tri Dao
Atri Rudra
Christopher Ré
294
935
0
26 Oct 2021
Long Range Arena: A Benchmark for Efficient Transformers
Long Range Arena: A Benchmark for Efficient Transformers
Yi Tay
Mostafa Dehghani
Samira Abnar
Songlin Yang
Dara Bahri
Philip Pham
J. Rao
Liu Yang
Sebastian Ruder
Donald Metzler
383
832
0
08 Nov 2020
Layer-adaptive sparsity for the Magnitude-based Pruning
Layer-adaptive sparsity for the Magnitude-based Pruning
Jaeho Lee
Sejun Park
Sangwoo Mo
SungSoo Ahn
Jinwoo Shin
252
300
0
15 Oct 2020
HiPPO: Recurrent Memory with Optimal Polynomial Projections
HiPPO: Recurrent Memory with Optimal Polynomial Projections
Albert Gu
Tri Dao
Stefano Ermon
Atri Rudra
Christopher Ré
398
805
0
17 Aug 2020
Rigging the Lottery: Making All Tickets Winners
Rigging the Lottery: Making All Tickets WinnersInternational Conference on Machine Learning (ICML), 2019
Utku Evci
Trevor Gale
Jacob Menick
Pablo Samuel Castro
Erich Elsen
537
686
0
25 Nov 2019
One ticket to win them all: generalizing lottery ticket initializations
  across datasets and optimizers
One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizersNeural Information Processing Systems (NeurIPS), 2019
Ari S. Morcos
Haonan Yu
Michela Paganini
Yuandong Tian
197
244
0
06 Jun 2019
The State of Sparsity in Deep Neural Networks
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
385
839
0
25 Feb 2019
Learning long-range spatial dependencies with horizontal gated-recurrent
  units
Learning long-range spatial dependencies with horizontal gated-recurrent units
Drew Linsley
Junkyung Kim
Vijay Veerabadran
Thomas Serre
315
170
0
21 May 2018
ListOps: A Diagnostic Dataset for Latent Tree Learning
ListOps: A Diagnostic Dataset for Latent Tree Learning
Nikita Nangia
Samuel R. Bowman
254
151
0
17 Apr 2018
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
Pete Warden
235
1,850
0
09 Apr 2018
To prune, or not to prune: exploring the efficacy of pruning for model
  compression
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
355
1,405
0
05 Oct 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
637
2,683
0
19 Jul 2017
Scalable Training of Artificial Neural Networks with Adaptive Sparse
  Connectivity inspired by Network Science
Scalable Training of Artificial Neural Networks with Adaptive Sparse Connectivity inspired by Network Science
Decebal Constantin Mocanu
Elena Mocanu
Peter Stone
Phuong H. Nguyen
M. Gibescu
A. Liotta
402
697
0
15 Jul 2017
Gaussian Error Linear Units (GELUs)
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
971
6,140
0
27 Jun 2016
Learning both Weights and Connections for Efficient Neural Networks
Learning both Weights and Connections for Efficient Neural NetworksNeural Information Processing Systems (NeurIPS), 2015
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
564
7,320
0
08 Jun 2015
Norm-Based Capacity Control in Neural Networks
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
883
635
0
27 Feb 2015
1