ResearchTrend.AI
Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation

22 May 2025
Ofir Yaish
Yehuda Mishaly
Eliya Nachmani
arXiv:2505.16911

Papers citing "Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation"

46 / 46 papers shown
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers
Heitor R. Guimarães
Jiaqi Su
Rithesh Kumar
Tiago H. Falk
Zeyu Jin
DiffM
83
3
0
13 Apr 2025
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
38
3
0
17 Oct 2024
SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Da Mu
Zhicheng Zhang
Haobo Yue
Zehao Wang
Jin Tang
Jianqin Yin
Mamba
68
4
0
09 Aug 2024
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis
Xilin Jiang
Yinghao Aaron Li
Adrian Nicolas Florea
Cong Han
N. Mesgarani
Mamba
97
14
0
13 Jul 2024
Open-Source Conversational AI with SpeechBrain 1.0
Mirco Ravanelli
Titouan Parcollet
Adel Moumen
Sylvain de Langen
Cem Subakan
...
Salima Mdhaffar
G. Laperriere
Mickael Rouvier
Renato De Mori
Yannick Esteve
VLM
144
16
0
29 Jun 2024
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Yujie Chen
Jiangyan Yi
Jun Xue
Chenglong Wang
Xiaohui Zhang
Shunbo Dong
Siding Zeng
Jianhua Tao
Lv Zhao
Cunhang Fan
Mamba
100
20
0
10 Jun 2024
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
Mehmet Hamza Erol
Arda Senocak
Jiu Feng
Joon Son Chung
Mamba
139
25
0
05 Jun 2024
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations
Sarthak Yadav
Zheng-Hua Tan
Mamba
81
17
0
04 Jun 2024
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
Tri Dao
Albert Gu
Mamba
145
542
0
31 May 2024
Audio Mamba: Pretrained Audio State Space Model For Audio Tagging
Jiaju Lin
Haoxuan Hu
Mamba
51
9
0
22 May 2024
Mamba in Speech: Towards an Alternative to Self-Attention
Xiangyu Zhang
Qiquan Zhang
Hexin Liu
Tianyi Xiao
Xinyuan Qian
Beena Ahmed
E. Ambikairajah
Haizhou Li
Julien Epps
Mamba
142
47
0
21 May 2024
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
Siavash Shams
Sukru Samet Dindar
Xilin Jiang
N. Mesgarani
Mamba
124
22
0
20 May 2024
An Investigation of Incorporating Mamba for Speech Enhancement
Rong-Yu Chao
Wen-Huang Cheng
Moreno La Quatra
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Szu-Wei Fu
Yu Tsao
Mamba
115
37
0
10 May 2024
SPMamba: State-space model is all you need in speech separation
Kai Li
Guo Chen
Mamba
89
26
0
02 Apr 2024
Jamba: A Hybrid Transformer-Mamba Language Model
Opher Lieber
Barak Lenz
Hofit Bata
Gal Cohen
Jhonathan Osin
...
Nir Ratner
N. Rozen
Erez Shwartz
Mor Zusman
Y. Shoham
120
227
0
28 Mar 2024
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation
Xilin Jiang
Cong Han
N. Mesgarani
Mamba
104
49
0
27 Mar 2024
Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers
Changsheng Quan
Xiaofei Li
120
27
0
12 Mar 2024
Unsupervised learning based end-to-end delayless generative fixed-filter active noise control
Zheng-wu Luo
Dongyuan Shi
Xiaoyi Shen
Woon-Seng Gan
38
3
0
08 Feb 2024
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu
Tri Dao
Mamba
163
2,826
0
01 Dec 2023
Diffusion-based speech enhancement with a weighted generative-supervised learning loss
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
DiffM
72
10
0
19 Sep 2023
Deep Generative Fixed-filter Active Noise Control
Zheng-wu Luo
Dongyuan Shi
Xiaoyi Shen
Junwei Ji
W. Gan
36
15
0
10 Mar 2023
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Jean-Marie Lemercier
Julius Richter
Simon Welker
Timo Gerkmann
DiffM
248
92
0
22 Dec 2022
Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement
Shucong Zhang
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
57
5
0
08 Nov 2022
High Fidelity Neural Audio Compression
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
123
674
0
24 Oct 2022
A Hybrid SFANC-FxNLMS Algorithm for Active Noise Control based on Deep Learning
Zheng-wu Luo
Dongyuan Shi
W. Gan
42
37
0
17 Aug 2022
Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Julius Richter
Simon Welker
Jean-Marie Lemercier
Bunlong Lay
Timo Gerkmann
DiffM
93
207
0
11 Aug 2022
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Danilo de Oliveira
Tal Peer
Timo Gerkmann
61
21
0
23 Jun 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
Haohe Liu
Xubo Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
76
59
0
12 Apr 2022
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain
Simon Welker
Julius Richter
Timo Gerkmann
DiffM
112
117
0
31 Mar 2022
CMGAN: Conformer-based Metric GAN for Speech Enhancement
Ru Cao
Sherif Abdulatif
Bin Yang
111
100
0
28 Mar 2022
Conditional Diffusion Probabilistic Model for Speech Enhancement
Yen-Ju Lu
Zhongqiu Wang
Shinji Watanabe
Alexander Richard
Cheng Yu
Yu Tsao
DiffM
79
191
0
10 Feb 2022
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
116
806
0
07 Jul 2021
Nonlinear Acoustic Echo Cancellation with Deep Learning
Amir Ivry
Israel Cohen
B. Berdugo
61
21
0
25 Jun 2021
Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization
Ashutosh Pandey
DeLiang Wang
39
45
0
26 May 2021
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
Szu-Wei Fu
Cheng Yu
Tsun-An Hsieh
Peter William VanHarn Plantinga
Mirco Ravanelli
Xugang Lu
Yu Tsao
87
218
0
08 Apr 2021
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain
Kai Wang
Bengbeng He
Weiping Zhu
105
170
0
18 Mar 2021
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
107
567
0
25 Oct 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
Yanxin Hu
Yun Liu
Shubo Lv
Mengtao Xing
Shimin Zhang
Yihui Fu
Jian Wu
Bihong Zhang
Lei Xie
136
599
0
01 Aug 2020
Real Time Speech Enhancement in the Waveform Domain
Alexandre Défossez
Gabriel Synnaeve
Yossi Adi
109
466
0
23 Jun 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
231
3,177
0
16 May 2020
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
127
776
0
14 Oct 2019
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
Szu-Wei Fu
Chien-Feng Liao
Yu Tsao
Shou-De Lin
73
331
0
13 May 2019
A Wavenet for Speech Denoising
Dario Rethage
Jordi Pons
Xavier Serra
157
432
0
22 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
933
133,201
0
12 Jun 2017
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual
Antonio Bonafonte
Joan Serrà
GAN
100
1,149
0
28 Mar 2017
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.3K
150,586
0
22 Dec 2014