v1v2 (latest)

Wavesplit: End-to-End Speech Separation by Speaker Clustering

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

20 February 2020

Papers citing "Wavesplit: End-to-End Speech Separation by Speaker Clustering"

50 / 149 papers shown

MARS-Sep: Multimodal-Aligned Reinforced Sound Separation

149

12 Oct 2025

Neural Speech Separation with Parallel Amplitude and Phase Spectrum Estimation

Fei Liu

Yang Ai

Zhen-Hua Ling

226

17 Sep 2025

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References

Simon Dahl Jepsen

M. G. Christensen

Jesper Rindom Jensen

187

20 Aug 2025

Advances in Speech Separation: Techniques, Challenges, and Future Trends

...

230

14 Aug 2025

SpectroStream: A Versatile Neural Codec for General Audio

115

07 Aug 2025

Whilter: A Whisper-based Data Filter for "In-the-Wild" Speech Corpora Using Utterance-level Multi-Task Classification

282

29 Jul 2025

Plug-and-Play Co-Occurring Face Attention for Robust Audio-Visual Speaker Extraction

279

27 May 2025

Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers

212

22 May 2025

Listen to Extract: Onset-Prompted Target Speaker Extraction

394

08 May 2025

SepALM: Audio Language Models Are Error Correctors for Robust Speech SeparationInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

531

06 May 2025

Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech ExtractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

347

13 Mar 2025

EDSep: An Effective Diffusion-Based Method for Speech Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

Jinwei Dong

Xinsheng Wang

Qirong Mao

347

28 Jan 2025

Beyond Speaker Identity: Text Guided Target Speech ExtractionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

282

17 Jan 2025

Task-Aware Unified Source Separation

316

31 Oct 2024

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Zehan Wang

...

Zhou Zhao

261

28 Oct 2024

SepMamba: State-space models for speaker separation using Mamba

Thor Højhus Avenstrup

267

28 Oct 2024

WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker ExtractionInterspeech (Interspeech), 2024

Shuai Wang

Ke Zhang

Shaoxiong Lin

Junjie Li

Xuefei Wang

Meng Ge

Jianwei Yu

Yanmin Qian

Haizhou Li

234

24 Sep 2024

Compositional Audio Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Sripathi Sridhar

Mark Cartwright

AI4TS

540

15 Sep 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

Bang Zeng

Ming Li

489

04 Sep 2024

Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and EvaluationInterspeech (Interspeech), 2024

Kai Chen

Jiaqi Su

Taylor Berg-Kirkpatrick

Shlomo Dubnov

Zeyu Jin

183

28 Aug 2024

TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and EnhancementInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2024

Kohei Saijo

Gordon Wichern

François G. Germain

Zexu Pan

Jonathan Le Roux

190

06 Aug 2024

Towards a Universal Method for Meaningful Signal Detection

Louis Mahon

219

28 Jul 2024

Papez: Resource-Efficient Speech Separation with Auditory Working Memory

Hyunseok Oh

Juheon Yi

Youngki Lee

308

01 Jul 2024

Song Data Cleansing for End-to-End Neural Singer Diarization Using Neural Analysis and Synthesis Framework

Hokuto Munakata

Ryo Terashima

Yusuke Fujita

235

24 Jun 2024

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition

Mohammad Soleymanpour

Anurag Chowdhury

Mark C. Fuhs

307

13 Jun 2024

MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

Ali Behrouz

Michele Santacatterina

Ramin Zabih

525

29 Mar 2024

Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation

Xilin Jiang

Cong Han

N. Mesgarani

Mamba

317

27 Mar 2024

CrossNet: Leveraging Global, Cross-Band, Narrow-Band, and Positional Encoding for Single- and Multi-Channel Speaker Separation

Vahid Ahmadi Kalkhorani

DeLiang Wang

266

06 Mar 2024

ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning

Kuan-Hsun Ho

J. Hung

Berlin Chen

219

04 Mar 2024

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down FusionInternational Conference on Information and Software Technologies (ICIST), 2023

Samuel Pegg

Kai Li

Xiaolin Hu

291

25 Jan 2024

Boosting Unknown-number Speaker Separation with Transformer Decoder-based AttractorIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Shinji Watanabe

211

23 Jan 2024

Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments

Renana Opochinsky

Mordehay Moradi

Sharon Gannot

248

07 Jan 2024

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

324

19 Dec 2023

Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech SeparationInterspeech (Interspeech), 2023

Chenyu Gao

Yue Gu

I. Marsic

330

20 Nov 2023

Multi-channel Conversational Speaker Separation via Neural DiarizationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

H. Taherian

DeLiang Wang

BDL

262

15 Nov 2023

On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic EnvironmentsAutomatic Speech Recognition & Understanding (ASRU), 2023

William Ravenscroft

Stefan Goetze

Thomas Hain

264

09 Oct 2023

SPGM: Prioritizing Local Features for enhanced speech separation performanceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

300

22 Sep 2023

Sampling-Frequency-Independent Universal Sound Separation

Tomohiko Nakamura

Kohei Yatabe

201

22 Sep 2023

Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting TranscriptionSpoken Language Technology Workshop (SLT), 2023

315

15 Sep 2023

Analysis of Speech Separation Performance Degradation on Emotional Speech MixturesAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023

255

14 Sep 2023

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech SeparationInternational Conference on Machine Learning (ICML), 2023

342

16 Aug 2023

Complete and separate: Conditional separation with missing target source attribute completionIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023

Dimitrios Bralios

Efthymios Tzinis

Paris Smaragdis

235

27 Jul 2023

Mixture Encoder for Joint Speech Separation and RecognitionInterspeech (Interspeech), 2023

238

21 Jun 2023

Algorithms of Sampling-Frequency-Independent Layers for Non-integer StridesEuropean Signal Processing Conference (EUSIPCO), 2023

Hiroshi Saruwatari

177

19 Jun 2023

A Teacher-Student approach for extracting informative speaker embeddings from speech mixturesInterspeech (Interspeech), 2023

374

01 Jun 2023

UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training MixturesNeural Information Processing Systems (NeurIPS), 2023

Zhong-Qiu Wang

Shinji Watanabe

298

31 May 2023

An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordingsComputer Speech and Language (CSL), 2023

274

29 May 2023

A Neural State-Space Model Approach to Efficient Speech Separation

Chen Chen

Chao-Han Huck Yang

Kai Li

Yuchen Hu

Pin-Jui Ku

Chng Eng Siong

170

26 May 2023

Towards Solving Cocktail-Party: The First Method to Build a Realistic Dataset with Ground Truths for Speech Separation

Rawad Melhem

Assef Jafar

Oumayma Al Dakkak

234

25 May 2023

Noise-Aware Speech Separation with Contrastive LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yuchen Hu

277

18 May 2023