Wavesplit: End-to-End Speech Separation by Speaker Clustering

20 February 2020

Papers citing "Wavesplit: End-to-End Speech Separation by Speaker Clustering"

26 / 26 papers shown

Title
Listen to Extract: Onset-Prompted Target Speaker Extraction Pengjie Shen Kangrui Chen Shulin He Pengru Chen Shuqi Yuan He Kong Xueliang Zhang Z. Wang 46 0 0 08 May 2025
Beyond Speaker Identity: Text Guided Target Speech Extraction Mingyue Huo Abhinav Jain Cong Phuoc Huynh Fanjie Kong Pichao Wang Zhu Liu Vimal Bhat 42 0 0 17 Jan 2025
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning Kuan-Hsun Ho J. Hung Berlin Chen 34 0 0 04 Mar 2024
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion Samuel Pegg Kai Li Xiaolin Hu 24 1 0 25 Jan 2024
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation Shengkui Zhao Yukun Ma Chongjia Ni Chong Zhang Hao Wang Trung Hieu Nguyen Kun Zhou J. Yip Dianwen Ng Bin Ma 8 21 0 19 Dec 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments William Ravenscroft Stefan Goetze Thomas Hain 25 7 0 09 Oct 2023
Complete and separate: Conditional separation with missing target source attribute completion Dimitrios Bralios Efthymios Tzinis Paris Smaragdis 27 0 0 27 Jul 2023
Mixture Encoder for Joint Speech Separation and Recognition Simon Berger Peter Vieting Christoph Boeddeker Ralf Schluter Reinhold Häb-Umbach 11 6 0 21 Jun 2023
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings L. Serafini Samuele Cornell Giovanni Morrone Enrico Zovato A. Brutti S. Squartini 27 9 0 29 May 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning Zhaoxi Mu Xinyu Yang Wenjing Zhu 14 5 0 07 Mar 2023
Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings Kai Liu Xucheng Wan Z.C. Du Huan Zhou VLM 20 1 0 16 Jan 2023
Deep neural network techniques for monaural speech enhancement: state of the art analysis P. Ochieng 28 21 0 01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation Zhongqiu Wang Samuele Cornell Shukjae Choi Younglo Lee Byeonghak Kim Shinji Watanabe 15 116 0 22 Nov 2022
Diffusion-based Generative Speech Source Separation Robin Scheibler Youna Ji Soo-Whan Chung J. Byun Soyeon Choe Min-Seok Choi DiffM 14 38 0 31 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation William Ravenscroft Stefan Goetze Thomas Hain 25 11 0 27 Oct 2022
SepIt: Approaching a Single Channel Speech Separation Bound Shahar Lutati Eliya Nachmani Lior Wolf VLM 40 27 0 24 May 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Zexu Pan Meng Ge Haizhou Li 11 17 0 31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue Zexu Pan Xinyuan Qian Haizhou Li SLR 16 26 0 31 Mar 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training Ertuğ Karamatlı S. Kırbız SSL 15 9 0 08 Feb 2022
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems Yi Luo 16 14 0 07 Dec 2021
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method Chenyang Gao Yue Gu I. Marsic 16 0 0 20 Oct 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization Shota Horiguchi Yusuke Fujita Shinji Watanabe Yawen Xue Leibny Paola García-Perera 22 63 0 20 Jun 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend Wangyou Zhang Christoph Boeddeker Shinji Watanabe Tomohiro Nakatani Marc Delcroix K. Kinoshita Tsubasa Ochiai Naoyuki Kamo Reinhold Haeb-Umbach Y. Qian 6 32 0 23 Feb 2021
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation Efthymios Tzinis Zhepei Wang Paris Smaragdis 25 127 0 14 Jul 2020
Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani Yossi Adi Lior Wolf 11 175 0 29 Feb 2020
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction Zhong-Qiu Wang Jonathan Le Roux DeLiang Wang J. Hershey 88 123 0 26 Apr 2018