ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08933
  4. Cited By
Wavesplit: End-to-End Speech Separation by Speaker Clustering

Wavesplit: End-to-End Speech Separation by Speaker Clustering

20 February 2020
Neil Zeghidour
David Grangier
    VLM
ArXivPDFHTML

Papers citing "Wavesplit: End-to-End Speech Separation by Speaker Clustering"

26 / 26 papers shown
Title
Listen to Extract: Onset-Prompted Target Speaker Extraction
Listen to Extract: Onset-Prompted Target Speaker Extraction
Pengjie Shen
Kangrui Chen
Shulin He
Pengru Chen
Shuqi Yuan
He Kong
Xueliang Zhang
Z. Wang
46
0
0
08 May 2025
Beyond Speaker Identity: Text Guided Target Speech Extraction
Beyond Speaker Identity: Text Guided Target Speech Extraction
Mingyue Huo
Abhinav Jain
Cong Phuoc Huynh
Fanjie Kong
Pichao Wang
Zhu Liu
Vimal Bhat
42
0
0
17 Jan 2025
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by
  Magnitude Conditioning
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning
Kuan-Hsun Ho
J. Hung
Berlin Chen
34
0
0
04 Mar 2024
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down
  Fusion
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion
Samuel Pegg
Kai Li
Xiaolin Hu
24
1
0
25 Jan 2024
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for
  Enhanced Time-Domain Monaural Speech Separation
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
Hao Wang
Trung Hieu Nguyen
Kun Zhou
J. Yip
Dianwen Ng
Bin Ma
8
21
0
19 Dec 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy
  Reverberant Acoustic Environments
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
William Ravenscroft
Stefan Goetze
Thomas Hain
25
7
0
09 Oct 2023
Complete and separate: Conditional separation with missing target source
  attribute completion
Complete and separate: Conditional separation with missing target source attribute completion
Dimitrios Bralios
Efthymios Tzinis
Paris Smaragdis
27
0
0
27 Jul 2023
Mixture Encoder for Joint Speech Separation and Recognition
Mixture Encoder for Joint Speech Separation and Recognition
Simon Berger
Peter Vieting
Christoph Boeddeker
Ralf Schluter
Reinhold Häb-Umbach
11
6
0
21 Jun 2023
An Experimental Review of Speaker Diarization methods with application
  to Two-Speaker Conversational Telephone Speech recordings
An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings
L. Serafini
Samuele Cornell
Giovanni Morrone
Enrico Zovato
A. Brutti
S. Squartini
27
9
0
29 May 2023
Multi-Dimensional and Multi-Scale Modeling for Speech Separation
  Optimized by Discriminative Learning
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning
Zhaoxi Mu
Xinyu Yang
Wenjing Zhu
14
5
0
07 Mar 2023
Improving Target Speaker Extraction with Sparse LDA-transformed Speaker
  Embeddings
Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings
Kai Liu
Xucheng Wan
Z.C. Du
Huan Zhou
VLM
20
1
0
16 Jan 2023
Deep neural network techniques for monaural speech enhancement: state of
  the art analysis
Deep neural network techniques for monaural speech enhancement: state of the art analysis
P. Ochieng
28
21
0
01 Dec 2022
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech
  Separation
TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Zhongqiu Wang
Samuele Cornell
Shukjae Choi
Younglo Lee
Byeonghak Kim
Shinji Watanabe
15
116
0
22 Nov 2022
Diffusion-based Generative Speech Source Separation
Diffusion-based Generative Speech Source Separation
Robin Scheibler
Youna Ji
Soo-Whan Chung
J. Byun
Soyeon Choe
Min-Seok Choi
DiffM
14
38
0
31 Oct 2022
Deformable Temporal Convolutional Networks for Monaural Noisy
  Reverberant Speech Separation
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft
Stefan Goetze
Thomas Hain
25
11
0
27 Oct 2022
SepIt: Approaching a Single Channel Speech Separation Bound
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
40
27
0
24 May 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain
  Target Speaker Extraction
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
11
17
0
31 Mar 2022
Speaker Extraction with Co-Speech Gestures Cue
Speaker Extraction with Co-Speech Gestures Cue
Zexu Pan
Xinyuan Qian
Haizhou Li
SLR
16
26
0
31 Mar 2022
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation
  Invariant Training
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training
Ertuğ Karamatlı
S. Kırbız
SSL
15
9
0
08 Feb 2022
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel
  Neural Separation Systems
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems
Yi Luo
16
14
0
07 Dec 2021
Progressive Learning for Stabilizing Label Selection in Speech
  Separation with Mapping-based Method
Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method
Chenyang Gao
Yue Gu
I. Marsic
16
0
0
20 Oct 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
22
63
0
20 Jun 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with
  Improved Numerical Stability and Advanced Frontend
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Wangyou Zhang
Christoph Boeddeker
Shinji Watanabe
Tomohiro Nakatani
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Naoyuki Kamo
Reinhold Haeb-Umbach
Y. Qian
6
32
0
23 Feb 2021
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
25
127
0
14 Jul 2020
Voice Separation with an Unknown Number of Multiple Speakers
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
11
175
0
29 Feb 2020
End-to-End Speech Separation with Unfolded Iterative Phase
  Reconstruction
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
J. Hershey
88
123
0
26 Apr 2018
1