Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.04306
Cited By
Deep clustering: Discriminative embeddings for segmentation and separation
18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep clustering: Discriminative embeddings for segmentation and separation"
50 / 357 papers shown
Title
Learning to Separate Voices by Spatial Regions
Alan Xu
Romit Roy Choudhury
106
10
0
09 Jul 2022
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation
Xiaoyu Wang
Xiangyu Kong
Xiulian Peng
Yan Lu
62
6
0
04 Jul 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
35
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
99
18
0
19 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
59
7
0
18 Jun 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li
Xiaolin Hu
Yi Luo
72
16
0
15 Jun 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
98
24
0
06 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
131
27
0
24 May 2022
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
38
11
0
09 May 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
74
18
0
24 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
65
6
0
23 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Zifeng Zhao
Rongzhi Gu
Dongchao Yang
Jinchuan Tian
Yuexian Zou
59
2
0
15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
62
21
0
14 Apr 2022
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness
Dianwen Ng
Jing Pang
Yanghua Xiao
Biao Tian
Qiang Fu
Eng Siong Chng
50
2
0
11 Apr 2022
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Kateřina Žmolíková
Hiroshi Sato
Tomohiro Nakatani
71
15
0
11 Apr 2022
Multichannel Speech Separation with Narrow-band Conformer
Changsheng Quan
Xiaofei Li
69
13
0
09 Apr 2022
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
Yasunori Ohishi
S. Araki
VLM
83
34
0
08 Apr 2022
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
92
26
0
07 Apr 2022
End-to-end multi-talker audio-visual ASR using an active speaker attention module
R. Rose
Olivier Siohan
58
3
0
01 Apr 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
72
20
0
31 Mar 2022
Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Zhenhao Jin
Xiang Hao
Xiangdong Su
55
4
0
30 Mar 2022
Learning a Structured Latent Space for Unsupervised Point Cloud Completion
Yingjie Cai
Kwan-Yee Lin
Chao Zhang
Qiang Wang
Xiaogang Wang
Hongsheng Li
3DPC
SSL
95
37
0
29 Mar 2022
Remix-cycle-consistent Learning on Adversarially Learned Separator for Accurate and Stable Unsupervised Speech Separation
Kohei Saijo
Tetsuji Ogawa
46
9
0
26 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
51
3
0
25 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma Institute for Systems Research
30
3
0
08 Mar 2022
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
Eng Siong Chng
Jianwu Dang
Haizhou Li
52
24
0
21 Feb 2022
Multi-Channel Speech Denoising for Machine Ears
Cong Han
Emine Merve Kaya
Kyle Hoefer
M. Slaney
S. Carlile
27
2
0
17 Feb 2022
Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
Jier Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
16
10
0
09 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
66
23
0
06 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
83
27
0
26 Jan 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Naoyuki Kamo
Takafumi Moriya
70
27
0
11 Jan 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Wolfgang Mack
Julian Wechsler
Emanuel Habets
118
11
0
03 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
105
19
0
17 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
67
9
0
10 Dec 2021
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Yicheng Hsu
Yonghan Lee
M. Bai
52
10
0
10 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
91
51
0
04 Dec 2021
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
88
15
0
22 Nov 2021
Single-channel speech separation using Soft-minimum Permutation Invariant Training
Midia Yousefi
John H. L. Hansen
27
3
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
55
31
0
15 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
56
10
0
02 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
160
377
0
02 Nov 2021
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model
M. Kocour
Kateřina Žmolíková
Lucas Ondel
J. Svec
Marc Delcroix
Tsubasa Ochiai
L. Burget
J. Černocký
29
1
0
31 Oct 2021
SA-SDR: A novel loss function for separation of meeting style data
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
74
21
0
29 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
72
8
0
28 Oct 2021
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Wangyou Zhang
Zhuo Chen
Naoyuki Kanda
Shujie Liu
Jinyu Li
...
Takuya Yoshioka
Xiong Xiao
Zhong Meng
Y. Qian
Furu Wei
VLM
64
6
0
27 Oct 2021
REAL-M: Towards Speech Separation on Real Mixtures
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
59
18
0
20 Oct 2021
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
49
22
0
20 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
62
38
0
19 Oct 2021
Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
Xiaofei Wang
Zhuo Chen
Xuedong Huang
87
62
0
18 Oct 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
73
119
0
14 Oct 2021
Previous
1
2
3
4
5
6
7
8
Next