Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.04306
Cited By
Deep clustering: Discriminative embeddings for segmentation and separation
18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep clustering: Discriminative embeddings for segmentation and separation"
50 / 357 papers shown
Title
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
62
23
0
19 Feb 2021
A Deep Embedded Refined Clustering Approach for Breast Cancer Distinction based on DNA Methylation
Rocío del Amor
Adrián Colomer
C. Monteagudo
Valery Naranjo
17
24
0
18 Feb 2021
Speaker and Direction Inferred Dual-channel Speech Separation
Chenxing Li
Jiaming Xu
N. Mesgarani
Bo Xu
34
8
0
08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
44
13
0
07 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
382
337
0
24 Jan 2021
A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
N. Ito
Rintaro Ikeshita
H. Sawada
Tomohiro Nakatani
25
26
0
21 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
89
29
0
13 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
243
202
0
08 Jan 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
105
38
0
23 Dec 2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
58
8
0
17 Dec 2020
Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation
Ziye Yang
Shanzheng Guan
Xiao-Lei Zhang
32
14
0
01 Dec 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation
Christoph Boeddeker
Wangyou Zhang
Tomohiro Nakatani
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Naoyuki Kamo
Y. Qian
Reinhold Haeb-Umbach
87
30
0
30 Nov 2020
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
38
4
0
29 Nov 2020
Streaming end-to-end multi-talker speech recognition
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
72
44
0
26 Nov 2020
Streaming Multi-speaker ASR with RNN-T
Ilya Sklyar
A. Piunova
Yulan Liu
80
37
0
23 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
36
10
0
17 Nov 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
76
46
0
11 Nov 2020
Surrogate Source Model Learning for Determined Source Separation
Robin Scheibler
M. Togami
82
22
0
11 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
95
82
0
07 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
68
29
0
04 Nov 2020
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Yihui Fu
Jian Wu
Yanxin Hu
Mengtao Xing
Lei Xie
72
24
0
04 Nov 2020
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang
Shun-Po Chuang
Da-Rong Liu
Yi-Chen Chen
Gene-Ping Yang
Hung-yi Lee
SSL
92
14
0
29 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
101
567
0
25 Oct 2020
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
23
3
0
25 Oct 2020
Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small Step
Matthew Maciejewski
Jing Shi
Shinji Watanabe
Sanjeev Khudanpur
61
11
0
23 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
93
11
0
20 Oct 2020
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
62
9
0
19 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
51
50
0
15 Oct 2020
The Cone of Silence: Speech Separation by Localization
Teerapat Jenrungrot
V. Jayaram
S. M. Seitz
Ira Kemelmacher-Shlizerman
76
56
0
12 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
58
90
0
04 Oct 2020
X-DC: Explainable Deep Clustering based on Learnable Spectrogram Templates
C. Watanabe
Hirokazu Kameoka
34
0
0
18 Sep 2020
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Quan Wang
Ignacio López Moreno
Mert Saglam
K. Wilson
Alan Chiao
...
Yanzhang He
Wei Li
Jason W. Pelecanos
M. Nika
A. Gruenstein
VLM
71
86
0
09 Sep 2020
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
59
29
0
02 Sep 2020
Variational Autoencoder for Anti-Cancer Drug Response Prediction
Hongyuan Dong
Jiaqing Xie
Zhi Jing
Dexin Ren
DRL
43
14
0
22 Aug 2020
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
76
130
0
13 Aug 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss
Ziqiang Shi
Rujie Liu
Jiqing Han
38
7
0
06 Aug 2020
MIRNet: Learning multiple identities representations in overlapped speech
Hyewon Han
Soo-Whan Chung
Hong-Goo Kang
69
8
0
04 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
120
289
0
28 Jul 2020
Dereverberation using joint estimation of dry speech signal and acoustic system
Sanna Wager
Keunwoo Choi
Simon Durand
96
3
0
24 Jul 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
96
130
0
14 Jul 2020
Do We Need Sound for Sound Source Localization?
Takashi Oya
Shohei Iwase
Ryota Natsume
Takahiro Itazuri
Shugo Yamaguchi
Shigeo Morishima
41
22
0
11 Jul 2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks
Jibin Wu
Chenglin Xu
Daquan Zhou
Haizhou Li
Kay Chen Tan
67
117
0
02 Jul 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
32
6
0
01 Jul 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
88
22
0
25 Jun 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
70
27
0
23 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
103
23
0
04 Jun 2020
Neural Speaker Diarization with Speaker-Wise Chain Rule
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Jing Shi
Kenji Nagamatsu
100
45
0
02 Jun 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
Yuichiro Koyama
Oluwafemi Azeez
Bhiksha Raj
44
4
0
23 May 2020
Previous
1
2
3
4
5
6
7
8
Next