Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.04306
Cited By
Deep clustering: Discriminative embeddings for segmentation and separation
18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep clustering: Discriminative embeddings for segmentation and separation"
50 / 357 papers shown
Title
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold Haeb-Umbach
66
72
0
30 Oct 2019
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
90
23
0
28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
David Ditter
Timo Gerkmann
63
58
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
106
69
0
23 Oct 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation
Matthew Maciejewski
Gordon Wichern
E. McQuinn
Jonathan Le Roux
89
184
0
22 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
73
65
0
22 Oct 2019
Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments
Ethan Manilow
Prem Seetharaman
Bryan Pardo
72
42
0
22 Oct 2019
Discriminative Neural Clustering for Speaker Diarisation
Qiujia Li
Florian Kreyssig
Chao Zhang
P. Woodland
61
46
0
22 Oct 2019
Unsupervised Multi-Task Feature Learning on Point Clouds
Kaveh Hassani
Mike Haley
SSL
3DPC
170
195
0
18 Oct 2019
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
95
121
0
15 Oct 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
127
776
0
14 Oct 2019
CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
M. Gogate
K. Dashtipour
Ahsan Adeel
Amir Hussain
50
53
0
23 Sep 2019
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity
Ethan Manilow
Gordon Wichern
Prem Seetharaman
Jonathan Le Roux
77
127
0
18 Sep 2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
Naoyuki Kanda
Shota Horiguchi
Yusuke Fujita
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
53
36
0
17 Sep 2019
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
240
243
0
13 Sep 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
200
255
0
12 Sep 2019
Deep Metric Learning with Density Adaptivity
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
144
11
0
09 Sep 2019
Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian Mixture Model
Yoshiaki Bando
Y. Sasaki
Kazuyoshi Yoshii
BDL
52
9
0
29 Aug 2019
Nearest Neighbor Search-Based Bitwise Source Separation Using Discriminant Winner-Take-All Hashing
Sunwoo Kim
Minje Kim
18
0
0
26 Aug 2019
Audio query-based music source separation
Jie Hwan Lee
Hyeong-Seok Choi
Kyogu Lee
71
45
0
19 Aug 2019
Probabilistic Permutation Invariant Training for Speech Separation
Midia Yousefi
S. Khorram
John H. L. Hansen
47
23
0
04 Aug 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
Cunhang Fan
B. Liu
J. Tao
Jiangyan Yi
Zhengqi Wen
51
22
0
23 Jul 2019
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis
Yuki Saito
Shinnosuke Takamichi
Hiroshi Saruwatari
32
10
0
19 Jul 2019
Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming
Yoshiki Masuyama
M. Togami
Tatsuya Komatsu
38
8
0
11 Jul 2019
My lips are concealed: Audio-visual speech enhancement through obstructions
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
65
91
0
11 Jul 2019
Improving Reverberant Speech Training Using Diffuse Acoustic Simulation
Zhenyu Tang
Lianwu Chen
Bo Wu
Dong Yu
Tianyi Zhou
AI4CE
75
35
0
09 Jul 2019
WHAM!: Extending Speech Separation to Noisy Environments
Gordon Wichern
J. Antognini
Michael Flynn
Licheng Richard Zhu
E. McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
86
355
0
02 Jul 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
63
34
0
26 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
Shuo Liu
Gil Keren
Björn Schuller
33
3
0
24 Jun 2019
From Clustering to Cluster Explanations via Neural Networks
Jacob R. Kauffmann
Malte Esders
Lukas Ruff
G. Montavon
Wojciech Samek
K. Müller
79
72
0
18 Jun 2019
Divide and Conquer the Embedding Space for Metric Learning
A. Sanakoyeu
Vadim Tschernezki
U. Büchler
Bjorn Ommer
SSL
86
107
0
14 Jun 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
81
81
0
17 May 2019
End-to-End Multi-Channel Speech Separation
Rongzhi Gu
Jian Wu
Shi-Xiong Zhang
Lianwu Chen
Yong-mei Xu
Meng Yu
Dan Su
Yuexian Zou
Dong Yu
56
77
0
15 May 2019
Machine learning in acoustics: theory and applications
Michael J. Bianco
Peter Gerstoft
James Traer
Emma Ozanich
M. Roch
Sharon Gannot
Charles-Alban Deledalle
AI4CE
89
391
0
11 May 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
T. Menne
Ilya Sklyar
Ralf Schluter
Hermann Ney
158
35
0
09 May 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
86
187
0
08 May 2019
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
Manuel Pariente
Antoine Deleforge
Emmanuel Vincent
62
21
0
03 May 2019
A Style Transfer Approach to Source Separation
Shrikant Venkataramani
Efthymios Tzinis
Paris Smaragdis
OOD
DRL
44
6
0
01 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
91
158
0
25 Apr 2019
Self-Supervised Audio-Visual Co-Segmentation
Andrew Rouditchenko
Hang Zhao
Chuang Gan
Josh H. McDermott
Antonio Torralba
VLM
SSL
62
105
0
18 Apr 2019
Deep Filtering: Signal Extraction and Reconstruction Using Complex Time-Frequency Filters
Wolfgang Mack
Emanuel Habets
66
86
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
61
25
0
16 Apr 2019
Co-Separating Sounds of Visual Objects
Ruohan Gao
Kristen Grauman
140
210
0
16 Apr 2019
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
88
115
0
08 Apr 2019
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
107
118
0
07 Apr 2019
Unsupervised Image Matching and Object Discovery as Optimization
Huy V. Vo
Francis R. Bach
Minsu Cho
Kai Han
Yann LeCun
P. Pérez
Jean Ponce
OCL
113
66
0
05 Apr 2019
Recursive speech separation for unknown number of speakers
Naoya Takahashi
Sudarsanam Parthasaarathy
Nabarun Goswami
Yuki Mitsufuji
58
81
0
05 Apr 2019
Point Cloud Oversegmentation with Graph-Structured Deep Metric Learning
Loic Landrieu
Mohamed Boussaha
3DPC
75
152
0
03 Apr 2019
Unsupervised training of a deep clustering model for multichannel blind source separation
Lukas Drude
Daniel Hasenklever
Reinhold Häb-Umbach
SSL
69
58
0
02 Apr 2019
Previous
1
2
3
4
5
6
7
8
Next