Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.04306
Cited By
Deep clustering: Discriminative embeddings for segmentation and separation
18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep clustering: Discriminative embeddings for segmentation and separation"
50 / 357 papers shown
Title
Ranked List Loss for Deep Metric Learning
Xinshao Wang
Yang Hua
Elyor Kodirov
N. Robertson
120
248
0
08 Mar 2019
Low-Latency Deep Clustering For Speech Separation
Shanshan Wang
Gaurav Naithani
Tuomas Virtanen
49
15
0
19 Feb 2019
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification
Wei Rao
Chenglin Xu
Chng Eng Siong
Haizhou Li
31
11
0
07 Feb 2019
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation
Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Shoji Hayakawa
Shouji Harada
Jiqing Han
42
22
0
02 Feb 2019
Is CQT more suitable for monaural speech separation than STFT? an empirical study
Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Jiqing Han
39
14
0
02 Feb 2019
Learning for Multi-Model and Multi-Type Fitting
Xun Xu
L. Cheong
Zhuwen Li
53
5
0
29 Jan 2019
Spectral Feature Transformation for Person Re-identification
Chuanchen Luo
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
111
124
0
28 Nov 2018
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
119
95
0
22 Nov 2018
Class-conditional embeddings for music source separation
A. Labatie
Gordon Wichern
Shrikant Venkataramani
Jonathan Le Roux
BDL
82
42
0
07 Nov 2018
Building Corpora for Single-Channel Speech Separation Across Multiple Domains
Aman Rana
Gregory Sell
Leibny Paola García Perera
A. Lowe
Pratik Shah
61
10
0
06 Nov 2018
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,208
0
06 Nov 2018
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures
Prem Seetharaman
Gordon Wichern
Jonathan Le Roux
Bryan Pardo
68
36
0
06 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
87
77
0
05 Nov 2018
Trainable Adaptive Window Switching for Speech Enhancement
Yuma Koizumi
Noboru Harada
Y. Haneda
84
8
0
05 Nov 2018
Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information
Efthymios Tzinis
Shrikant Venkataramani
Paris Smaragdis
SSL
85
50
0
05 Nov 2018
Audio Source Separation Using Variational Autoencoders and Weak Class Supervision
Ertuğ Karamatlı
A. Cemgil
S. Kırbız
BDL
DRL
41
26
0
31 Oct 2018
Speaker Selective Beamformer with Keyword Mask Estimation
Yusuke Kida
Dung T. Tran
Motoi Omachi
T. Taniguchi
Yuya Fujita
26
3
0
25 Oct 2018
DNN-based Source Enhancement to Increase Objective Sound Quality Assessment Score
Yuma Koizumi
Kenta Niwa
Yusuke Hioka
Kazunori Kobayashi
Y. Haneda
49
63
0
22 Oct 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
100
370
0
11 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
84
82
0
08 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
68
77
0
02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
181
1,799
0
20 Sep 2018
Memory Time Span in LSTMs for Multi-Speaker Source Separation
Jeroen Zegers
Hugo Van hamme
17
5
0
24 Aug 2018
Multi-scenario deep learning for multi-speaker source separation
Jeroen Zegers
Hugo Van hamme
21
3
0
24 Aug 2018
Superpixel Sampling Networks
Varun Jampani
Deqing Sun
Ming-Yuan Liu
Ming-Hsuan Yang
Jan Kautz
SSeg
66
226
0
26 Jul 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
93
91
0
24 Jul 2018
Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
Hiroaki Nakajima
Yu Takahashi
Kazunobu Kondo
Yuji Hisaminato
13
5
0
15 Jun 2018
SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners
Huiyuan Zhuo
Xuelin Qian
Yanwei Fu
Heng Yang
Xiangyang Xue
74
37
0
14 Jun 2018
ALMN: Deep Embedding Learning with Geometrical Virtual Point Generating
Binghui Chen
Weihong Deng
64
7
0
04 Jun 2018
Backpropagation with N-D Vector-Valued Neurons Using Arbitrary Bilinear Products
Zhe-Cheng Fan
T. Chan
Yi-Hsuan Yang
J. Jang
65
7
0
24 May 2018
A Purely End-to-end System for Multi-speaker Speech Recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
J. Hershey
49
89
0
15 May 2018
Deep Speech Denoising with Vector Space Projections
Jeff Hetherly
Paul Gamble
M. Barrios
Cory Stephenson
Karl S. Ni
24
0
0
27 Apr 2018
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
J. Hershey
151
124
0
26 Apr 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
77
160
0
25 Apr 2018
An Overview of Lead and Accompaniment Separation in Music
Z. Rafii
Antoine Liutkus
Fabian-Robert Stöter
S. I. Mimilakis
D. Fitzgerald
Bryan Pardo
60
103
0
23 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
110
754
0
10 Apr 2018
The Sound of Pixels
Hang Zhao
Chuang Gan
Andrew Rouditchenko
Carl Vondrick
Josh H. McDermott
Antonio Torralba
VLM
108
537
0
09 Apr 2018
Learning to Separate Object Sounds by Watching Unlabeled Video
Ruohan Gao
Rogerio Feris
Kristen Grauman
SSL
76
285
0
05 Apr 2018
Cracking the cocktail party problem by multi-beam deep attractor network
Zhuo Chen
Jinyu Li
Xiong Xiao
Takuya Yoshioka
Huaming Wang
Zhenghao Wang
Jiawei Liu
49
36
0
29 Mar 2018
Generalization Challenges for Neural Architectures in Audio Source Separation
Shariq Mobin
Brian Cheung
Bruno A. Olshausen
DRL
36
2
0
23 Mar 2018
SpectralNet: Spectral Clustering using Deep Neural Networks
Uri Shaham
Kelly P. Stanton
Henry Li
B. Nadler
Ronen Basri
Y. Kluger
GNN
145
287
0
04 Jan 2018
A Survey on Multi-View Clustering
Guoqing Chao
Shiliang Sun
J. Bi
63
236
0
18 Dec 2017
Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation
Fabian-Robert Stöter
Soumitro Chakrabarty
B. Edler
Emanuel Habets
BDL
106
38
0
12 Dec 2017
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
94
277
0
12 Sep 2017
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
78
22
0
31 Aug 2017
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
96
1,377
0
24 Aug 2017
Speaker Diarization using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
Pawel Cyrta
Tomasz Trzciñski
Wojciech Stokowiec
68
33
0
09 Aug 2017
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
83
65
0
21 Jul 2017
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
97
247
0
12 Jul 2017
No Fuss Distance Metric Learning using Proxies
Yair Movshovitz-Attias
Alexander Toshev
Thomas Leung
Sergey Ioffe
Saurabh Singh
97
644
0
21 Mar 2017
Previous
1
2
3
4
5
6
7
8
Next