ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.04306
  4. Cited By
Deep clustering: Discriminative embeddings for segmentation and
  separation

Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
ArXiv (abs)PDFHTML

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown
Title
Ranked List Loss for Deep Metric Learning
Ranked List Loss for Deep Metric Learning
Xinshao Wang
Yang Hua
Elyor Kodirov
N. Robertson
120
248
0
08 Mar 2019
Low-Latency Deep Clustering For Speech Separation
Low-Latency Deep Clustering For Speech Separation
Shanshan Wang
Gaurav Naithani
Tuomas Virtanen
49
15
0
19 Feb 2019
Target Speaker Extraction for Overlapped Multi-Talker Speaker
  Verification
Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification
Wei Rao
Chenglin Xu
Chng Eng Siong
Haizhou Li
31
11
0
07 Feb 2019
FurcaNet: An end-to-end deep gated convolutional, long short-term
  memory, deep neural networks for single channel speech separation
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation
Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Shoji Hayakawa
Shouji Harada
Jiqing Han
42
22
0
02 Feb 2019
Is CQT more suitable for monaural speech separation than STFT? an
  empirical study
Is CQT more suitable for monaural speech separation than STFT? an empirical study
Ziqiang Shi
Huibin Lin
Liu Liu
Rujie Liu
Jiqing Han
39
14
0
02 Feb 2019
Learning for Multi-Model and Multi-Type Fitting
Learning for Multi-Model and Multi-Type Fitting
Xun Xu
L. Cheong
Zhuwen Li
53
5
0
29 Jan 2019
Spectral Feature Transformation for Person Re-identification
Spectral Feature Transformation for Person Re-identification
Chuanchen Luo
Yuntao Chen
Naiyan Wang
Zhaoxiang Zhang
111
124
0
28 Nov 2018
Deep Learning Based Phase Reconstruction for Speaker Separation: A
  Trigonometric Perspective
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
Zhong-Qiu Wang
Ke Tan
DeLiang Wang
119
95
0
22 Nov 2018
Class-conditional embeddings for music source separation
Class-conditional embeddings for music source separation
A. Labatie
Gordon Wichern
Shrikant Venkataramani
Jonathan Le Roux
BDL
82
42
0
07 Nov 2018
Building Corpora for Single-Channel Speech Separation Across Multiple
  Domains
Building Corpora for Single-Channel Speech Separation Across Multiple Domains
Aman Rana
Gregory Sell
Leibny Paola García Perera
A. Lowe
Pratik Shah
61
10
0
06 Nov 2018
SDR - half-baked or well done?
SDR - half-baked or well done?
F. Sánchez-Martínez
M. Esplà-Gomis
Hakan Erdogan
J. Hershey
165
1,208
0
06 Nov 2018
Bootstrapping single-channel source separation via unsupervised spatial
  clustering on stereo mixtures
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures
Prem Seetharaman
Gordon Wichern
Jonathan Le Roux
Bryan Pardo
68
36
0
06 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
87
77
0
05 Nov 2018
Trainable Adaptive Window Switching for Speech Enhancement
Trainable Adaptive Window Switching for Speech Enhancement
Yuma Koizumi
Noboru Harada
Y. Haneda
84
8
0
05 Nov 2018
Unsupervised Deep Clustering for Source Separation: Direct Learning from
  Mixtures using Spatial Information
Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information
Efthymios Tzinis
Shrikant Venkataramani
Paris Smaragdis
SSL
85
50
0
05 Nov 2018
Audio Source Separation Using Variational Autoencoders and Weak Class
  Supervision
Audio Source Separation Using Variational Autoencoders and Weak Class Supervision
Ertuğ Karamatlı
A. Cemgil
S. Kırbız
BDLDRL
41
26
0
31 Oct 2018
Speaker Selective Beamformer with Keyword Mask Estimation
Speaker Selective Beamformer with Keyword Mask Estimation
Yusuke Kida
Dung T. Tran
Motoi Omachi
T. Taniguchi
Yuya Fujita
26
3
0
25 Oct 2018
DNN-based Source Enhancement to Increase Objective Sound Quality
  Assessment Score
DNN-based Source Enhancement to Increase Objective Sound Quality Assessment Score
Yuma Koizumi
Kenta Niwa
Yusuke Hioka
Kazunori Kobayashi
Y. Haneda
49
63
0
22 Oct 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned
  Spectrogram Masking
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
100
370
0
11 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation
  Approach Using Neural Networks
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
84
82
0
08 Oct 2018
Phasebook and Friends: Leveraging Discrete Representations for Source
  Separation
Phasebook and Friends: Leveraging Discrete Representations for Source Separation
Jonathan Le Roux
Gordon Wichern
Shinji Watanabe
Andy M. Sarroff
J. Hershey
68
77
0
02 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
181
1,799
0
20 Sep 2018
Memory Time Span in LSTMs for Multi-Speaker Source Separation
Memory Time Span in LSTMs for Multi-Speaker Source Separation
Jeroen Zegers
Hugo Van hamme
17
5
0
24 Aug 2018
Multi-scenario deep learning for multi-speaker source separation
Multi-scenario deep learning for multi-speaker source separation
Jeroen Zegers
Hugo Van hamme
21
3
0
24 Aug 2018
Superpixel Sampling Networks
Superpixel Sampling Networks
Varun Jampani
Deqing Sun
Ming-Yuan Liu
Ming-Hsuan Yang
Jan Kautz
SSeg
66
226
0
26 Jul 2018
Deep Extractor Network for Target Speaker Recovery From Single Channel
  Speech Mixtures
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures
Jun Wang
Jie Chen
Dan Su
Lianwu Chen
Meng Yu
Y. Qian
Dong Yu
93
91
0
24 Jul 2018
Monaural source enhancement maximizing source-to-distortion ratio via
  automatic differentiation
Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
Hiroaki Nakajima
Yu Takahashi
Kazunobu Kondo
Yuji Hisaminato
13
5
0
15 Jun 2018
SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners
SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners
Huiyuan Zhuo
Xuelin Qian
Yanwei Fu
Heng Yang
Xiangyang Xue
74
37
0
14 Jun 2018
ALMN: Deep Embedding Learning with Geometrical Virtual Point Generating
ALMN: Deep Embedding Learning with Geometrical Virtual Point Generating
Binghui Chen
Weihong Deng
64
7
0
04 Jun 2018
Backpropagation with N-D Vector-Valued Neurons Using Arbitrary Bilinear
  Products
Backpropagation with N-D Vector-Valued Neurons Using Arbitrary Bilinear Products
Zhe-Cheng Fan
T. Chan
Yi-Hsuan Yang
J. Jang
65
7
0
24 May 2018
A Purely End-to-end System for Multi-speaker Speech Recognition
A Purely End-to-end System for Multi-speaker Speech Recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
J. Hershey
49
89
0
15 May 2018
Deep Speech Denoising with Vector Space Projections
Deep Speech Denoising with Vector Space Projections
Jeff Hetherly
Paul Gamble
M. Barrios
Cory Stephenson
Karl S. Ni
24
0
0
27 Apr 2018
End-to-End Speech Separation with Unfolded Iterative Phase
  Reconstruction
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang
Jonathan Le Roux
DeLiang Wang
J. Hershey
151
124
0
26 Apr 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
77
160
0
25 Apr 2018
An Overview of Lead and Accompaniment Separation in Music
An Overview of Lead and Accompaniment Separation in Music
Z. Rafii
Antoine Liutkus
Fabian-Robert Stöter
S. I. Mimilakis
D. Fitzgerald
Bryan Pardo
60
103
0
23 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
110
754
0
10 Apr 2018
The Sound of Pixels
The Sound of Pixels
Hang Zhao
Chuang Gan
Andrew Rouditchenko
Carl Vondrick
Josh H. McDermott
Antonio Torralba
VLM
108
537
0
09 Apr 2018
Learning to Separate Object Sounds by Watching Unlabeled Video
Learning to Separate Object Sounds by Watching Unlabeled Video
Ruohan Gao
Rogerio Feris
Kristen Grauman
SSL
76
285
0
05 Apr 2018
Cracking the cocktail party problem by multi-beam deep attractor network
Cracking the cocktail party problem by multi-beam deep attractor network
Zhuo Chen
Jinyu Li
Xiong Xiao
Takuya Yoshioka
Huaming Wang
Zhenghao Wang
Jiawei Liu
49
36
0
29 Mar 2018
Generalization Challenges for Neural Architectures in Audio Source
  Separation
Generalization Challenges for Neural Architectures in Audio Source Separation
Shariq Mobin
Brian Cheung
Bruno A. Olshausen
DRL
36
2
0
23 Mar 2018
SpectralNet: Spectral Clustering using Deep Neural Networks
SpectralNet: Spectral Clustering using Deep Neural Networks
Uri Shaham
Kelly P. Stanton
Henry Li
B. Nadler
Ronen Basri
Y. Kluger
GNN
145
287
0
04 Jan 2018
A Survey on Multi-View Clustering
A Survey on Multi-View Clustering
Guoqing Chao
Shiliang Sun
J. Bi
63
236
0
18 Dec 2017
Classification vs. Regression in Supervised Learning for Single Channel
  Speaker Count Estimation
Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation
Fabian-Robert Stöter
Soumitro Chakrabarty
B. Edler
Emanuel Habets
BDL
106
38
0
12 Dec 2017
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics
  Optimization by Fully Convolutional Neural Networks
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
Szu-Wei Fu
Tao-Wei Wang
Yu Tsao
Xugang Lu
Hisashi Kawai
94
277
0
12 Sep 2017
Joint Separation and Denoising of Noisy Multi-talker Speech using
  Recurrent Neural Networks and Permutation Invariant Training
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
78
22
0
31 Aug 2017
Supervised Speech Separation Based on Deep Learning: An Overview
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
96
1,377
0
24 Aug 2017
Speaker Diarization using Deep Recurrent Convolutional Neural Networks
  for Speaker Embeddings
Speaker Diarization using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
Pawel Cyrta
Tomasz Trzciñski
Wojciech Stokowiec
68
33
0
09 Aug 2017
Progressive Joint Modeling in Unsupervised Single-channel Overlapped
  Speech Recognition
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
83
65
0
21 Jul 2017
Speaker-independent Speech Separation with Deep Attractor Network
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
97
247
0
12 Jul 2017
No Fuss Distance Metric Learning using Proxies
No Fuss Distance Metric Learning using Proxies
Yair Movshovitz-Attias
Alexander Toshev
Thomas Leung
Sergey Ioffe
Saurabh Singh
97
644
0
21 Mar 2017
Previous
12345678
Next