Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition
Lukas Drude
Jens Heitkaemper
Christoph Boeddeker
Reinhold Haeb-Umbach
66
72
0
30 Oct 2019
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
90
23
0
28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
David Ditter
Timo Gerkmann
63
58
0
25 Oct 2019
Filterbank design for end-to-end speech separation
Manuel Pariente
Samuele Cornell
Antoine Deleforge
Emmanuel Vincent
106
69
0
23 Oct 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation
Matthew Maciejewski
Gordon Wichern
E. McQuinn
Jonathan Le Roux
89
184
0
22 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
73
65
0
22 Oct 2019
Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments
Ethan Manilow
Prem Seetharaman
Bryan Pardo
72
42
0
22 Oct 2019
Discriminative Neural Clustering for Speaker Diarisation
Qiujia Li
Florian Kreyssig
Chao Zhang
P. Woodland
61
46
0
22 Oct 2019
Unsupervised Multi-Task Feature Learning on Point Clouds
Kaveh Hassani
Mike Haley
SSL, 3DPC
170
195
0
18 Oct 2019
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
95
121
0
15 Oct 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
127
776
0
14 Oct 2019
CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement
M. Gogate
K. Dashtipour
Ahsan Adeel
Amir Hussain
50
53
0
23 Sep 2019
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity
Ethan Manilow
Gordon Wichern
Prem Seetharaman
Jonathan Le Roux
77
127
0
18 Sep 2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models
Naoyuki Kanda
Shota Horiguchi
Yusuke Fujita
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
53
36
0
17 Sep 2019
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
240
243
0
13 Sep 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Kenji Nagamatsu
Shinji Watanabe
200
255
0
12 Sep 2019
Deep Metric Learning with Density Adaptivity
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
144
11
0
09 Sep 2019
Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian Mixture Model
Yoshiaki Bando
Y. Sasaki
Kazuyoshi Yoshii
BDL
52
9
0
29 Aug 2019
Nearest Neighbor Search-Based Bitwise Source Separation Using Discriminant Winner-Take-All Hashing
Sunwoo Kim
Minje Kim
18
0
0
26 Aug 2019
Audio query-based music source separation
Jie Hwan Lee
Hyeong-Seok Choi
Kyogu Lee
71
45
0
19 Aug 2019
Probabilistic Permutation Invariant Training for Speech Separation
Midia Yousefi
S. Khorram
John H. L. Hansen
47
23
0
04 Aug 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
Cunhang Fan
B. Liu
J. Tao
Jiangyan Yi
Zhengqi Wen
51
22
0
23 Jul 2019
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis
Yuki Saito
Shinnosuke Takamichi
Hiroshi Saruwatari
32
10
0
19 Jul 2019
Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming
Yoshiki Masuyama
M. Togami
Tatsuya Komatsu
38
8
0
11 Jul 2019
My lips are concealed: Audio-visual speech enhancement through obstructions
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
65
91
0
11 Jul 2019
Improving Reverberant Speech Training Using Diffuse Acoustic Simulation
Zhenyu Tang
Lianwu Chen
Bo Wu
Dong Yu
Tianyi Zhou
AI4CE
75
35
0
09 Jul 2019
WHAM!: Extending Speech Separation to Noisy Environments
Gordon Wichern
J. Antognini
Michael Flynn
Licheng Richard Zhu
E. McQuinn
Dwight Crow
Ethan Manilow
Jonathan Le Roux
86
355
0
02 Jul 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
63
34
0
26 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings
Shuo Liu
Gil Keren
Björn Schuller
33
3
0
24 Jun 2019
From Clustering to Cluster Explanations via Neural Networks
Jacob R. Kauffmann
Malte Esders
Lukas Ruff
G. Montavon
Wojciech Samek
K. Müller
79
72
0
18 Jun 2019
Divide and Conquer the Embedding Space for Metric Learning
A. Sanakoyeu
Vadim Tschernezki
U. Büchler
Bjorn Ommer
SSL
86
107
0
14 Jun 2019
A comprehensive study of speech separation: spectrogram vs waveform separation
F. Bahmaninezhad
Jian Wu
Rongzhi Gu
Shi-Xiong Zhang
Yong-mei Xu
Meng Yu
Dong Yu
81
81
0
17 May 2019
End-to-End Multi-Channel Speech Separation
Rongzhi Gu
Jian Wu
Shi-Xiong Zhang
Lianwu Chen
Yong-mei Xu
Meng Yu
Dan Su
Yuexian Zou
Dong Yu
56
77
0
15 May 2019
Machine learning in acoustics: theory and applications
Michael J. Bianco
Peter Gerstoft
James Traer
Emma Ozanich
M. Roch
Sharon Gannot
Charles-Alban Deledalle
AI4CE
89
391
0
11 May 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
T. Menne
Ilya Sklyar
Ralf Schlüter
Hermann Ney
158
35
0
09 May 2019
Universal Sound Separation
Ilya Kavalerov
Scott Wisdom
Hakan Erdogan
Brian Patton
K. Wilson
Jonathan Le Roux
J. Hershey
86
187
0
08 May 2019
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders
Manuel Pariente
Antoine Deleforge
Emmanuel Vincent
62
21
0
03 May 2019
A Style Transfer Approach to Source Separation
Shrikant Venkataramani
Efthymios Tzinis
Paris Smaragdis
OOD, DRL
44
6
0
01 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation
Yuzhou Liu
DeLiang Wang
91
158
0
25 Apr 2019
Self-Supervised Audio-Visual Co-Segmentation
Andrew Rouditchenko
Hang Zhao
Chuang Gan
Josh H. McDermott
Antonio Torralba
VLM, SSL
62
105
0
18 Apr 2019
Deep Filtering: Signal Extraction and Reconstruction Using Complex Time-Frequency Filters
Wolfgang Mack
Emanuel Habets
66
86
0
17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Gene-Ping Yang
Chao-I Tuan
Hung-yi Lee
Lin-Shan Lee
61
25
0
16 Apr 2019
Co-Separating Sounds of Visual Objects
Ruohan Gao
Kristen Grauman
140
210
0
16 Apr 2019
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanevsky
Ye Jia
88
115
0
08 Apr 2019
Time Domain Audio Visual Speech Separation
Jian Wu
Yong-mei Xu
Shi-Xiong Zhang
Lianwu Chen
Meng Yu
Lei Xie
Dong Yu
107
118
0
07 Apr 2019
Unsupervised Image Matching and Object Discovery as Optimization
Huy V. Vo
Francis R. Bach
Minsu Cho
Kai Han
Yann LeCun
P. Pérez
Jean Ponce
OCL
113
66
0
05 Apr 2019
Recursive speech separation for unknown number of speakers
Naoya Takahashi
Sudarsanam Parthasaarathy
Nabarun Goswami
Yuki Mitsufuji
58
81
0
05 Apr 2019
Point Cloud Oversegmentation with Graph-Structured Deep Metric Learning
Loic Landrieu
Mohamed Boussaha
3DPC
75
152
0
03 Apr 2019
Unsupervised training of a deep clustering model for multichannel blind source separation
Lukas Drude
Daniel Hasenklever
Reinhold Häb-Umbach
SSL
69
58
0
02 Apr 2019