Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown
TransMask: A Compact and Fast Speech Separation Model Based on Transformer
Zining Zhang
Bingsheng He
Zhenjie Zhang
62
23
0
19 Feb 2021
A Deep Embedded Refined Clustering Approach for Breast Cancer Distinction based on DNA Methylation
Rocío del Amor
Adrián Colomer
C. Monteagudo
Valery Naranjo
17
24
0
18 Feb 2021
Speaker and Direction Inferred Dual-channel Speech Separation
Chenxing Li
Jiaming Xu
N. Mesgarani
Bo Xu
34
8
0
08 Feb 2021
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
44
13
0
07 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
382
337
0
24 Jan 2021
A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
N. Ito
Rintaro Ikeshita
H. Sawada
Tomohiro Nakatani
25
26
0
21 Jan 2021
Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks
Max W. Y. Lam
Jun Wang
Dan Su
Dong Yu
89
29
0
13 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Ruohan Gao
Kristen Grauman
CVBM
243
202
0
08 Jan 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
105
38
0
23 Dec 2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
58
8
0
17 Dec 2020
Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation
Ziye Yang
Shanzheng Guan
Xiao-Lei Zhang
32
14
0
01 Dec 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation
Christoph Boeddeker
Wangyou Zhang
Tomohiro Nakatani
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Naoyuki Kamo
Y. Qian
Reinhold Haeb-Umbach
87
30
0
30 Nov 2020
A comparison of handcrafted, parameterized, and learnable features for speech separation
Wenbo Zhu
Mou Wang
Xiao-Lei Zhang
S. Rahardja
38
4
0
29 Nov 2020
Streaming end-to-end multi-talker speech recognition
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu
72
44
0
26 Nov 2020
Streaming Multi-speaker ASR with RNN-T
Ilya Sklyar
A. Piunova
Yulan Liu
80
37
0
23 Nov 2020
Rethinking the Separation Layers in Speech Separation Networks
Yi Luo
Zhuo Chen
Cong Han
Chenda Li
Tianyan Zhou
N. Mesgarani
36
10
0
17 Nov 2020
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments
Jisi Zhang
Catalin Zorila
R. Doddipatla
Jon Barker
76
46
0
11 Nov 2020
Surrogate Source Model Learning for Determined Source Separation
Robin Scheibler
M. Togami
82
22
0
11 Nov 2020
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Chenda Li
Jing Shi
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
...
Moto Hira
Tomoki Hayashi
Christoph Boeddeker
Zhuo Chen
Shinji Watanabe
VLM
95
82
0
07 Nov 2020
Single channel voice separation for unknown number of speakers under reverberant and noisy settings
Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi
68
29
0
04 Nov 2020
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation
Yihui Fu
Jian Wu
Yanxin Hu
Mengtao Xing
Lei Xie
72
24
0
04 Nov 2020
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang
Shun-Po Chuang
Da-Rong Liu
Yi-Chen Chen
Gene-Ping Yang
Hung-yi Lee
SSL
92
14
0
29 Oct 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
101
567
0
25 Oct 2020
Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain
Shulin He
Hao Li
Xueliang Zhang
23
3
0
25 Oct 2020
Training Noisy Single-Channel Speech Separation With Noisy Oracle Sources: A Large Gap and A Small Step
Matthew Maciejewski
Jing Shi
Shinji Watanabe
Sanjeev Khudanpur
61
11
0
23 Oct 2020
Speaker Separation Using Speaker Inventories and Estimated Speech
Peidong Wang
Zhuo Chen
DeLiang Wang
Jinyu Li
Jiawei Liu
93
11
0
20 Oct 2020
Attention-based scaling adaptation for target speech extraction
Jiangyu Han
Wei Rao
Yanhua Long
Jiaen Liang
62
9
0
19 Oct 2020
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan
Ruijie Tao
Chenglin Xu
Haizhou Li
51
50
0
15 Oct 2020
The Cone of Silence: Speech Separation by Localization
Teerapat Jenrungrot
V. Jayaram
S. M. Seitz
Ira Kemelmacher-Shlizerman
76
56
0
12 Oct 2020
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
Zhong-Qiu Wang
Peidong Wang
DeLiang Wang
58
90
0
04 Oct 2020
X-DC: Explainable Deep Clustering based on Learnable Spectrogram Templates
C. Watanabe
Hirokazu Kameoka
34
0
0
18 Sep 2020
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Quan Wang
Ignacio López Moreno
Mert Saglam
K. Wilson
Alan Chiao
...
Yanzhang He
Wei Li
Jason W. Pelecanos
M. Nika
A. Gruenstein
VLM
71
86
0
09 Sep 2020
An End-to-end Architecture of Online Multi-channel Speech Separation
Jian Wu
Zhuo Chen
Jinyu Li
Takuya Yoshioka
Zhili Tan
Ed Lin
Yi Luo
Lei Xie
3DV
38
21
0
07 Sep 2020
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation
Ke Tan
Buye Xu
Anurag Kumar
Eliya Nachmani
Yossi Adi
59
29
0
02 Sep 2020
Variational Autoencoder for Anti-Cancer Drug Response Prediction
Hongyuan Dong
Jiaqing Xie
Zhi Jing
Dexin Ren
DRL
43
14
0
22 Aug 2020
Continuous Speech Separation with Conformer
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Jinyu Li
Takuya Yoshioka
Chengyi Wang
Shujie Liu
M. Zhou
76
130
0
13 Aug 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss
Ziqiang Shi
Rujie Liu
Jiqing Han
38
7
0
06 Aug 2020
MIRNet: Learning multiple identities representations in overlapped speech
Hyewon Han
Soo-Whan Chung
Hong-Goo Kang
69
8
0
04 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
120
289
0
28 Jul 2020
Dereverberation using joint estimation of dry speech signal and acoustic system
Sanna Wager
Keunwoo Choi
Simon Durand
96
3
0
24 Jul 2020
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
Efthymios Tzinis
Zhepei Wang
Paris Smaragdis
96
130
0
14 Jul 2020
Do We Need Sound for Sound Source Localization?
Takashi Oya
Shohei Iwase
Ryota Natsume
Takahiro Itazuri
Shugo Yamaguchi
Shigeo Morishima
41
22
0
11 Jul 2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks
Jibin Wu
Chenglin Xu
Daquan Zhou
Haizhou Li
Kay Chen Tan
67
117
0
02 Jul 2020
Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment
Hangting Chen
Pengyuan Zhang
32
6
0
01 Jul 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
88
22
0
25 Jun 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
Jing Shi
Jiaming Xu
Yusuke Fujita
Shinji Watanabe
Bo Xu
BDL
70
21
0
25 Jun 2020
Unsupervised Sound Separation Using Mixture Invariant Training
Scott Wisdom
Efthymios Tzinis
Hakan Erdogan
Ron J. Weiss
K. Wilson
J. Hershey
70
27
0
23 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
103
23
0
04 Jun 2020
Neural Speaker Diarization with Speaker-Wise Chain Rule
Yusuke Fujita
Shinji Watanabe
Shota Horiguchi
Yawen Xue
Jing Shi
Kenji Nagamatsu
100
45
0
02 Jun 2020
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
Yuichiro Koyama
Oluwafemi Azeez
Bhiksha Raj
44
4
0
23 May 2020