Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown
Learning to Separate Voices by Spatial Regions
Alan Xu
Romit Roy Choudhury
106
10
0
09 Jul 2022
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation
Xiaoyu Wang
Xiangyu Kong
Xiulian Peng
Yan Lu
62
6
0
04 Jul 2022
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models
Rahil Parikh
G. Rochette
C. Espy-Wilson
S. Shamma
UQCV
35
0
0
20 Jun 2022
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
99
18
0
19 Jun 2022
Semi-supervised Time Domain Target Speaker Extraction with Attention
Zhepei Wang
Ritwik Giri
Shrikant Venkataramani
Umut Isik
J. Valin
Paris Smaragdis
Mike Goodwin
A. Krishnaswamy
59
7
0
18 Jun 2022
On the Use of Deep Mask Estimation Module for Neural Source Separation Systems
Kai Li
Xiaolin Hu
Yi Luo
72
16
0
15 Jun 2022
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
98
24
0
06 Jun 2022
SepIt: Approaching a Single Channel Speech Separation Bound
Shahar Lutati
Eliya Nachmani
Lior Wolf
VLM
131
27
0
24 May 2022
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Catalin Zorila
R. Doddipatla
38
11
0
09 May 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
74
18
0
24 Apr 2022
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation
Jiangyu Han
Yanhua Long
65
6
0
23 Apr 2022
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Zifeng Zhao
Rongzhi Gu
Dongchao Yang
Jinchuan Tian
Yuexian Zou
59
2
0
15 Apr 2022
RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System
M. Z. Ozturk
Chenshu Wu
Beibei Wang
Min Wu
K. Liu
62
21
0
14 Apr 2022
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness
Dianwen Ng
Jing Pang
Yanghua Xiao
Biao Tian
Qiang Fu
Eng Siong Chng
50
2
0
11 Apr 2022
Listen only to me! How well can target speech extraction handle false alarms?
Marc Delcroix
K. Kinoshita
Tsubasa Ochiai
Kateřina Žmolíková
Hiroshi Sato
Tomohiro Nakatani
71
15
0
11 Apr 2022
Multichannel Speech Separation with Narrow-band Conformer
Changsheng Quan
Xiaofei Li
69
13
0
09 Apr 2022
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning
Marc Delcroix
Jorge Bennasar Vázquez
Tsubasa Ochiai
K. Kinoshita
Yasunori Ohishi
S. Araki
VLM
83
34
0
08 Apr 2022
Heterogeneous Target Speech Separation
Hyunjae Cho
Wonbin Jung
Junhyeok Lee
Paris Smaragdis
Sanghyun Woo
92
26
0
07 Apr 2022
End-to-end multi-talker audio-visual ASR using an active speaker attention module
R. Rose
Olivier Siohan
58
3
0
01 Apr 2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction
Zexu Pan
Meng Ge
Haizhou Li
72
20
0
31 Mar 2022
Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Zhenhao Jin
Xiang Hao
Xiangdong Su
55
4
0
30 Mar 2022
Learning a Structured Latent Space for Unsupervised Point Cloud Completion
Yingjie Cai
Kwan-Yee Lin
Chao Zhang
Qiang Wang
Xiaogang Wang
Hongsheng Li
3DPC
SSL
95
37
0
29 Mar 2022
Remix-cycle-consistent Learning on Adversarially Learned Separator for Accurate and Stable Unsupervised Speech Separation
Kohei Saijo
Tetsuji Ogawa
46
9
0
26 Mar 2022
Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation
Xue Yang
C. Bao
51
3
0
25 Mar 2022
Harmonicity Plays a Critical Role in DNN Based Versus in Biologically-Inspired Monaural Speech Segregation Systems
Rahil Parikh
Ilya Kavalerov
C. Espy-Wilson
Shihab Shamma (Institute for Systems Research)
30
3
0
08 Mar 2022
L-SpEx: Localized Target Speaker Extraction
Meng Ge
Chenglin Xu
Longbiao Wang
Eng Siong Chng
Jianwu Dang
Haizhou Li
52
24
0
21 Feb 2022
Multi-Channel Speech Denoising for Machine Ears
Cong Han
Emine Merve Kaya
Kyle Hoefer
M. Slaney
S. Carlile
27
2
0
17 Feb 2022
Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation
Jier Chen
Chang Liu
Jiawu Xie
Jie An
Nan Huang
16
10
0
09 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
66
23
0
06 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
83
27
0
26 Jan 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition
Hiroshi Sato
Tsubasa Ochiai
Marc Delcroix
K. Kinoshita
Naoyuki Kamo
Takafumi Moriya
70
27
0
11 Jan 2022
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms
Wolfgang Mack
Julian Wechsler
Emanuel Habets
118
11
0
03 Jan 2022
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Jing Shi
Xuankai Chang
Tomoki Hayashi
Yen-Ju Lu
Shinji Watanabe
Bo Xu
105
19
0
17 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
67
9
0
10 Dec 2021
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Yicheng Hsu
Yonghan Lee
M. Bai
52
10
0
10 Dec 2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Xiaolin Hu
Kai Li
Weiyi Zhang
Yi Luo
Jean-Marie Lemercier
Timo Gerkmann
91
51
0
04 Dec 2021
Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Yiwen Shao
Shi-Xiong Zhang
Dong Yu
88
15
0
22 Nov 2021
Single-channel speech separation using Soft-minimum Permutation Invariant Training
Midia Yousefi
John H. L. Hansen
27
3
0
16 Nov 2021
Monaural source separation: From anechoic to reverberant environments
Tobias Cord-Landwehr
Christoph Boeddeker
Thilo von Neumann
Catalin Zorila
R. Doddipatla
Reinhold Haeb-Umbach
55
31
0
15 Nov 2021
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks
Nils L. Westhausen
R. Huber
Hannah Baumgartner
Ragini Sinha
J. Rennies
B. Meyer
56
10
0
02 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
160
377
0
02 Nov 2021
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model
M. Kocour
Kateřina Žmolíková
Lucas Ondel
J. Svec
Marc Delcroix
Tsubasa Ochiai
L. Burget
J. Černocký
29
1
0
31 Oct 2021
SA-SDR: A novel loss function for separation of meeting style data
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
74
21
0
29 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
72
8
0
28 Oct 2021
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Wangyou Zhang
Zhuo Chen
Naoyuki Kanda
Shujie Liu
Jinyu Li
...
Takuya Yoshioka
Xiong Xiao
Zhong Meng
Y. Qian
Furu Wei
VLM
64
6
0
27 Oct 2021
REAL-M: Towards Speech Separation on Real Mixtures
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
59
18
0
20 Oct 2021
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training
Aswin Sivaraman
Scott Wisdom
Hakan Erdogan
J. Hershey
49
22
0
20 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
62
38
0
19 Oct 2021
Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Sefik Emre Eskimez
Takuya Yoshioka
Huaming Wang
Xiaofei Wang
Zhuo Chen
Xuedong Huang
87
62
0
18 Oct 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
73
119
0
14 Oct 2021