Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
arXiv:1508.04306

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown
Title
End-to-End Multi-Look Keyword Spotting
Meng Yu
Xuan Ji
Bo Wu
Dan Su
Dong Yu
44
19
0
20 May 2020
Jointly optimal denoising, dereverberation, and source separation
Tomohiro Nakatani
Christoph Boeddeker
K. Kinoshita
Rintaro Ikeshita
Marc Delcroix
Reinhold Haeb-Umbach
47
46
0
20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network
Tingle Li
Qingjian Lin
Yuanyuan Bao
Ming Li
31
38
0
19 May 2020
Multimodal Target Speech Separation with Voice and Face References
Leyuan Qu
C. Weber
S. Wermter
CVBM
63
19
0
17 May 2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement
Aswin Sivaraman
Minje Kim
MoE
59
13
0
16 May 2020
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
Nils L. Westhausen
B. Meyer
59
102
0
15 May 2020
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
114
67
0
14 May 2020
Foreground-Background Ambient Sound Scene Separation
Michel Olvera
Emmanuel Vincent
Romain Serizel
Gilles Gasso
52
9
0
11 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
Meng Ge
Chenglin Xu
Longbiao Wang
Chng Eng Siong
Jianwu Dang
Haizhou Li
79
149
0
10 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
94
157
0
08 May 2020
Neural Spatio-Temporal Beamformer for Target Speech Separation
Yong-mei Xu
Meng Yu
Shi-Xiong Zhang
Lianwu Chen
Chao Weng
Jianming Liu
Dong Yu
82
41
0
08 May 2020
Time-domain speaker extraction network
Chenglin Xu
Wei Rao
Chng Eng Siong
Haizhou Li
50
55
0
29 Apr 2020
Music Gesture for Visual Sound Separation
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
97
205
0
20 Apr 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network
Chenglin Xu
Wei Rao
Eng Siong Chng
Haizhou Li
61
173
0
17 Apr 2020
Two-stage model and optimal SI-SNR for monaural multi-speaker speech separation in noisy environment
Chao Ma
Dongmei Li
Xupeng Jia
34
5
0
14 Apr 2020
Simultaneous Denoising and Dereverberation Using Deep Embedding Features
Cunhang Fan
J. Tao
B. Liu
Jiangyan Yi
Zhengqi Wen
25
2
0
06 Apr 2020
ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis
Eu Wern Teh
Terrance Devries
Graham W. Taylor
83
159
0
02 Apr 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss
Yi Luo
N. Mesgarani
81
29
0
27 Mar 2020
Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method
Cunhang Fan
J. Tao
B. Liu
Jiangyan Yi
Zhengqi Wen
Xuefei Liu
61
9
0
17 Mar 2020
Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system
K. Kinoshita
Marc Delcroix
S. Araki
Tomohiro Nakatani
238
30
0
09 Mar 2020
Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning
Rongzhi Gu
Shi-Xiong Zhang
Lianwu Chen
Yong-mei Xu
Meng Yu
Dan Su
Yuexian Zou
Dong Yu
71
61
0
09 Mar 2020
Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning
ByungSoo Ko
Geonmo Gu
153
54
0
05 Mar 2020
Multi-Microphone Complex Spectral Mapping for Speech Dereverberation
Zhong-Qiu Wang
DeLiang Wang
56
63
0
04 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
101
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
112
265
0
20 Feb 2020
An empirical study of Conv-TasNet
Berkan Kadıoğlu
Michael Horgan
Xiaoyu Liu
Jordi Pons
Dan Darcy
Vivek Kumar
40
44
0
20 Feb 2020
Real-time binaural speech separation with preserved spatial cues
Cong Han
Yi Luo
N. Mesgarani
81
42
0
16 Feb 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Yuma Koizumi
Kohei Yatabe
Marc Delcroix
Yoshiki Masuyama
Daiki Takeuchi
51
125
0
14 Feb 2020
Real-time speech enhancement using equilibriated RNN
Daiki Takeuchi
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
Noboru Harada
41
36
0
14 Feb 2020
On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement
Ashutosh Pandey
DeLiang Wang
89
53
0
10 Feb 2020
End-to-End Multi-speaker Speech Recognition with Transformer
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
ViT
96
106
0
10 Feb 2020
Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features
Cunhang Fan
B. Liu
J. Tao
Jiangyan Yi
Zhengqi Wen
46
11
0
05 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
109
217
0
30 Jan 2020
Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform
Tomohiko Nakamura
Hiroshi Saruwatari
AI4TS
31
18
0
28 Jan 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
Marc Delcroix
Tsubasa Ochiai
Kateřina Žmolíková
K. Kinoshita
Naohiro Tawara
Tomohiro Nakatani
S. Araki
129
124
0
23 Jan 2020
LaFurca: Iterative Refined Speech Separation Based on Context-Aware Dual-Path Parallel Bi-LSTM
Ziqiang Shi
Rujie Liu
Jiqing Han
34
4
0
23 Jan 2020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Jianwei Yu
Shi-Xiong Zhang
Jian Wu
Shahram Ghorbani
Bo Wu
Shiyin Kang
Shansong Liu
Xunying Liu
Helen Meng
Dong Yu
85
73
0
06 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Rongzhi Gu
Yuexian Zou
60
18
0
02 Jan 2020
Practical applicability of deep neural networks for overlapping speaker separation
Pieter Appeltans
Jeroen Zegers
Hugo Van hamme
23
6
0
19 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization
Jeroen Zegers
Hugo Van hamme
BDL
34
7
0
19 Dec 2019
Advances in Online Audio-Visual Meeting Transcription
Takuya Yoshioka
Igor Abramovski
Cem Aksoylar
Zhuo Chen
Moshe David
...
Huaming Wang
Zhenghao Wang
Jun Zhang
Yong Zhao
Tianyan Zhou
95
75
0
10 Dec 2019
Improving Voice Separation by Incorporating End-to-end Speech Recognition
Naoya Takahashi
M. Singh
Sakya Basak
Sudarsanam Parthasaarathy
Sriram Ganapathy
Yuki Mitsufuji
VLM
43
19
0
29 Nov 2019
Region segmentation via deep learning and convex optimization
Matthias Sonntag
V. Morgenshtern
3DPC
20
1
0
28 Nov 2019
Demystifying TasNet: A Dissecting Approach
Jens Heitkaemper
Darius Jakobeit
Christoph Boeddeker
Lukas Drude
Reinhold Haeb-Umbach
54
58
0
20 Nov 2019
Improving Universal Sound Separation Using Sound Classification
Efthymios Tzinis
Scott Wisdom
J. Hershey
A. Jansen
D. Ellis
VLM
79
73
0
18 Nov 2019
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System
Shuo Liu
Gil Keren
Björn Schuller
57
4
0
16 Nov 2019
Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function
M. Togami
Yoshiki Masuyama
Tatsuya Komatsu
Yumi Nakagome
57
25
0
11 Nov 2019
The Speed Submission to DIHARD II: Contributions & Lessons Learned
Md. Sahidullah
J. Patino
Samuele Cornell
Ruiqing Yin
S. Sivasankaran
...
Emmanuel Vincent
Nicholas W. D. Evans
Sébastien Marcel
S. Squartini
C. Barras
VLM
83
16
0
06 Nov 2019
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision
Fatemeh Pishdadian
Gordon Wichern
Jonathan Le Roux
72
43
0
06 Nov 2019
End-to-end Non-Negative Autoencoders for Sound Source Separation
Shrikant Venkataramani
Efthymios Tzinis
Paris Smaragdis
80
5
0
31 Oct 2019