Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown

End-to-End Multi-Look Keyword Spotting Meng Yu Xuan Ji Bo Wu Dan Su Dong Yu 44 19 0 20 May 2020
Jointly optimal denoising, dereverberation, and source separation Tomohiro Nakatani Christoph Boeddeker K. Kinoshita Rintaro Ikeshita Marc Delcroix Reinhold Haeb-Umbach 47 46 0 20 May 2020
Atss-Net: Target Speaker Separation via Attention-based Neural Network Tingle Li Qingjian Lin Yuanyuan Bao Ming Li 31 38 0 19 May 2020
Multimodal Target Speech Separation with Voice and Face References Leyuan Qu C. Weber S. Wermter CVBM 63 19 0 17 May 2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement Aswin Sivaraman Minje Kim MoE 59 13 0 16 May 2020
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression Nils L. Westhausen B. Meyer 59 102 0 15 May 2020
FaceFilter: Audio-visual speech separation using still images Soo-Whan Chung Soyeon Choe Joon Son Chung Hong-Goo Kang CVBM 114 67 0 14 May 2020
Foreground-Background Ambient Sound Scene Separation Michel Olvera Emmanuel Vincent Romain Serizel Gilles Gasso 52 9 0 11 May 2020
SpEx+: A Complete Time Domain Speaker Extraction Network Meng Ge Chenglin Xu Longbiao Wang Chng Eng Siong Jianwu Dang Haizhou Li 79 149 0 10 May 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers Manuel Pariente Samuele Cornell Joris Cosentino S. Sivasankaran Efthymios Tzinis ... Juan M. Martín-Donas David Ditter Ariel Frank Antoine Deleforge Emmanuel Vincent 94 157 0 08 May 2020
Neural Spatio-Temporal Beamformer for Target Speech Separation Yong-mei Xu Meng Yu Shi-Xiong Zhang Lianwu Chen Chao Weng Jianming Liu Dong Yu 82 41 0 08 May 2020
Time-domain speaker extraction network Chenglin Xu Wei Rao Chng Eng Siong Haizhou Li 50 55 0 29 Apr 2020
Music Gesture for Visual Sound Separation Chuang Gan Deng Huang Hang Zhao J. Tenenbaum Antonio Torralba 97 205 0 20 Apr 2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network Chenglin Xu Wei Rao Eng Siong Chng Haizhou Li 61 173 0 17 Apr 2020
Two-stage model and optimal SI-SNR for monaural multi-speaker speech separation in noisy environment Chao Ma Dongmei Li Xupeng Jia 34 5 0 14 Apr 2020
Simultaneous Denoising and Dereverberation Using Deep Embedding Features Cunhang Fan J. Tao B. Liu Jiangyan Yi Zhengqi Wen 25 2 0 06 Apr 2020
ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis Eu Wern Teh Terrance Devries Graham W. Taylor 83 159 0 02 Apr 2020
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss Yi Luo N. Mesgarani 81 29 0 27 Mar 2020
Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method Cunhang Fan J. Tao B. Liu Jiangyan Yi Zhengqi Wen Xuefei Liu 61 9 0 17 Mar 2020
Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system K. Kinoshita Marc Delcroix S. Araki Tomohiro Nakatani 238 30 0 09 Mar 2020
Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning Rongzhi Gu Shi-Xiong Zhang Lianwu Chen Yong-mei Xu Meng Yu Dan Su Yuexian Zou Dong Yu 71 61 0 09 Mar 2020
Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning ByungSoo Ko Geonmo Gu 153 54 0 05 Mar 2020
Multi-Microphone Complex Spectral Mapping for Speech Dereverberation Zhong-Qiu Wang DeLiang Wang 56 63 0 04 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers Eliya Nachmani Yossi Adi Lior Wolf 101 175 0 29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering Neil Zeghidour David Grangier VLM 112 265 0 20 Feb 2020
An empirical study of Conv-TasNet Berkan Kadıoğlu Michael Horgan Xiaoyu Liu Jordi Pons Dan Darcy Vivek Kumar 40 44 0 20 Feb 2020
Real-time binaural speech separation with preserved spatial cues Cong Han Yi Luo N. Mesgarani 81 42 0 16 Feb 2020
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention Yuma Koizumi Kohei Yatabe Marc Delcroix Yoshiki Masuyama Daiki Takeuchi 51 125 0 14 Feb 2020
Real-time speech enhancement using equilibriated RNN Daiki Takeuchi Kohei Yatabe Yuma Koizumi Yasuhiro Oikawa Noboru Harada 41 36 0 14 Feb 2020
On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement Ashutosh Pandey DeLiang Wang 89 53 0 10 Feb 2020
End-to-End Multi-speaker Speech Recognition with Transformer Xuankai Chang Wangyou Zhang Y. Qian Jonathan Le Roux Shinji Watanabe ViT 96 106 0 10 Feb 2020
Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features Cunhang Fan B. Liu J. Tao Jiangyan Yi Zhengqi Wen 46 11 0 05 Feb 2020
Continuous speech separation: dataset and analysis Zhuo Chen Takuya Yoshioka Liang Lu Tianyan Zhou Zhong Meng Yi Luo Jian Wu Xiong Xiao Jinyu Li 109 217 0 30 Jan 2020
Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform Tomohiko Nakamura Hiroshi Saruwatari AI4TS 31 18 0 28 Jan 2020
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam Marc Delcroix Tsubasa Ochiai Kateřina Žmolíková K. Kinoshita Naohiro Tawara Tomohiro Nakatani S. Araki 129 124 0 23 Jan 2020
LaFurca: Iterative Refined Speech Separation Based on Context-Aware Dual-Path Parallel Bi-LSTM Ziqiang Shi Rujie Liu Jiqing Han 34 4 0 23 Jan 2020
Audio-visual Recognition of Overlapped speech for the LRS2 dataset Jianwei Yu Shi-Xiong Zhang Jian Wu Shahram Ghorbani Bo Wu Shiyin Kang Shansong Liu Xunying Liu Helen Meng Dong Yu 85 73 0 06 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation Rongzhi Gu Yuexian Zou 60 18 0 02 Jan 2020
Practical applicability of deep neural networks for overlapping speaker separation Pieter Appeltans Jeroen Zegers Hugo Van hamme 23 6 0 19 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization Jeroen Zegers Hugo Van hamme BDL 34 7 0 19 Dec 2019
Advances in Online Audio-Visual Meeting Transcription Takuya Yoshioka Igor Abramovski Cem Aksoylar Zhuo Chen Moshe David ... Huaming Wang Zhenghao Wang Jun Zhang Yong Zhao Tianyan Zhou 95 75 0 10 Dec 2019
Improving Voice Separation by Incorporating End-to-end Speech Recognition Naoya Takahashi M. Singh Sakya Basak Sudarsanam Parthasaarathy Sriram Ganapathy Yuki Mitsufuji VLM 43 19 0 29 Nov 2019
Region segmentation via deep learning and convex optimization Matthias Sonntag V. Morgenshtern 3DPC 20 1 0 28 Nov 2019
Demystifying TasNet: A Dissecting Approach Jens Heitkaemper Darius Jakobeit Christoph Boeddeker Lukas Drude Reinhold Haeb-Umbach 54 58 0 20 Nov 2019
Improving Universal Sound Separation Using Sound Classification Efthymios Tzinis Scott Wisdom J. Hershey A. Jansen D. Ellis VLM 79 73 0 18 Nov 2019
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System Shuo Liu Gil Keren Björn Schuller 57 4 0 16 Nov 2019
Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function M. Togami Yoshiki Masuyama Tatsuya Komatsu Yumi Nakagome 57 25 0 11 Nov 2019
The Speed Submission to DIHARD II: Contributions & Lessons Learned Md. Sahidullah J. Patino Samuele Cornell Ruiqing Yin S. Sivasankaran ... Emmanuel Vincent Nicholas W. D. Evans Sébastien Marcel S. Squartini C. Barras VLM 83 16 0 06 Nov 2019
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision Fatemeh Pishdadian Gordon Wichern Jonathan Le Roux 72 43 0 06 Nov 2019
End-to-end Non-Negative Autoencoders for Sound Source Separation Shrikant Venkataramani Efthymios Tzinis Paris Smaragdis 80 5 0 31 Oct 2019