Deep clustering: Discriminative embeddings for segmentation and separation

18 August 2015

Papers citing "Deep clustering: Discriminative embeddings for segmentation and separation"

50 / 357 papers shown

SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition Lukas Drude Jens Heitkaemper Christoph Boeddeker Reinhold Haeb-Umbach 66 72 0 30 Oct 2019
Mixup-breakdown: a consistency training method for improving generalization of speech separation models Max W. Y. Lam Jun Wang Dan Su Dong Yu 90 23 0 28 Oct 2019
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet David Ditter Timo Gerkmann 63 58 0 25 Oct 2019
Filterbank design for end-to-end speech separation Manuel Pariente Samuele Cornell Antoine Deleforge Emmanuel Vincent 106 69 0 23 Oct 2019
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation Matthew Maciejewski Gordon Wichern E. McQuinn Jonathan Le Roux 89 184 0 22 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets Efthymios Tzinis Shrikant Venkataramani Zhepei Wang Y. C. Sübakan Paris Smaragdis 73 65 0 22 Oct 2019
Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments Ethan Manilow Prem Seetharaman Bryan Pardo 72 42 0 22 Oct 2019
Discriminative Neural Clustering for Speaker Diarisation Qiujia Li Florian Kreyssig Chao Zhang P. Woodland 61 46 0 22 Oct 2019
Unsupervised Multi-Task Feature Learning on Point Clouds Kaveh Hassani Mike Haley SSL 3DPC 170 195 0 18 Oct 2019
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition Xuankai Chang Wangyou Zhang Y. Qian Jonathan Le Roux Shinji Watanabe 95 121 0 15 Oct 2019
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Yi Luo Zhuo Chen Takuya Yoshioka AI4TS 127 776 0 14 Oct 2019
CochleaNet: A Robust Language-independent Audio-Visual Model for Speech Enhancement M. Gogate K. Dashtipour Ahsan Adeel Amir Hussain 50 53 0 23 Sep 2019
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity Ethan Manilow Gordon Wichern Prem Seetharaman Jonathan Le Roux 77 127 0 18 Sep 2019
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models Naoyuki Kanda Shota Horiguchi Yusuke Fujita Yawen Xue Kenji Nagamatsu Shinji Watanabe 53 36 0 17 Sep 2019
End-to-End Neural Speaker Diarization with Self-attention Yusuke Fujita Naoyuki Kanda Shota Horiguchi Yawen Xue Kenji Nagamatsu Shinji Watanabe 240 243 0 13 Sep 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives Yusuke Fujita Naoyuki Kanda Shota Horiguchi Kenji Nagamatsu Shinji Watanabe 200 255 0 12 Sep 2019
Deep Metric Learning with Density Adaptivity Yehao Li Ting Yao Yingwei Pan Hongyang Chao Tao Mei 144 11 0 09 Sep 2019
Deep Bayesian Unsupervised Source Separation Based on a Complex Gaussian Mixture Model Yoshiaki Bando Y. Sasaki Kazuyoshi Yoshii BDL 52 9 0 29 Aug 2019
Nearest Neighbor Search-Based Bitwise Source Separation Using Discriminant Winner-Take-All Hashing Sunwoo Kim Minje Kim 18 0 0 26 Aug 2019
Audio query-based music source separation Jie Hwan Lee Hyeong-Seok Choi Kyogu Lee 71 45 0 19 Aug 2019
Probabilistic Permutation Invariant Training for Speech Separation Midia Yousefi S. Khorram John H. L. Hansen 47 23 0 04 Aug 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features Cunhang Fan B. Liu J. Tao Jiangyan Yi Zhengqi Wen 51 22 0 23 Jul 2019
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis Yuki Saito Shinnosuke Takamichi Hiroshi Saruwatari 32 10 0 19 Jul 2019
Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming Yoshiki Masuyama M. Togami Tatsuya Komatsu 38 8 0 11 Jul 2019
My lips are concealed: Audio-visual speech enhancement through obstructions Triantafyllos Afouras Joon Son Chung Andrew Zisserman 65 91 0 11 Jul 2019
Improving Reverberant Speech Training Using Diffuse Acoustic Simulation Zhenyu Tang Lianwu Chen Bo Wu Dong Yu Tianyi Zhou AI4CE 75 35 0 09 Jul 2019
WHAM!: Extending Speech Separation to Noisy Environments Gordon Wichern J. Antognini Michael Flynn Licheng Richard Zhu E. McQuinn Dwight Crow Ethan Manilow Jonathan Le Roux 86 355 0 02 Jul 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition Naoyuki Kanda Shota Horiguchi R. Takashima Yusuke Fujita Kenji Nagamatsu Shinji Watanabe 63 34 0 26 Jun 2019
Single-Channel Speech Separation with Auxiliary Speaker Embeddings Shuo Liu Gil Keren Björn Schuller 33 3 0 24 Jun 2019
From Clustering to Cluster Explanations via Neural Networks Jacob R. Kauffmann Malte Esders Lukas Ruff G. Montavon Wojciech Samek K. Müller 79 72 0 18 Jun 2019
Divide and Conquer the Embedding Space for Metric Learning A. Sanakoyeu Vadim Tschernezki U. Büchler Bjorn Ommer SSL 86 107 0 14 Jun 2019
A comprehensive study of speech separation: spectrogram vs waveform separation F. Bahmaninezhad Jian Wu Rongzhi Gu Shi-Xiong Zhang Yong-mei Xu Meng Yu Dong Yu 81 81 0 17 May 2019
End-to-End Multi-Channel Speech Separation Rongzhi Gu Jian Wu Shi-Xiong Zhang Lianwu Chen Yong-mei Xu Meng Yu Dan Su Yuexian Zou Dong Yu 56 77 0 15 May 2019
Machine learning in acoustics: theory and applications Michael J. Bianco Peter Gerstoft James Traer Emma Ozanich M. Roch Sharon Gannot Charles-Alban Deledalle AI4CE 89 391 0 11 May 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech T. Menne Ilya Sklyar Ralf Schlüter Hermann Ney 158 35 0 09 May 2019
Universal Sound Separation Ilya Kavalerov Scott Wisdom Hakan Erdogan Brian Patton K. Wilson Jonathan Le Roux J. Hershey 86 187 0 08 May 2019
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement using Variational Autoencoders Manuel Pariente Antoine Deleforge Emmanuel Vincent 62 21 0 03 May 2019
A Style Transfer Approach to Source Separation Shrikant Venkataramani Efthymios Tzinis Paris Smaragdis OOD DRL 44 6 0 01 May 2019
Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation Yuzhou Liu DeLiang Wang 91 158 0 25 Apr 2019
Self-Supervised Audio-Visual Co-Segmentation Andrew Rouditchenko Hang Zhao Chuang Gan Josh H. McDermott Antonio Torralba VLM SSL 62 105 0 18 Apr 2019
Deep Filtering: Signal Extraction and Reconstruction Using Complex Time-Frequency Filters Wolfgang Mack Emanuel Habets 66 86 0 17 Apr 2019
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Gene-Ping Yang Chao-I Tuan Hung-yi Lee Lin-Shan Lee 61 25 0 16 Apr 2019
Co-Separating Sounds of Visual Objects Ruohan Gao Kristen Grauman 140 210 0 16 Apr 2019
The Sound of Motions Hang Zhao Chuang Gan Wei-Chiu Ma Antonio Torralba 88 254 0 11 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation Fadi Biadsy Ron J. Weiss Pedro J. Moreno D. Kanevsky Ye Jia 88 115 0 08 Apr 2019
Time Domain Audio Visual Speech Separation Jian Wu Yong-mei Xu Shi-Xiong Zhang Lianwu Chen Meng Yu Lei Xie Dong Yu 107 118 0 07 Apr 2019
Unsupervised Image Matching and Object Discovery as Optimization Huy V. Vo Francis R. Bach Minsu Cho Kai Han Yann LeCun P. Pérez Jean Ponce OCL 113 66 0 05 Apr 2019
Recursive speech separation for unknown number of speakers Naoya Takahashi Sudarsanam Parthasaarathy Nabarun Goswami Yuki Mitsufuji 58 81 0 05 Apr 2019
Point Cloud Oversegmentation with Graph-Structured Deep Metric Learning Loic Landrieu Mohamed Boussaha 3DPC 75 152 0 03 Apr 2019
Unsupervised training of a deep clustering model for multichannel blind source separation Lukas Drude Daniel Hasenklever Reinhold Haeb-Umbach SSL 69 58 0 02 Apr 2019