Continuous speech separation: dataset and analysis

30 January 2020

Jian Wu

Papers citing "Continuous speech separation: dataset and analysis"

50 / 128 papers shown

Speaker Change Detection for Transformer Transducer ASR Jian Wu Zhuo Chen Min Hu Xiong Xiao Jinyu Li 20 4 0 16 Feb 2023
Multi-resolution location-based training for multi-channel continuous speech separation H. Taherian DeLiang Wang 38 7 0 16 Jan 2023
GPU-accelerated Guided Source Separation for Meeting Transcription Desh Raj Daniel Povey Sanjeev Khudanpur 26 35 0 10 Dec 2022
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems Thilo von Neumann Christoph Boeddeker K. Kinoshita Marc Delcroix Reinhold Haeb-Umbach 37 19 0 29 Nov 2022
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing Kohei Saijo Tetsuji Ogawa SSL 22 11 0 18 Nov 2022
Reverberation as Supervision for Speech Separation R. Aralikatti Christoph Boeddeker Gordon Wichern Aswin Shanmugam Subramanian Jonathan Le Roux 24 7 0 15 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation Chang-Bin Jeon Hyeongi Moon Keunwoo Choi Ben Sangbae Chon Kyogu Lee 20 5 0 14 Nov 2022
Simulating realistic speech overlaps improves multi-talker ASR Muqiao Yang Naoyuki Kanda Xiaofei Wang Jian Wu S. Sivasankaran Zhuo Chen Jinyu Li Takuya Yoshioka 23 13 0 27 Oct 2022
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario Fan Yu Shiliang Zhang Pengcheng Guo Yuhao Liang Zhihao Du Yuxiao Lin Linfu Xie 33 11 0 11 Oct 2022
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization Shota Horiguchi Yuki Takashima Shinji Watanabe Leibny Paola García-Perera 36 2 0 07 Oct 2022
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition Naoyuki Kanda Jian Wu Xiaofei Wang Zhuo Chen Jinyu Li Takuya Yoshioka 29 16 0 12 Sep 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding Yen-Ju Lu Xuankai Chang Chenda Li Wangyou Zhang Samuele Cornell ... Robin Scheibler Zhong-Qiu Wang Yu Tsao Y. Qian Shinji Watanabe VLM 24 28 0 19 Jul 2022
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech Ilya Sklyar A. Piunova Christian Osendorfer 11 6 0 10 May 2022
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network Tobias Gburrek Christoph Boeddeker Thilo von Neumann Tobias Cord-Landwehr Joerg Schmalenstroeer Reinhold Haeb-Umbach 11 5 0 02 May 2022
Ultra Fast Speech Separation Model with Teacher Student Learning Sanyuan Chen Yu-Huan Wu Zhuo Chen Jian Wu Takuya Yoshioka Shujie Liu Jinyu Li Xiangzhan Yu 25 14 0 27 Apr 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization Natsuo Yamashita Shota Horiguchi Takeshi Homma 26 16 0 24 Apr 2022
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation Xiaofei Wang Dongmei Wang Naoyuki Kanda Sefik Emre Eskimez Takuya Yoshioka 25 8 0 07 Apr 2022
An Initialization Scheme for Meeting Separation with Spatial Mixture Models Christoph Boeddeker Tobias Cord-Landwehr Thilo von Neumann Reinhold Haeb-Umbach 30 10 0 04 Apr 2022
End-to-end multi-talker audio-visual ASR using an active speaker attention module R. Rose Olivier Siohan 20 3 0 01 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers Soumi Maiti Yushi Ueda Shinji Watanabe Chunlei Zhang Meng Yu Shi-Xiong Zhang Yong-mei Xu 42 33 0 31 Mar 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings Fan Yu Zhihao Du Shiliang Zhang Yuxiao Lin Linfu Xie 22 13 0 31 Mar 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings Naoyuki Kanda Jian Wu Yu Wu Xiong Xiao Zhong Meng Xiaofei Wang Yashesh Gaur Zhuo Chen Jinyu Li Takuya Yoshioka 24 26 0 30 Mar 2022
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Fan Wang Hung-Shin Lee Yu Tsao Hsin-Min Wang 29 4 0 30 Mar 2022
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR Xuankai Chang Niko Moritz Takaaki Hori Shinji Watanabe Jonathan Le Roux 24 6 0 01 Mar 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge Fan Yu Shiliang Zhang Pengcheng Guo Yihui Fu Zhihao Du ... Kong Aik Lee Zhijie Yan B. Ma Xin Xu Hui Bu 18 28 0 08 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation Cem Subakan Mirco Ravanelli Samuele Cornell François Grondin Mirko Bronzi 40 23 0 06 Feb 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training Naoyuki Kanda Jian Wu Yu Wu Xiong Xiao Zhong Meng Xiaofei Wang Yashesh Gaur Zhuo Chen Jinyu Li Takuya Yoshioka 36 54 0 02 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation Chenda Li Lei Yang Weiqin Wang Y. Qian 37 25 0 26 Jan 2022
Endpoint Detection for Streaming End-to-End Multi-talker ASR Liang Lu Jinyu Li Yifan Gong 17 17 0 24 Jan 2022
Multi-turn RNN-T for streaming recognition of multi-party speech Ilya Sklyar A. Piunova Xianrui Zheng Yulan Liu 24 22 0 19 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech Rohit Paturi S. Srinivasan Katrin Kirchhoff Daniel Garcia-Romero 22 9 0 10 Dec 2021
SA-SDR: A novel loss function for separation of meeting style data Thilo von Neumann K. Kinoshita Christoph Boeddeker Marc Delcroix Reinhold Haeb-Umbach 29 20 0 29 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network Yixuan Zhang Zhuo Chen Jian Wu Takuya Yoshioka Peidong Wang Zhong Meng Jinyu Li BDL 27 7 0 28 Oct 2021
Separating Long-Form Speech with Group-Wise Permutation Invariant Training Wangyou Zhang Zhuo Chen Naoyuki Kanda Shujie Liu Jinyu Li ... Takuya Yoshioka Xiong Xiao Zhong Meng Y. Qian Furu Wei VLM 19 6 0 27 Oct 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Michael Zeng Xiangzhan Yu Furu Wei SSL 138 1,721 0 26 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks Darius Petermann Gordon Wichern Zhong-Qiu Wang Jonathan Le Roux 23 37 0 19 Oct 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge Fan Yu Shiliang Zhang Yihui Fu Lei Xie Siqi Zheng ... Pengcheng Guo Zhijie Yan B. Ma Xin Xu Hui Bu 11 106 0 14 Oct 2021
All-neural beamformer for continuous speech separation Zhuohuang Zhang Takuya Yoshioka Naoyuki Kanda Zhuo Chen Xiaofei Wang Dongmei Wang Sefik Emre Eskimez 33 15 0 13 Oct 2021
VarArray: Array-Geometry-Agnostic Continuous Speech Separation Takuya Yoshioka Xiaofei Wang Dongmei Wang M. Tang Zirun Zhu Zhuo Chen Naoyuki Kanda 17 37 0 12 Oct 2021
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR Naoyuki Kanda Xiong Xiao Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 29 36 0 07 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue Zexu Pan Meng Ge Haizhou Li 34 41 0 30 Sep 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers Desh Raj Liang Lu Zhuo Chen Yashesh Gaur Jinyu Li 24 17 0 17 Sep 2021
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation Zhong-Qiu Wang Gordon Wichern Jonathan Le Roux 22 31 0 16 Aug 2021
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Thilo von Neumann K. Kinoshita Christoph Boeddeker Marc Delcroix Reinhold Haeb-Umbach 28 23 0 30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation Thilo von Neumann Christoph Boeddeker K. Kinoshita Marc Delcroix Reinhold Haeb-Umbach 16 6 0 30 Jul 2021
Localization Based Sequential Grouping for Continuous Speech Separation Zhong-Qiu Wang DeLiang Wang 21 12 0 14 Jul 2021
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio Naoyuki Kanda Xiong Xiao Jian Wu Tianyan Zhou Yashesh Gaur Xiaofei Wang Zhong Meng Zhuo Chen Takuya Yoshioka 19 14 0 06 Jul 2021
Investigation of Practical Aspects of Single Channel Speech Separation for ASR Jian Wu Zhuo Chen Sanyuan Chen Yu-Huan Wu Takuya Yoshioka Naoyuki Kanda Shujie Liu Jinyu Li 30 17 0 05 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors Shota Horiguchi Shinji Watanabe Leibny Paola García-Perera Yawen Xue Yuki Takashima Y. Kawaguchi 36 37 0 04 Jul 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization Shota Horiguchi Yusuke Fujita Shinji Watanabe Yawen Xue Leibny Paola García-Perera 37 64 0 20 Jun 2021