ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.11482
  4. Cited By
Continuous speech separation: dataset and analysis

Continuous speech separation: dataset and analysis

30 January 2020
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
ArXivPDFHTML

Papers citing "Continuous speech separation: dataset and analysis"

50 / 128 papers shown
Title
Speaker Change Detection for Transformer Transducer ASR
Speaker Change Detection for Transformer Transducer ASR
Jian Wu
Zhuo Chen
Min Hu
Xiong Xiao
Jinyu Li
20
4
0
16 Feb 2023
Multi-resolution location-based training for multi-channel continuous
  speech separation
Multi-resolution location-based training for multi-channel continuous speech separation
H. Taherian
DeLiang Wang
38
7
0
16 Jan 2023
GPU-accelerated Guided Source Separation for Meeting Transcription
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj
Daniel Povey
Sanjeev Khudanpur
26
35
0
10 Dec 2022
On Word Error Rate Definitions and their Efficient Computation for
  Multi-Speaker Speech Recognition Systems
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
37
19
0
29 Nov 2022
Self-Remixing: Unsupervised Speech Separation via Separation and
  Remixing
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing
Kohei Saijo
Tetsuji Ogawa
SSL
22
11
0
18 Nov 2022
Reverberation as Supervision for Speech Separation
Reverberation as Supervision for Speech Separation
R. Aralikatti
Christoph Boeddeker
Gordon Wichern
Aswin Shanmugam Subramanian
Jonathan Le Roux
24
7
0
15 Nov 2022
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
MedleyVox: An Evaluation Dataset for Multiple Singing Voices Separation
Chang-Bin Jeon
Hyeongi Moon
Keunwoo Choi
Ben Sangbae Chon
Kyogu Lee
20
5
0
14 Nov 2022
Simulating realistic speech overlaps improves multi-talker ASR
Simulating realistic speech overlaps improves multi-talker ASR
Muqiao Yang
Naoyuki Kanda
Xiaofei Wang
Jian Wu
S. Sivasankaran
Zhuo Chen
Jinyu Li
Takuya Yoshioka
23
13
0
27 Oct 2022
MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in
  Multi-party meeting scenario
MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yuhao Liang
Zhihao Du
Yuxiao Lin
Linfu Xie
33
11
0
11 Oct 2022
Mutual Learning of Single- and Multi-Channel End-to-End Neural
  Diarization
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Shota Horiguchi
Yuki Takashima
Shinji Watanabe
Leibny Paola García-Perera
36
2
0
07 Oct 2022
VarArray Meets t-SOT: Advancing the State of the Art of Streaming
  Distant Conversational Speech Recognition
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition
Naoyuki Kanda
Jian Wu
Xiaofei Wang
Zhuo Chen
Jinyu Li
Takuya Yoshioka
29
16
0
12 Sep 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition,
  Translation, and Understanding
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Yen-Ju Lu
Xuankai Chang
Chenda Li
Wangyou Zhang
Samuele Cornell
...
Robin Scheibler
Zhong-Qiu Wang
Yu Tsao
Y. Qian
Shinji Watanabe
VLM
24
28
0
19 Jul 2022
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation
  of Multi-party Speech
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Ilya Sklyar
A. Piunova
Christian Osendorfer
11
6
0
10 May 2022
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
Tobias Gburrek
Christoph Boeddeker
Thilo von Neumann
Tobias Cord-Landwehr
Joerg Schmalenstroeer
Reinhold Haeb-Umbach
11
5
0
02 May 2022
Ultra Fast Speech Separation Model with Teacher Student Learning
Ultra Fast Speech Separation Model with Teacher Student Learning
Sanyuan Chen
Yu-Huan Wu
Zhuo Chen
Jian Wu
Takuya Yoshioka
Shujie Liu
Jinyu Li
Xiangzhan Yu
25
14
0
27 Apr 2022
Improving the Naturalness of Simulated Conversations for End-to-End
  Neural Diarization
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
26
16
0
24 Apr 2022
Leveraging Real Conversational Data for Multi-Channel Continuous Speech
  Separation
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation
Xiaofei Wang
Dongmei Wang
Naoyuki Kanda
Sefik Emre Eskimez
Takuya Yoshioka
25
8
0
07 Apr 2022
An Initialization Scheme for Meeting Separation with Spatial Mixture
  Models
An Initialization Scheme for Meeting Separation with Spatial Mixture Models
Christoph Boeddeker
Tobias Cord-Landwehr
Thilo von Neumann
Reinhold Haeb-Umbach
30
10
0
04 Apr 2022
End-to-end multi-talker audio-visual ASR using an active speaker
  attention module
End-to-end multi-talker audio-visual ASR using an active speaker attention module
R. Rose
Olivier Siohan
20
3
0
01 Apr 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech
  Separation for Flexible Number of Speakers
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
42
33
0
31 Mar 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition
  in Multi-party Meetings
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
Fan Yu
Zhihao Du
Shiliang Zhang
Yuxiao Lin
Linfu Xie
22
13
0
31 Mar 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
Naoyuki Kanda
Jian Wu
Yu Wu
Xiong Xiao
Zhong Meng
Xiaofei Wang
Yashesh Gaur
Zhuo Chen
Jinyu Li
Takuya Yoshioka
24
26
0
30 Mar 2022
Disentangling the Impacts of Language and Channel Variability on Speech
  Separation Networks
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Fan Wang
Hung-Shin Lee
Yu Tsao
Hsin-Min Wang
29
4
0
30 Mar 2022
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR
Xuankai Chang
Niko Moritz
Takaaki Hori
Shinji Watanabe
Jonathan Le Roux
24
6
0
01 Mar 2022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting
  Transcription Grand Challenge
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
Fan Yu
Shiliang Zhang
Pengcheng Guo
Yihui Fu
Zhihao Du
...
Kong Aik Lee
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
18
28
0
08 Feb 2022
Exploring Self-Attention Mechanisms for Speech Separation
Exploring Self-Attention Mechanisms for Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
François Grondin
Mirko Bronzi
40
23
0
06 Feb 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Naoyuki Kanda
Jian Wu
Yu Wu
Xiong Xiao
Zhong Meng
Xiaofei Wang
Yashesh Gaur
Zhuo Chen
Jinyu Li
Takuya Yoshioka
36
54
0
02 Feb 2022
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech
  Separation
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation
Chenda Li
Lei Yang
Weiqin Wang
Y. Qian
37
25
0
26 Jan 2022
Endpoint Detection for Streaming End-to-End Multi-talker ASR
Endpoint Detection for Streaming End-to-End Multi-talker ASR
Liang Lu
Jinyu Li
Yifan Gong
17
17
0
24 Jan 2022
Multi-turn RNN-T for streaming recognition of multi-party speech
Multi-turn RNN-T for streaming recognition of multi-party speech
Ilya Sklyar
A. Piunova
Xianrui Zheng
Yulan Liu
24
22
0
19 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form
  Conversational Speech
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
22
9
0
10 Dec 2021
SA-SDR: A novel loss function for separation of meeting style data
SA-SDR: A novel loss function for separation of meeting style data
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
29
20
0
29 Oct 2021
Continuous Speech Separation with Recurrent Selective Attention Network
Continuous Speech Separation with Recurrent Selective Attention Network
Yixuan Zhang
Zhuo Chen
Jian Wu
Takuya Yoshioka
Peidong Wang
Zhong Meng
Jinyu Li
BDL
27
7
0
28 Oct 2021
Separating Long-Form Speech with Group-Wise Permutation Invariant
  Training
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Wangyou Zhang
Zhuo Chen
Naoyuki Kanda
Shujie Liu
Jinyu Li
...
Takuya Yoshioka
Xiong Xiao
Zhong Meng
Y. Qian
Furu Wei
VLM
19
6
0
27 Oct 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
138
1,721
0
26 Oct 2021
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World
  Soundtracks
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks
Darius Petermann
Gordon Wichern
Zhong-Qiu Wang
Jonathan Le Roux
23
37
0
19 Oct 2021
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription
  Challenge
M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge
Fan Yu
Shiliang Zhang
Yihui Fu
Lei Xie
Siqi Zheng
...
Pengcheng Guo
Zhijie Yan
B. Ma
Xin Xu
Hui Bu
11
106
0
14 Oct 2021
All-neural beamformer for continuous speech separation
All-neural beamformer for continuous speech separation
Zhuohuang Zhang
Takuya Yoshioka
Naoyuki Kanda
Zhuo Chen
Xiaofei Wang
Dongmei Wang
Sefik Emre Eskimez
33
15
0
13 Oct 2021
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Takuya Yoshioka
Xiaofei Wang
Dongmei Wang
M. Tang
Zirun Zhu
Zhuo Chen
Naoyuki Kanda
17
37
0
12 Oct 2021
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number
  of Speakers using End-to-End Speaker-Attributed ASR
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Xiong Xiao
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
29
36
0
07 Oct 2021
USEV: Universal Speaker Extraction with Visual Cue
USEV: Universal Speaker Extraction with Visual Cue
Zexu Pan
Meng Ge
Haizhou Li
34
41
0
30 Sep 2021
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Desh Raj
Liang Lu
Zhuo Chen
Yashesh Gaur
Jinyu Li
24
17
0
17 Sep 2021
Convolutive Prediction for Monaural Speech Dereverberation and
  Noisy-Reverberant Speaker Separation
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation
Zhong-Qiu Wang
Gordon Wichern
Jonathan Le Roux
22
31
0
16 Aug 2021
Graph-PIT: Generalized permutation invariant training for continuous
  separation of arbitrary numbers of speakers
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers
Thilo von Neumann
K. Kinoshita
Christoph Boeddeker
Marc Delcroix
Reinhold Haeb-Umbach
28
23
0
30 Jul 2021
Speeding Up Permutation Invariant Training for Source Separation
Speeding Up Permutation Invariant Training for Source Separation
Thilo von Neumann
Christoph Boeddeker
K. Kinoshita
Marc Delcroix
Reinhold Haeb-Umbach
16
6
0
30 Jul 2021
Localization Based Sequential Grouping for Continuous Speech Separation
Localization Based Sequential Grouping for Continuous Speech Separation
Zhong-Qiu Wang
DeLiang Wang
21
12
0
14 Jul 2021
A Comparative Study of Modular and Joint Approaches for
  Speaker-Attributed ASR on Monaural Long-Form Audio
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
19
14
0
06 Jul 2021
Investigation of Practical Aspects of Single Channel Speech Separation
  for ASR
Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Jian Wu
Zhuo Chen
Sanyuan Chen
Yu-Huan Wu
Takuya Yoshioka
Naoyuki Kanda
Shujie Liu
Jinyu Li
30
17
0
05 Jul 2021
Towards Neural Diarization for Unlimited Numbers of Speakers Using
  Global and Local Attractors
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yawen Xue
Yuki Takashima
Y. Kawaguchi
36
37
0
04 Jul 2021
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Leibny Paola García-Perera
37
64
0
20 Jun 2021
Previous
123
Next