VarArray Meets t-SOT: Advancing the State of the Art of Streaming
Distant Conversational Speech Recognition

12 September 2022

Takuya Yoshioka

Papers citing "VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition"

Title | Authors | Date
Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription | Can Cui, Imran A. Sheikh, Mostafa Sadeghi, Emmanuel Vincent | 29 Oct 2024
Neural Blind Source Separation and Diarization for Distant Speech Recognition | Yoshiaki Bando, Tomohiko Nakamura, Shinji Watanabe | 12 Jun 2024
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide | 18 Jan 2024
One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition | Samuele Cornell, Jee-weon Jung, Shinji Watanabe, S. Squartini | 02 Oct 2023
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability | Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li | 15 Sep 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments | Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li, Yashesh Gaur | 07 Jul 2023
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios | Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, ..., Matthew Maciejewski, Yoshiki Masuyama, Zhong-Qiu Wang, S. Squartini, Sanjeev Khudanpur | 23 Jun 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition | Desh Raj, Daniel Povey, Sanjeev Khudanpur | 18 Jun 2023
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems | Thilo von Neumann, Christoph Boeddeker, K. Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach | 29 Nov 2022
Simulating realistic speech overlaps improves multi-talker ASR | Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, S. Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka | 27 Oct 2022