ResearchTrend.AI

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings

6 January 2021
Xuankai Chang
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka

Papers citing "Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings"

11 / 11 papers shown
Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio
Xinlu He
Jacob Whitehill
19
0
0
16 May 2025
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Can Cui
Imran Ahmad Sheikh
Mostafa Sadeghi
Emmanuel Vincent
47
2
0
11 Mar 2024
On Speaker Attribution with SURT
Desh Raj
Matthew Wiesner
Matthew Maciejewski
Leibny Paola García-Perera
Daniel Povey
Sanjeev Khudanpur
34
3
0
28 Jan 2024
A Glance is Enough: Extract Target Sentence By Looking at A keyword
Ying Shi
Dong Wang
Lantian Li
Jiqing Han
38
1
0
09 Oct 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
Mohan Shi
Zhihao Du
Qian Chen
Fan Yu
Yangze Li
Shiliang Zhang
Jie Zhang
Lirong Dai
36
8
0
21 May 2023
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Xianrui Zheng
Chuxu Zhang
P. Woodland
34
16
0
08 Jul 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
26
16
0
24 Apr 2022
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
25
9
0
10 Dec 2021
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Wangyou Zhang
Zhuo Chen
Naoyuki Kanda
Shujie Liu
Jinyu Li
...
Takuya Yoshioka
Xiong Xiao
Zhong Meng
Y. Qian
Furu Wei
27
6
0
27 Oct 2021
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
24
14
0
06 Jul 2021
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Naoyuki Kanda
Guoli Ye
Yu-Huan Wu
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
39
41
0
31 Mar 2021