Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens

10 September 2024

Taejin Park

Kunal Dhawan

Nithin Rao Koluguri

Jagadeesh Balam

Boris Ginsburg

Papers citing "Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens"

1 / 1 papers shown

Title
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction Ming Cheng Weiqing Wang Yucong Zhang Xiaoyi Qin Ming Li VLM 48 32 0 28 Oct 2022