Title |
---|
![]() META-CAT: Speaker-Informed Speech Embeddings via Meta Information
Concatenation for Multi-talker ASR Jinhan Wang Weiqing Wang Kunal Dhawan Taejin Park Myungjong Kim Ivan Medennikov He Huang Nithin Koluguri Jagadeesh Balam Boris Ginsburg |
![]() Sortformer: Seamless Integration of Speaker Diarization and ASR by
Bridging Timestamps and Tokens Taejin Park Ivan Medennikov Kunal Dhawan Weiqing Wang He Huang Nithin Rao Koluguri Krishna Puvvada Jagadeesh Balam Boris Ginsburg |
![]() Resource-Efficient Adaptation of Speech Foundation Models for
Multi-Speaker ASR Weiqing Wang Kunal Dhawan Taejin Park Krishna Puvvada Ivan Medennikov Somshubra Majumdar He Huang Jagadeesh Balam Boris Ginsburg |
![]() LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant
Multi-Talker Speech Separation, ASR and Speaker Diarization Zengrui Jin Yifan Yang Mohan Shi Wei Kang Xiaoyu Yang ...Lingwei Meng Long Lin Yong Xu Shi-Xiong Zhang Daniel Povey |
![]() Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition Peng Shen Xugang Lu Hisashi Kawai |