Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2409.00815
Cited By
v1
v2
v3 (latest)
Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
Spoken Language Technology Workshop (SLT), 2024
1 September 2024
Hao Shi
Yuan Gao
Zhaoheng Ni
Tatsuya Kawahara
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition"
3 / 3 papers shown
Title
Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition
Hao Shi
Yusuke Fujita
Tomoya Mizumoto
Lianbo Liu
Atsushi Kojima
Yui Sudo
52
0
0
01 Sep 2025
Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition
Asahi Sakuma
Hiroaki Sato
Ryuga Sugano
Tadashi Kumano
Yoshihiko Kawai
Tetsuji Ogawa
102
1
0
09 Jun 2025
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiahui Zhao
Hao Shi
Chenrui Cui
Tianrui Wang
Hexin Liu
Zhaoheng Ni
Lingxuan Ye
Longbiao Wang
488
4
0
21 Dec 2024
1