
Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition


Spoken Language Technology Workshop (SLT), 2024
1 September 2024
Hao Shi
Yuan Gao
Zhaoheng Ni
Tatsuya Kawahara
arXiv: 2409.00815

Papers citing "Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition"

3 / 3 papers shown
Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition
Hao Shi
Yusuke Fujita
Tomoya Mizumoto
Lianbo Liu
Atsushi Kojima
Yui Sudo
01 Sep 2025
Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition
Asahi Sakuma
Hiroaki Sato
Ryuga Sugano
Tadashi Kumano
Yoshihiko Kawai
Tetsuji Ogawa
09 Jun 2025
Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiahui Zhao
Hao Shi
Chenrui Cui
Tianrui Wang
Hexin Liu
Zhaoheng Ni
Lingxuan Ye
Longbiao Wang
21 Dec 2024