Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for
Speech RecognitionSpoken Language Technology Workshop (SLT), 2024 |
A Transcription Prompt-based Efficient Audio Large Language Model for
Robust Speech RecognitionInterspeech (Interspeech), 2024 |
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Zhihao Du Jiaming Wang Qian Chen Yunfei Chu Zhifu Gao ...Wen Wang Siqi Zheng Chang Zhou Zhijie Yan Shiliang Zhang |