Generative Context-aware Fine-tuning of Self-supervised Speech Models. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2023
Context-aware Fine-tuning of Self-supervised Speech Models. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR. Interspeech, 2022