Guided Speaker Embedding. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Two-pass Endpoint Detection for Speech Recognition. Automatic Speech Recognition & Understanding (ASRU), 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. Spoken Language Technology Workshop (SLT), 2022
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech. Interspeech, 2022
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications. Spoken Language Technology Workshop (SLT), 2022