Cross-modal Audio-visual Co-learning for Text-independent Speaker
VerificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
How to Teach DNNs to Pay Attention to the Visual Modality in Speech
RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020 |