Exploring Text-Queried Sound Event Detection with Audio Source SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction ModuleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |