Conditional Deep Canonical Time Warping | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
GestSync: Determining who is speaking without a talking head | British Machine Vision Conference (BMVC), 2023 |
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming | International Conference on Machine Learning (ICML), 2023 |
Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors | British Machine Vision Conference (BMVC), 2022 |
Deep Learning for Visual Speech Analysis: A Survey | IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 |
End to End Lip Synchronization with a Temporal AutoEncoder | IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020 |
Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers | Neural Information Processing Systems (NeurIPS), 2021 |
Improved Lite Audio-Visual Speech Enhancement | IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2020 |
AlignNet: A Unifying Approach to Audio-Visual Alignment | IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020 |