TD3Net: A temporal densely connected multi-dilated convolutional network for lipreadingJournal of Visual Communication and Image Representation (JVCIR), 2025 |
Multi-Temporal Lip-Audio Memory for Visual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
Deep Learning for Visual Speech Analysis: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 |
Lip to Speech Synthesis with Visual Context Attentional GANNeural Information Processing Systems (NeurIPS), 2022 |
Multi-modality Associative Bridging through Memory: Speech Sound
Recollected from Face VideoIEEE International Conference on Computer Vision (ICCV), 2021 |
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip
ReadingInternational Joint Conference on Computational Intelligence (IJCCI), 2021 |
Towards Practical Lipreading with Distilled and Efficient ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |
Mutual Information Maximization for Effective Lip ReadingIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020 |
Deformation Flow Based Two-Stream Network for Lip ReadingIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020 |
Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence
Lip-ReadingIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020 |
Lipreading using Temporal Convolutional NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020 |