See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region Refinement. IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Human Motion Video Generation: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection. Neural Information Processing Systems (NeurIPS), 2025
SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models. Visual Communications and Image Processing (VCIP), 2024
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention. International Conference on Learning Representations (ICLR), 2025
Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images. IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025