See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region RefinementIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025 |
Human Motion Video Generation: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025 |
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent AttentionInternational Conference on Learning Representations (ICLR), 2025 |
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head GenerationComputer Vision and Pattern Recognition (CVPR), 2025 |
Exploring Timeline Control for Facial Motion GenerationComputer Vision and Pattern Recognition (CVPR), 2025 |
DATA: Multi-Disentanglement based Contrastive Learning for Open-World Semi-Supervised Deepfake AttributionIEEE transactions on multimedia (TMM), 2025 |