Realistic Speech-Driven Facial Animation with GANs

International Journal of Computer Vision (IJCV), 2019

14 June 2019

Konstantinos Vougioukas

Stavros Petridis

Maja Pantic

ArXiv (abs)PDF HTML

Papers citing "Realistic Speech-Driven Facial Animation with GANs"

50 / 157 papers shown

Taming Transformer for Emotion-Controllable Talking Face Generation

Ziqi Zhang

Cheng Deng

CVBM

192

20 Aug 2025

Multi-human Interactive Talking Dataset

225

05 Aug 2025

Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation

249

28 Jul 2025

OT-Talk: Animating 3D Talking Head with Optimal TransportationInternational Conference on Multimedia Retrieval (ICMR), 2025

427

03 May 2025

KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Konstantinos Vougioukas

Stavros Petridis

Maja Pantic

423

01 May 2025

PASE: Phoneme-Aware Speech Encoder to Improve Lip Sync Accuracy for Talking Head Synthesis

358

08 Apr 2025

KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame InterpolationComputer Vision and Pattern Recognition (CVPR), 2025

Konstantinos Vougioukas

453

03 Mar 2025

Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities

Rebecca Mobbs

Dimitrios Makris

Vasileios Argyriou

236

02 Feb 2025

Towards Dynamic Neural Communication and Speech Neuroprosthesis Based on Viseme DecodingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

498

28 Jan 2025

Joint Learning of Depth and Appearance for Portrait Image Animation

391

15 Jan 2025

A Review of Human Emotion Synthesis Based on Generative Technology

...

318

10 Dec 2024

DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video GenerationInternational Conference on Learning Representations (ICLR), 2024

1.1K

17 Oct 2024

EmoGene: Audio-Driven Emotional 3D Talking-Head GenerationIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024

Wenqing Wang

Yun Fu

VGen

411

07 Oct 2024

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking HeadsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

Suzhen Wang

Yifeng Ma

Yu Ding

Zhipeng Hu

Changjie Fan

Tangjie Lv

Zhidong Deng

Xin Yu

289

14 Sep 2024

KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation

180

09 Sep 2024

FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion ModelACM Multimedia (MM), 2024

256

18 Aug 2024

The impact of differences in facial features between real speakers and 3D face models on synthesized lip motions

233

24 Jul 2024

EmoFace: Audio-driven Emotional 3D Face Animation

280

17 Jul 2024

Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation

Rafael Redondo

264

23 Jun 2024

A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing

414

15 Jun 2024

Deepfakes and Higher Education: A Research Agenda and Scoping Review of Synthetic Media

Jasper Roe

Mike Perkins

257

24 Apr 2024

GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting

...

335

22 Apr 2024

Learn2Talk: 3D Talking Face Learns from 2D Talking Face

258

19 Apr 2024

MI-NeRF: Learning a Single Face NeRF from Multiple Identities

325

29 Mar 2024

Dyadic Interaction Modeling for Social Behavior GenerationEuropean Conference on Computer Vision (ECCV), 2024

441

14 Mar 2024

VLOGGER: Multimodal Diffusion for Embodied Avatar SynthesisComputer Vision and Pattern Recognition (CVPR), 2024

Enric Corona

Andrei Zanfir

Eduard Gabriel Bazavan

344

13 Mar 2024

FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and QuantizationComputer Vision and Pattern Recognition (CVPR), 2024

Shuai Tan

Bin Ji

Ye Pan

521

11 Mar 2024

Say Anything with Any StyleAAAI Conference on Artificial Intelligence (AAAI), 2024

245

11 Mar 2024

CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation

Pengfei Yan

400

01 Mar 2024

AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation

225

25 Feb 2024

EmoTalker: Emotionally Editable Talking Face Generation via Diffusion ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

313

16 Jan 2024

From Audio to Photoreal Embodiment: Synthesizing Humans in ConversationsComputer Vision and Pattern Recognition (CVPR), 2024

300

03 Jan 2024

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

449

15 Dec 2023

FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head ModelsComputer Vision and Pattern Recognition (CVPR), 2023

Matthias Nießner

617

13 Dec 2023

GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face GuidanceIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023

217

12 Dec 2023

GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits

Yibo Xia

Lizhen Wang

Xiang Deng

Xiaoyan Luo

Yunhong Wang

Yebin Liu

VGen

377

12 Dec 2023

R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

280

09 Dec 2023

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid PriorInternational Conference on 3D Vision (3DV), 2023

Xun Cao

412

04 Dec 2023

3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing

Balamurugan Thambiraja

339

01 Dec 2023

SyncTalk: The Devil is in the Synchronization for Talking Head SynthesisComputer Vision and Pattern Recognition (CVPR), 2023

Hao Zhao

Jun He

Hongyan Liu

Zhaoxin Fan

360

108

29 Nov 2023

META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation

Mireille Fares

Catherine Pelachaud

Nicolas Obin

200

09 Nov 2023

DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation

302

08 Nov 2023

Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape

Tianyu He

214

31 Oct 2023

Emotional Listener Portrait: Neural Listener Head Generation with EmotionIEEE International Conference on Computer Vision (ICCV), 2023

504

29 Sep 2023

OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions

202

28 Sep 2023

Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent

383

15 Sep 2023

HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation MethodsChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023

253

14 Sep 2023

Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction

Ivan Grishchenko

Geng Yan

Eduard Gabriel Bazavan

241

11 Sep 2023

Efficient Emotional Adaptation for Audio-Driven Talking-Head GenerationIEEE International Conference on Computer Vision (ICCV), 2023

421

106

10 Sep 2023

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short VideoIEEE International Conference on Computer Vision (ICCV), 2023

Ying Shan

Xiaojuan Qi

160

09 Sep 2023