Realistic Speech-Driven Facial Animation with GANs


International Journal of Computer Vision (IJCV), 2019
14 June 2019
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic

Papers citing "Realistic Speech-Driven Facial Animation with GANs"

50 / 157 papers shown
Taming Transformer for Emotion-Controllable Talking Face Generation
Ziqi Zhang
Cheng Deng
CVBM
192
0
0
20 Aug 2025
Multi-human Interactive Talking Dataset
Zeyu Zhu
Weijia Wu
Mike Zheng Shou
VGen
225
1
0
05 Aug 2025
Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
Dogucan Yaman
Fevziye Irem Eyiokur
Leonard Barmann
H. K. Ekenel
Alexander H. Waibel
CVBM
249
1
0
28 Jul 2025
OT-Talk: Animating 3D Talking Head with Optimal Transportation
International Conference on Multimedia Retrieval (ICMR), 2025
Xinmu Wang
Xiang Gao
Xiyun Song
Heather Yu
Zongfang Lin
Liang Peng
Xianfeng Gu
427
5
0
03 May 2025
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
423
8
0
01 May 2025
PASE: Phoneme-Aware Speech Encoder to Improve Lip Sync Accuracy for Talking Head Synthesis
Yihuan Huang
Jiajun Liu
Yanzhen Ren
Wuyang Liu
Juhua Tang
Zongkun Sun
358
0
0
08 Apr 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Computer Vision and Pattern Recognition (CVPR), 2025
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
Maja Pantic
DiffM, VGen
453
10
0
03 Mar 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
236
9
0
02 Feb 2025
Towards Dynamic Neural Communication and Speech Neuroprosthesis Based on Viseme Decoding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Ji-Ha Park
Seo-Hyun Lee
Soowon Kim
Seong-Whan Lee
498
5
0
28 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH, MDE
391
2
0
15 Jan 2025
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
318
16
0
10 Dec 2024
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
International Conference on Learning Representations (ICLR), 2024
Hanbo Cheng
Limin Lin
Chenyu Liu
Pengcheng Xia
Pengfei Hu
Jiefeng Ma
Jun Du
Jia Pan
DiffM, VGen
1.1K
6
0
17 Oct 2024
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Wenqing Wang
Yun Fu
VGen
411
1
0
07 Oct 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
289
24
0
14 Sep 2024
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Hoang-Son Vo-Thanh
Quang Vinh Nguyen
Soo-Hyung Kim
CVBM
180
0
0
09 Sep 2024
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model
ACM Multimedia (MM), 2024
Ziyu Yao
Xuxin Cheng
Zhiqi Huang
DiffM
256
10
0
18 Aug 2024
The impact of differences in facial features between real speakers and 3D face models on synthesized lip motions
Rabab Algadhy
Yoshihiko Gotoh
Steve Maddock
CVBM
233
1
0
24 Jul 2024
EmoFace: Audio-driven Emotional 3D Face Animation
Chang Liu
Qunfen Lin
Zijiao Zeng
Ye Pan
CVBM
280
9
0
17 Jul 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
264
0
0
23 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
414
6
0
15 Jun 2024
Deepfakes and Higher Education: A Research Agenda and Scoping Review of Synthetic Media
Jasper Roe
Mike Perkins
257
29
0
24 Apr 2024
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Hongyun Yu
Zhan Qu
Qihang Yu
Jianchuan Chen
Zhonghua Jiang
...
Shengyu Zhang
Jimin Xu
Leilei Gan
Chengfei Lv
Gang Yu
3DGS
335
38
0
22 Apr 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM, 3DH
258
13
0
19 Apr 2024
MI-NeRF: Learning a Single Face NeRF from Multiple Identities
Aggelina Chatziagapi
Grigorios G. Chrysos
Dimitris Samaras
CVBM
325
4
0
29 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
European Conference on Computer Vision (ECCV), 2024
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
441
28
0
14 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Computer Vision and Pattern Recognition (CVPR), 2024
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGen, DiffM
344
49
0
13 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Computer Vision and Pattern Recognition (CVPR), 2024
Shuai Tan
Bin Ji
Ye Pan
521
44
0
11 Mar 2024
Say Anything with Any Style
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGen, DiffM
245
30
0
11 Mar 2024
CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
Xi Liu
Ying Guo
Cheng Zhen
Tong Li
Yingying Ao
Pengfei Yan
DiffM
400
21
0
01 Mar 2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
Yasheng Sun
Wenqing Chu
Hang Zhou
Kaisiyuan Wang
Hideki Koike
225
13
0
25 Feb 2024
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Bingyuan Zhang
Xulong Zhang
Ning Cheng
Jun Yu
Jing Xiao
Jianzong Wang
DiffM
313
17
0
16 Jan 2024
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Computer Vision and Pattern Recognition (CVPR), 2024
Evonne Ng
Javier Romero
Timur M. Bagautdinov
Shaojie Bai
Trevor Darrell
Angjoo Kanazawa
Alexander Richard
VGen
300
81
0
03 Jan 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
449
23
0
15 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Computer Vision and Pattern Recognition (CVPR), 2023
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM, VGen
617
58
0
13 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
217
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
377
2
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
280
2
0
09 Dec 2023
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
International Conference on 3D Vision (3DV), 2023
Xusen Sun
Longhao Zhang
Hao Zhu
Peng Zhang
Bang Zhang
Xinya Ji
Kangneng Zhou
Daiheng Gao
Liefeng Bo
Xun Cao
VGen
412
53
0
04 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM, VGen
339
18
0
01 Dec 2023
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ziqiao Peng
Wentao Hu
Yue Shi
Xiangyu Zhu
Xiaomei Zhang
Hao Zhao
Jun He
Hongyan Liu
Zhaoxin Fan
360
108
0
29 Nov 2023
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation
Mireille Fares
Catherine Pelachaud
Nicolas Obin
200
1
0
09 Nov 2023
DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation
Guinan Su
Yanwu Yang
Zhifeng Li
VGen
302
3
0
08 Nov 2023
Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape
Wei Zhao
Yijun Wang
Tianyu He
Li-Ping Yin
Jianxin Lin
Xin Jin
3DH
214
7
0
31 Oct 2023
Emotional Listener Portrait: Neural Listener Head Generation with Emotion
IEEE International Conference on Computer Vision (ICCV), 2023
Luchuan Song
Guojun Yin
Zhenchao Jin
Xiaoyi Dong
Chenliang Xu
504
19
0
29 Sep 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
202
10
0
28 Sep 2023
Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent
Alice Delbosc
M. Ochs
Nicolas Sabouret
Brian Ravenet
Stéphane Ayache
383
14
0
15 Sep 2023
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Yongyuan Li
Xiuyuan Qin
Chao Liang
Mingqiang Wei
253
5
0
14 Sep 2023
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Ivan Grishchenko
Geng Yan
Eduard Gabriel Bazavan
Andrei Zanfir
Nikolai Chinaev
Karthik Raveendran
Matthias Grundmann
C. Sminchisescu
3DH, CVBM
241
4
0
11 Sep 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Gan
Zongxin Yang
Xihang Yue
Lingyun Sun
Yezhou Yang
421
106
0
10 Sep 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
IEEE International Conference on Computer Vision (ICCV), 2023
Xiuzhe Wu
Pengfei Hu
Yang Wu
Xiaoyang Lyu
Yan-Pei Cao
Ying Shan
Wenming Yang
Zhongqian Sun
Xiaojuan Qi
160
16
0
09 Sep 2023