1906.06337
Realistic Speech-Driven Facial Animation with GANs
International Journal of Computer Vision (IJCV), 2019
14 June 2019
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
Papers citing "Realistic Speech-Driven Facial Animation with GANs"
50 / 157 papers shown
Taming Transformer for Emotion-Controllable Talking Face Generation
Ziqi Zhang
Cheng Deng
CVBM
192
0
0
20 Aug 2025
Multi-human Interactive Talking Dataset
Zeyu Zhu
Weijia Wu
Mike Zheng Shou
VGen
225
1
0
05 Aug 2025
Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
Dogucan Yaman
Fevziye Irem Eyiokur
Leonard Barmann
H. K. Ekenel
Alexander H. Waibel
CVBM
249
1
0
28 Jul 2025
OT-Talk: Animating 3D Talking Head with Optimal Transportation
International Conference on Multimedia Retrieval (ICMR), 2025
Xinmu Wang
Xiang Gao
Xiyun Song
Heather Yu
Zongfang Lin
Liang Peng
Xianfeng Gu
427
5
0
03 May 2025
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
423
8
0
01 May 2025
PASE: Phoneme-Aware Speech Encoder to Improve Lip Sync Accuracy for Talking Head Synthesis
Yihuan Huang
Jiajun Liu
Yanzhen Ren
Wuyang Liu
Juhua Tang
Zongkun Sun
358
0
0
08 Apr 2025
KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
Computer Vision and Pattern Recognition (CVPR), 2025
Antoni Bigata
Michał Stypułkowski
Rodrigo Mira
Stella Bounareli
Konstantinos Vougioukas
Zoe Landgraf
Nikita Drobyshev
Maciej Ziȩba
Stavros Petridis
Maja Pantic
DiffM
VGen
453
10
0
03 Mar 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
236
9
0
02 Feb 2025
Towards Dynamic Neural Communication and Speech Neuroprosthesis Based on Viseme Decoding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Ji-Ha Park
Seo-Hyun Lee
Soowon Kim
Seong-Whan Lee
498
5
0
28 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
391
2
0
15 Jan 2025
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
318
16
0
10 Dec 2024
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
International Conference on Learning Representations (ICLR), 2024
Hanbo Cheng
Limin Lin
Chenyu Liu
Pengcheng Xia
Pengfei Hu
Jiefeng Ma
Jun Du
Jia Pan
DiffM
VGen
1.1K
6
0
17 Oct 2024
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Wenqing Wang
Yun Fu
VGen
411
1
0
07 Oct 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
289
24
0
14 Sep 2024
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Hoang-Son Vo-Thanh
Quang Vinh Nguyen
Soo-Hyung Kim
CVBM
180
0
0
09 Sep 2024
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model
ACM Multimedia (MM), 2024
Ziyu Yao
Xuxin Cheng
Zhiqi Huang
DiffM
256
10
0
18 Aug 2024
The impact of differences in facial features between real speakers and 3D face models on synthesized lip motions
Rabab Algadhy
Yoshihiko Gotoh
Steve Maddock
CVBM
233
1
0
24 Jul 2024
EmoFace: Audio-driven Emotional 3D Face Animation
Chang Liu
Qunfen Lin
Zijiao Zeng
Ye Pan
CVBM
280
9
0
17 Jul 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
264
0
0
23 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
414
6
0
15 Jun 2024
Deepfakes and Higher Education: A Research Agenda and Scoping Review of Synthetic Media
Jasper Roe
Mike Perkins
257
29
0
24 Apr 2024
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Hongyun Yu
Zhan Qu
Qihang Yu
Jianchuan Chen
Zhonghua Jiang
...
Shengyu Zhang
Jimin Xu
Leilei Gan
Chengfei Lv
Gang Yu
3DGS
335
38
0
22 Apr 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
258
13
0
19 Apr 2024
MI-NeRF: Learning a Single Face NeRF from Multiple Identities
Aggelina Chatziagapi
Grigorios G. Chrysos
Dimitris Samaras
CVBM
325
4
0
29 Mar 2024
Dyadic Interaction Modeling for Social Behavior Generation
European Conference on Computer Vision (ECCV), 2024
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
441
28
0
14 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
Computer Vision and Pattern Recognition (CVPR), 2024
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGen
DiffM
344
49
0
13 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Computer Vision and Pattern Recognition (CVPR), 2024
Shuai Tan
Bin Ji
Ye Pan
521
44
0
11 Mar 2024
Say Anything with Any Style
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGen
DiffM
245
30
0
11 Mar 2024
CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation
Xi Liu
Ying Guo
Cheng Zhen
Tong Li
Yingying Ao
Pengfei Yan
DiffM
400
21
0
01 Mar 2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
Yasheng Sun
Wenqing Chu
Hang Zhou
Kaisiyuan Wang
Hideki Koike
225
13
0
25 Feb 2024
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Bingyuan Zhang
Xulong Zhang
Ning Cheng
Jun Yu
Jing Xiao
Jianzong Wang
DiffM
313
17
0
16 Jan 2024
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Computer Vision and Pattern Recognition (CVPR), 2024
Evonne Ng
Javier Romero
Timur M. Bagautdinov
Shaojie Bai
Trevor Darrell
Angjoo Kanazawa
Alexander Richard
VGen
300
81
0
03 Jan 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
449
23
0
15 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Computer Vision and Pattern Recognition (CVPR), 2023
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
617
58
0
13 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
217
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
377
2
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
280
2
0
09 Dec 2023
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
International Conference on 3D Vision (3DV), 2023
Xusen Sun
Longhao Zhang
Hao Zhu
Peng Zhang
Bang Zhang
Xinya Ji
Kangneng Zhou
Daiheng Gao
Liefeng Bo
Xun Cao
VGen
412
53
0
04 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM
VGen
339
18
0
01 Dec 2023
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ziqiao Peng
Wentao Hu
Yue Shi
Xiangyu Zhu
Xiaomei Zhang
Hao Zhao
Jun He
Hongyan Liu
Zhaoxin Fan
360
108
0
29 Nov 2023
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation
Mireille Fares
Catherine Pelachaud
Nicolas Obin
200
1
0
09 Nov 2023
DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation
Guinan Su
Yanwu Yang
Zhifeng Li
VGen
302
3
0
08 Nov 2023
Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape
Wei Zhao
Yijun Wang
Tianyu He
Li-Ping Yin
Jianxin Lin
Xin Jin
3DH
214
7
0
31 Oct 2023
Emotional Listener Portrait: Neural Listener Head Generation with Emotion
IEEE International Conference on Computer Vision (ICCV), 2023
Luchuan Song
Guojun Yin
Zhenchao Jin
Xiaoyi Dong
Chenliang Xu
504
19
0
29 Sep 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
202
10
0
28 Sep 2023
Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent
Alice Delbosc
M. Ochs
Nicolas Sabouret
Brian Ravenet
Stéphane Ayache
383
14
0
15 Sep 2023
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2023
Yongyuan Li
Xiuyuan Qin
Chao Liang
Mingqiang Wei
253
5
0
14 Sep 2023
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Ivan Grishchenko
Geng Yan
Eduard Gabriel Bazavan
Andrei Zanfir
Nikolai Chinaev
Karthik Raveendran
Matthias Grundmann
C. Sminchisescu
3DH
CVBM
241
4
0
11 Sep 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Gan
Zongxin Yang
Xihang Yue
Lingyun Sun
Yezhou Yang
421
106
0
10 Sep 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
IEEE International Conference on Computer Vision (ICCV), 2023
Xiuzhe Wu
Pengfei Hu
Yang Wu
Xiaoyang Lyu
Yan-Pei Cao
Ying Shan
Wenming Yang
Zhongqian Sun
Xiaojuan Qi
160
16
0
09 Sep 2023