Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.10137
Cited By
v1
v2 (latest)
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
24 February 2020
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong Liu
CVBM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"
50 / 90 papers shown
Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition
Maheswar Bora
Tashvik Dhamija
Shukesh Reddy
Baptiste Chopin
P. Balaji
Abhijit Das
A. Dantcheva
156
0
0
27 Nov 2025
Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach
Fan Nie
Jiangqun Ni
J. Zhang
Bin Zhang
Weizhe Zhang
Bin Li
AAML
229
2
0
24 Nov 2025
Referee: Reference-aware Audiovisual Deepfake Detection
Hyemin Boo
Eunsang Lee
Jiyoung Lee
142
0
0
31 Oct 2025
SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection
Neural Information Processing Systems (NeurIPS), 2025
Yachao Liang
Min Yu
Gang Li
Jianguo Jiang
B. Li
Feng Yu
Ning Zhang
Xiang Meng
Weiqing Huang
AAML
249
10
0
13 Aug 2025
Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Takuma Amada
Kazuya Kakizaki
Taiki Miyagawa
Akinori F. Ebihara
Kaede Shiohara
T. Yamasaki
181
1
0
30 Jul 2025
JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync
Sungjoon Park
Minsik Park
Haneol Lee
Jaesub Yun
Donggeon Lee
3DH
295
0
0
28 Jul 2025
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
Hyung Kyu Kim
Sangmin Lee
Hak Gu Kim
236
4
0
28 Jul 2025
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
373
4
0
02 Apr 2025
Personalized Generation In Large Model Era: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
643
42
0
04 Mar 2025
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
Computer Vision and Pattern Recognition (CVPR), 2025
Jiahe Li
Jiawei Zhang
Xiao Bai
Jin Zheng
J. Zhou
L. Gu
455
12
0
27 Feb 2025
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous Vehicles
IEEE Access (IEEE Access), 2024
Ashish Bastola
Julian Brinkley
Hao Wang
Abolfazl Razi
A. Moshayedi
Abolfazl Razi
393
5
0
10 Jan 2025
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
Xuyang Cao
Guoxin Wang
Sheng Shi
Jun Zhao
Yang Yao
Jintao Fei
Minyu Gao
Pei Xie
VGen
553
6
0
14 Nov 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Neural Information Processing Systems (NeurIPS), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Ziyue Jiang
Jiawei Huang
...
Chen Zhang
Zehan Wang
Xize Chen
Xiang Yin
Zhou Zhao
VGen
360
21
0
09 Oct 2024
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
Computer Science Review (CSR), 2024
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
Canh Vu
720
19
0
23 Sep 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
289
23
0
14 Sep 2024
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis
European Conference on Computer Vision (ECCV), 2024
Dongze Li
Kang Zhao
Wei Wang
Yifeng Ma
Bo Peng
Yingya Zhang
Jing Dong
3DH
CVBM
265
5
0
18 Aug 2024
Content and Style Aware Audio-Driven Facial Animation
British Machine Vision Conference (BMVC), 2024
Qingju Liu
Hyeongwoo Kim
Gaurav Bharaj
DiffM
417
2
0
13 Aug 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
414
6
0
15 Jun 2024
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Trevine Oorloff
Surya Koppisetti
Nicolo Bonettini
Divyaraj Solanki
Ben Colman
Yaser Yacoob
Ali Shahriyari
Gaurav Bharaj
423
94
0
05 Jun 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Computer Vision and Pattern Recognition (CVPR), 2024
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
322
23
0
16 May 2024
Dyadic Interaction Modeling for Social Behavior Generation
European Conference on Computer Vision (ECCV), 2024
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
440
28
0
14 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
Computer Vision and Pattern Recognition (CVPR), 2024
Shuai Tan
Bin Ji
Ye Pan
514
44
0
11 Mar 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Zicheng Zhang
Ruobing Zheng
Ziwen Liu
Congying Han
Tianqi Li
Meng Wang
Tiande Guo
Jingdong Chen
Bonan li
Ming Yang
3DH
243
14
0
27 Feb 2024
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
317
22
0
18 Jan 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
International Conference on Learning Representations (ICLR), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Jiaqi Yang
Weichuang Li
...
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
366
98
0
16 Jan 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
447
23
0
15 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
216
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
377
2
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
280
2
0
09 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffM
VGen
336
18
0
01 Dec 2023
THInImg: Cross-modal Steganography for Presenting Talking Heads in Images
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Lin Zhao
Hongxuan Li
Xuefei Ning
Xinru Jiang
270
2
0
28 Nov 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
201
9
0
28 Sep 2023
ReliTalk: Relightable Talking Portrait Generation from a Single Video
International Journal of Computer Vision (IJCV), 2023
Haonan Qiu
Zhaoxi Chen
Yuming Jiang
Hang Zhou
Xiangyu Fan
Lei Yang
Wayne Wu
Ziwei Liu
DiffM
VGen
275
16
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGen
DiffM
384
2
0
05 Sep 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Xiao-Yu Zhang
285
17
0
31 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
225
9
0
17 Aug 2023
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements
ACM Multimedia (ACM MM), 2023
Haozhe Wu
Songtao Zhou
Jia Jia
Junliang Xing
Qi Wen
Xiang Wen
CVBM
359
25
0
10 Aug 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
275
1
0
29 Jul 2023
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
IEEE International Conference on Computer Vision (ICCV), 2023
Yunfei Liu
Lijian Lin
Fei Yu
Changyin Zhou
Yu Li
DiffM
VGen
268
40
0
19 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
184
1
0
04 Jul 2023
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment
Computer Vision and Pattern Recognition (CVPR), 2023
Ricong Huang
Puxiang Lai
Yipeng Qin
Guanbin Li
CVBM
DiffM
262
16
0
13 Jun 2023
IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map
Han-Lim Lee
Yu-Te Ku
Eunseok Kim
Seungryul Baek
3DH
180
0
0
08 Jun 2023
LPMM: Intuitive Pose Control for Neural Talking-Head Model via Landmark-Parameter Morphable Model
K. Lee
Patrick Kwon
Myung Ki Lee
Namhyuk Ahn
Junsoo Lee
358
2
0
17 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Computer Vision and Pattern Recognition (CVPR), 2023
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
310
112
0
09 May 2023
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Chao Xu
Sijun Tan
Jibang Wu
Yue Han
Wenqing Chu
Xiaohui Bei
Chengjie Wang
Haifeng Xu
Yong Liu
CVBM
282
45
0
04 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
247
57
0
01 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Pattern Recognition (Pattern Recogn.), 2023
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
253
16
0
18 Apr 2023
That's What I Said: Fully-Controllable Talking Face Generation
ACM Multimedia (ACM MM), 2023
Youngjoon Jang
Kyeongha Rho
Jong-Bin Woo
Hyeongkeun Lee
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Joon Son Chung
CVBM
318
13
0
06 Apr 2023
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder
ACM Multimedia (ACM MM), 2023
Chenpeng Du
Qi Chen
Xie Chen
K. Yu
DiffM
506
70
0
30 Mar 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
Computational Visual Media (CVM), 2023
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
223
15
0
24 Mar 2023
1
2
Next
Page 1 of 2