ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.10137
  4. Cited By
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose
v1v2 (latest)

Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose

24 February 2020
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong Liu
    CVBM
ArXiv (abs)PDFHTML

Papers citing "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"

50 / 90 papers shown
Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition
Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition
Maheswar Bora
Tashvik Dhamija
Shukesh Reddy
Baptiste Chopin
P. Balaji
Abhijit Das
A. Dantcheva
156
0
0
27 Nov 2025
Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach
Towards Generalizable Deepfake Detection via Forgery-aware Audio-Visual Adaptation: A Variational Bayesian Approach
Fan Nie
Jiangqun Ni
J. Zhang
Bin Zhang
Weizhe Zhang
Bin Li
AAML
229
2
0
24 Nov 2025
Referee: Reference-aware Audiovisual Deepfake Detection
Referee: Reference-aware Audiovisual Deepfake Detection
Hyemin Boo
Eunsang Lee
Jiyoung Lee
142
0
0
31 Oct 2025
SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection
SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery DetectionNeural Information Processing Systems (NeurIPS), 2025
Yachao Liang
Min Yu
Gang Li
Jianguo Jiang
B. Li
Feng Yu
Ning Zhang
Xiang Meng
Weiqing Huang
AAML
249
10
0
13 Aug 2025
Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered Images
Robust Deepfake Detection for Electronic Know Your Customer Systems Using Registered ImagesIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Takuma Amada
Kazuya Kakizaki
Taiki Miyagawa
Akinori F. Ebihara
Kaede Shiohara
T. Yamasaki
181
1
0
30 Jul 2025
JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync
JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync
Sungjoon Park
Minsik Park
Haneol Lee
Jaesub Yun
Donggeon Lee
3DH
295
0
0
28 Jul 2025
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
Hyung Kyu Kim
Sangmin Lee
Hak Gu Kim
236
4
0
28 Jul 2025
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
Detecting Lip-Syncing Deepfakes: Vision Temporal Transformer for Analyzing Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
373
4
0
02 Apr 2025
Personalized Generation In Large Model Era: A Survey
Personalized Generation In Large Model Era: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
643
42
0
04 Mar 2025
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
InsTaG: Learning Personalized 3D Talking Head from Few-Second VideoComputer Vision and Pattern Recognition (CVPR), 2025
Jiahe Li
Jiawei Zhang
Xiao Bai
Jin Zheng
J. Zhou
L. Gu
455
12
0
27 Feb 2025
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous Vehicles
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous VehiclesIEEE Access (IEEE Access), 2024
Ashish Bastola
Julian Brinkley
Hao Wang
Abolfazl Razi
A. Moshayedi
Abolfazl Razi
393
5
0
10 Jan 2025
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
Xuyang Cao
Guoxin Wang
Sheng Shi
Jun Zhao
Yang Yao
Jintao Fei
Minyu Gao
Pei Xie
VGen
553
6
0
14 Nov 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in
  minutes
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutesNeural Information Processing Systems (NeurIPS), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Ziyue Jiang
Jiawei Huang
...
Chen Zhang
Zehan Wang
Xize Chen
Xiang Yin
Zhou Zhao
VGen
360
21
0
09 Oct 2024
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
A Comprehensive Survey with Critical Analysis for Deepfake Speech DetectionComputer Science Review (CSR), 2024
Lam Pham
Phat Lam
Dat Tran
Hieu Tang
Tin Nguyen
Alexander Schindler
Canh Vu
Alexander Polonsky
Canh Vu
720
19
0
23 Sep 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of
  Talking Heads
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking HeadsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
289
23
0
14 Sep 2024
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High
  Fidelity Talking Head Synthesis
S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head SynthesisEuropean Conference on Computer Vision (ECCV), 2024
Dongze Li
Kang Zhao
Wei Wang
Yifeng Ma
Bo Peng
Yingya Zhang
Jing Dong
3DHCVBM
265
5
0
18 Aug 2024
Content and Style Aware Audio-Driven Facial Animation
Content and Style Aware Audio-Driven Facial AnimationBritish Machine Vision Conference (BMVC), 2024
Qingju Liu
Hyeongwoo Kim
Gaurav Bharaj
DiffM
417
2
0
13 Aug 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis:
  Techniques for Portrait Generation, Driving Mechanisms, and Editing
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
414
6
0
15 Jun 2024
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Trevine Oorloff
Surya Koppisetti
Nicolo Bonettini
Divyaraj Solanki
Ben Colman
Yaser Yacoob
Ali Shahriyari
Gaurav Bharaj
423
94
0
05 Jun 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Faces that Speak: Jointly Synthesising Talking Face and Speech from TextComputer Vision and Pattern Recognition (CVPR), 2024
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
322
23
0
16 May 2024
Dyadic Interaction Modeling for Social Behavior Generation
Dyadic Interaction Modeling for Social Behavior GenerationEuropean Conference on Computer Vision (ECCV), 2024
Minh Tran
Di Chang
Maksim Siniukov
Mohammad Soleymani
VGen
440
28
0
14 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through
  Normalizing Flow and Quantization
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and QuantizationComputer Vision and Pattern Recognition (CVPR), 2024
Shuai Tan
Bin Ji
Ye Pan
514
44
0
11 Mar 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Zicheng Zhang
Ruobing Zheng
Ziwen Liu
Congying Han
Tianqi Li
Meng Wang
Tiande Guo
Jingdong Chen
Bonan li
Ming Yang
3DH
243
14
0
27 Feb 2024
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Exposing Lip-syncing Deepfakes from Mouth Inconsistencies
Soumyya Kanti Datta
Shan Jia
Siwei Lyu
317
22
0
18 Jan 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
Real3D-Portrait: One-shot Realistic 3D Talking Portrait SynthesisInternational Conference on Learning Representations (ICLR), 2024
Zhenhui Ye
Tianyun Zhong
Yi Ren
Jiaqi Yang
Weichuang Li
...
Jinglin Liu
Chen Zhang
Xiang Yin
Zejun Ma
Zhou Zhao
366
98
0
16 Jan 2024
DreamTalk: When Expressive Talking Head Generation Meets Diffusion
  Probabilistic Models
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
447
23
0
15 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained
  3D Face Guidance
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face GuidanceIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
216
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
377
2
0
12 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid
  Landmarks Encoding and Progressive Multilayer Conditioning
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
280
2
0
09 Dec 2023
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
3DiFACE: Diffusion-based Speech-driven 3D Facial Animation and Editing
Balamurugan Thambiraja
S. Aliakbarian
Darren Cosker
Justus Thies
DiffMVGen
336
18
0
01 Dec 2023
THInImg: Cross-modal Steganography for Presenting Talking Heads in
  Images
THInImg: Cross-modal Steganography for Presenting Talking Heads in ImagesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Lin Zhao
Hongxuan Li
Xuefei Ning
Xinru Jiang
270
2
0
28 Nov 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous
  Head Motions
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
201
9
0
28 Sep 2023
ReliTalk: Relightable Talking Portrait Generation from a Single Video
ReliTalk: Relightable Talking Portrait Generation from a Single VideoInternational Journal of Computer Vision (IJCV), 2023
Haonan Qiu
Zhaoxi Chen
Yuming Jiang
Hang Zhou
Xiangyu Fan
Lei Yang
Wayne Wu
Ziwei Liu
DiffMVGen
275
16
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
RADIO: Reference-Agnostic Dubbing Video SynthesisIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGenDiffM
384
2
0
05 Sep 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware
  Semi-Parametric Synthesis
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Xiao-Yu Zhang
285
17
0
31 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and
  Generation
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
225
9
0
17 Aug 2023
Speech-Driven 3D Face Animation with Composite and Regional Facial
  Movements
Speech-Driven 3D Face Animation with Composite and Regional Facial MovementsACM Multimedia (ACM MM), 2023
Haozhe Wu
Songtao Zhou
Jia Jia
Junliang Xing
Qi Wen
Xiang Wen
CVBM
359
25
0
10 Aug 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven
  Diffusion Models
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
275
1
0
29 Jul 2023
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions
MODA: Mapping-Once Audio-driven Portrait Animation with Dual AttentionsIEEE International Conference on Computer Vision (ICCV), 2023
Yunfei Liu
Lijian Lin
Fei Yu
Changyin Zhou
Yu Li
DiffMVGen
268
40
0
19 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony
  in Talking Head Generation
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
184
1
0
04 Jul 2023
Parametric Implicit Face Representation for Audio-Driven Facial
  Reenactment
Parametric Implicit Face Representation for Audio-Driven Facial ReenactmentComputer Vision and Pattern Recognition (CVPR), 2023
Ricong Huang
Puxiang Lai
Yipeng Qin
Guanbin Li
CVBMDiffM
262
16
0
13 Jun 2023
IFaceUV: Intuitive Motion Facial Image Generation by Identity
  Preservation via UV map
IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map
Han-Lim Lee
Yu-Te Ku
Eunseok Kim
Seungryul Baek
3DH
180
0
0
08 Jun 2023
LPMM: Intuitive Pose Control for Neural Talking-Head Model via
  Landmark-Parameter Morphable Model
LPMM: Intuitive Pose Control for Neural Talking-Head Model via Landmark-Parameter Morphable Model
K. Lee
Patrick Kwon
Myung Ki Lee
Namhyuk Ahn
Junsoo Lee
358
2
0
17 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in
  Style-based Generator
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based GeneratorComputer Vision and Pattern Recognition (CVPR), 2023
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
310
112
0
09 May 2023
High-fidelity Generalized Emotional Talking Face Generation with
  Multi-modal Emotion Space Learning
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space LearningComputer Vision and Pattern Recognition (CVPR), 2023
Chao Xu
Sijun Tan
Jibang Wu
Yue Han
Wenqing Chu
Xiaohui Bei
Chengjie Wang
Haifeng Xu
Yong Liu
CVBM
282
45
0
04 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking
  Face Generation
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
247
57
0
01 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial
  Animations
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial AnimationsPattern Recognition (Pattern Recogn.), 2023
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
253
16
0
18 Apr 2023
That's What I Said: Fully-Controllable Talking Face Generation
That's What I Said: Fully-Controllable Talking Face GenerationACM Multimedia (ACM MM), 2023
Youngjoon Jang
Kyeongha Rho
Jong-Bin Woo
Hyeongkeun Lee
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Joon Son Chung
CVBM
318
13
0
06 Apr 2023
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with
  Diffusion Autoencoder
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion AutoencoderACM Multimedia (ACM MM), 2023
Chenpeng Du
Qi Chen
Xie Chen
K. Yu
DiffM
506
70
0
30 Mar 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
MusicFace: Music-driven Expressive Singing Face SynthesisComputational Visual Media (CVM), 2023
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
223
15
0
24 Mar 2023
12
Next
Page 1 of 2