ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.07860
  4. Cited By
Talking Face Generation by Adversarially Disentangled Audio-Visual
  Representation
v1v2 (latest)

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation

AAAI Conference on Artificial Intelligence (AAAI), 2018
20 July 2018
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
    CVBM
ArXiv (abs)PDFHTML

Papers citing "Talking Face Generation by Adversarially Disentangled Audio-Visual Representation"

50 / 242 papers shown
Title
Is It Truly Necessary to Process and Fit Minutes-Long Reference Videos for Personalized Talking Face Generation?
Is It Truly Necessary to Process and Fit Minutes-Long Reference Videos for Personalized Talking Face Generation?
Rui-Qing Sun
Ang Li
Zhijing Wu
Tian Lan
Qianyu Lu
Xingshan Yao
C. Xu
Xian-Ling Mao
DiffMVGen
344
0
0
11 Nov 2025
AvatarSync: Rethinking Talking-Head Animation through Phoneme-Guided Autoregressive Perspective
AvatarSync: Rethinking Talking-Head Animation through Phoneme-Guided Autoregressive Perspective
Yuchen Deng
Xiuyang Wu
Hai-Tao Zheng
Suiyang Zhang
Yi He
Yuxing Han
VGen
80
0
0
15 Sep 2025
Taming Transformer for Emotion-Controllable Talking Face Generation
Taming Transformer for Emotion-Controllable Talking Face Generation
Ziqi Zhang
Cheng Deng
CVBM
108
0
0
20 Aug 2025
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
Shuai Tan
Bin Ji
146
0
0
19 Aug 2025
Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm
Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm
Lin Zhang
Zefan Cai
Jiuxiang Gu
Shentong Mo
Jinhong Lin
...
Ruiyi Zhang
Wen Xiao
Tong Sun
Junjie Hu
Pedro Morgado
VGen
141
1
0
05 Aug 2025
Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
Dogucan Yaman
Fevziye Irem Eyiokur
Leonard Barmann
H. K. Ekenel
Alexander H. Waibel
CVBM
154
0
0
28 Jul 2025
Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives
Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives
Shijing He
Yaxiong Lei
Zihan Zhang
Yuzhou Sun
S. Li
Chi Zhang
Juan Ye
154
1
0
07 Jun 2025
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Yuan Gan
Jiaxu Miao
Yunze Wang
Yi Yang
AAMLDiffM
130
1
0
02 Jun 2025
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
277
2
0
01 May 2025
Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion
Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion
Xiping Hu
Alexandre Bruckert
P. Le Callet
Guoquan Zheng
VGen
207
1
0
29 Apr 2025
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion TransformersComputer Vision and Pattern Recognition (CVPR), 2025
Jiazhi Guan
Kaisiyuan Wang
Zhiliang Xu
Quanwei Yang
Yasheng Sun
...
Errui Ding
Jiadong Wang
Youjian Zhao
Hang Zhou
Ziwei Liu
VGen
224
1
0
25 Mar 2025
3D Engine-ready Photorealistic Avatars via Dynamic Textures
3D Engine-ready Photorealistic Avatars via Dynamic Textures
Yifan Wang
Ivan Molodetskikh
Ondrej Texler
Dimitar Dinev
237
0
0
19 Mar 2025
SyncDiff: Diffusion-based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved Synchronization
SyncDiff: Diffusion-based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved SynchronizationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Xulin Fan
Heting Gao
Ziyi Chen
Peng Chang
Mei Han
Mark Hasegawa-Johnson
DiffM
285
1
0
17 Mar 2025
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
Sungwoo Cho
J. Choi
Sungnyun Kim
Se-Young Yun
277
0
0
14 Mar 2025
MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment
MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment
Hao Zhou
Xiaobao Guo
Yuzhe Zhu
A. Kong
DiffM
362
1
0
13 Mar 2025
Separation of Neural Drives to Muscles from Transferred Polyfunctional
  Nerves using Implanted Micro-electrode Arrays
Separation of Neural Drives to Muscles from Transferred Polyfunctional Nerves using Implanted Micro-electrode Arrays
Laura Ferrante
Anna Boesendorfer
D. Barsakcioglu
Benedikt Baumgartner
Yazan Al-Ajam
Alex Woollard
Norbert Venantius Kang
Oskar Aszmann
D. Farina
212
15
0
14 Oct 2024
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation
EmoGene: Audio-Driven Emotional 3D Talking-Head GenerationIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Wenqing Wang
Yun Fu
VGen
305
1
0
07 Oct 2024
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head
  Generation
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Aoxiang Fan
Hieu Le
Zhixin Shu
Yang Wang
Yi-Hsuan Tsai
Dimitris Samaras
151
1
0
29 Sep 2024
DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical
  Diffusion for Audio-driven Talking Head Synthesis
DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Fa-Ting Hong
Yunfei Liu
Yu Li
Changyin Zhou
Fei Yu
D. Xu
DiffM
196
3
0
16 Sep 2024
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of
  Talking Heads
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking HeadsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Suzhen Wang
Yifeng Ma
Yu Ding
Zhipeng Hu
Changjie Fan
Tangjie Lv
Zhidong Deng
Xin Yu
236
18
0
14 Sep 2024
DiffTED: One-shot Audio-driven TED Talk Video Generation with
  Diffusion-based Co-speech Gestures
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures
S. Hogue
Chenxu Zhang
Hamza Daruger
Yapeng Tian
Xiaohu Guo
VGen
195
21
0
11 Sep 2024
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks
  Generation
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Hoang-Son Vo-Thanh
Quang Vinh Nguyen
Soo-Hyung Kim
CVBM
100
0
0
09 Sep 2024
FD2Talk: Towards Generalized Talking Head Generation with Facial
  Decoupled Diffusion Model
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion ModelACM Multimedia (MM), 2024
Ziyu Yao
Xuxin Cheng
Zhiqi Huang
DiffM
185
8
0
18 Aug 2024
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually
  Synced Facial Performer
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial PerformerEuropean Conference on Computer Vision (ECCV), 2024
Jiazhi Guan
Zhiliang Xu
Hang Zhou
Kaisiyuan Wang
Shengyi He
...
Errui Ding
Jingtuo Liu
Jingdong Wang
Youjian Zhao
Ziwei Liu
VGen
179
10
0
06 Aug 2024
EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking
  Head
EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking HeadEuropean Conference on Computer Vision (ECCV), 2024
Qianyun He
Xinya Ji
Ruichen Zheng
Yuanxun Lu
Zhengyu Diao
...
Songcen Xu
Xiaofei Wu
Zixiao Zhang
Xun Cao
Hao Zhu
174
13
0
01 Aug 2024
EmoFace: Audio-driven Emotional 3D Face Animation
EmoFace: Audio-driven Emotional 3D Face Animation
Chang Liu
Qunfen Lin
Zijiao Zeng
Ye Pan
CVBM
156
7
0
17 Jul 2024
Learning Online Scale Transformation for Talking Head Video Generation
Learning Online Scale Transformation for Talking Head Video Generation
Fa-Ting Hong
Dan Xu
196
1
0
13 Jul 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video
  Generation
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
141
0
0
23 Jun 2024
RITA: A Real-time Interactive Talking Avatars Framework
RITA: A Real-time Interactive Talking Avatars Framework
Wuxinlin Cheng
Cheng Wan
Yupeng Cao
Sihan Chen
165
1
0
18 Jun 2024
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head
  Generation
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation
Niu Guanchen
3DH
209
0
0
17 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis:
  Techniques for Portrait Generation, Driving Mechanisms, and Editing
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
293
5
0
15 Jun 2024
Emotional Conversation: Empowering Talking Faces with Cohesive
  Expression, Gaze and Pose Generation
Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation
Jiadong Liang
Feng Lu
CVBM
260
7
0
12 Jun 2024
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Rishit Dagli
Shivesh Prakash
Robert Wu
H. Khosravani
321
14
0
06 Jun 2024
OpFlowTalker: Realistic and Natural Talking Face Generation via Optical
  Flow Guidance
OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
Shuheng Ge
Haoyu Xing
Li Zhang
Xiangqian Wu
244
0
0
23 May 2024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Faces that Speak: Jointly Synthesising Talking Face and Speech from TextComputer Vision and Pattern Recognition (CVPR), 2024
Youngjoon Jang
Ji-Hoon Kim
Junseok Ahn
Doyeop Kwak
Hong-Sun Yang
Yooncheol Ju
Il-Hwan Kim
Byeong-Yeol Kim
Joon Son Chung
CVBM
199
18
0
16 May 2024
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Jihyun Kim
Changjae Oh
Hoseok Do
Soohyun Kim
Kwanghoon Sohn
DiffM
185
21
0
07 May 2024
Audio-Visual Speech Representation Expert for Enhanced Talking Face
  Video Generation and Evaluation
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation
Dogucan Yaman
Fevziye Irem Eyiokur
Leonard Barmann
Seymanur Akti
H. K. Ekenel
Alexander H. Waibel
EGVM
205
14
0
07 May 2024
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian
  Splatting
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Hongyun Yu
Zhan Qu
Qihang Yu
Jianchuan Chen
Zhonghua Jiang
...
Shengyu Zhang
Jimin Xu
Leilei Gan
Chengfei Lv
Gang Yu
3DGS
241
30
0
22 Apr 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from
  Factorized Appearance, Head-pose, and Facial Expression Features
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
Andre Rochow
Max Schwarz
Sven Behnke
ViT
206
22
0
15 Apr 2024
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
EDTalk: Efficient Disentanglement for Emotional Talking Head SynthesisEuropean Conference on Computer Vision (ECCV), 2024
Shuai Tan
Bin Ji
Mengxiao Bi
Ye Pan
206
63
0
02 Apr 2024
Deepfake Generation and Detection: A Benchmark and Survey
Deepfake Generation and Detection: A Benchmark and Survey
Gan Pei
Jiangning Zhang
Menghan Hu
Ying Tai
Chengjie Wang
Yunsheng Wu
Guangtao Zhai
Jian Yang
Chunhua Shen
Dacheng Tao
276
76
0
26 Mar 2024
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis
VLOGGER: Multimodal Diffusion for Embodied Avatar SynthesisComputer Vision and Pattern Recognition (CVPR), 2024
Enric Corona
Andrei Zanfir
Eduard Gabriel Bazavan
Nikos Kolotouros
Thiemo Alldieck
C. Sminchisescu
VGenDiffM
164
45
0
13 Mar 2024
A Comparative Study of Perceptual Quality Metrics for Audio-driven
  Talking Head Videos
A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head VideosInternational Conference on Information Photonics (ICIP), 2024
Weixia Zhang
Chengguang Zhu
Jingnan Gao
Manwen Liao
Guangtao Zhai
Yunbo Wang
EGVM
128
7
0
11 Mar 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through
  Normalizing Flow and Quantization
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and QuantizationComputer Vision and Pattern Recognition (CVPR), 2024
Shuai Tan
Bin Ji
Ye Pan
340
39
0
11 Mar 2024
Say Anything with Any Style
Say Anything with Any StyleAAAI Conference on Artificial Intelligence (AAAI), 2024
Shuai Tan
Bin Ji
Yu Ding
Ye Pan
VGenDiffM
186
25
0
11 Mar 2024
Audio-Synchronized Visual Animation
Audio-Synchronized Visual AnimationEuropean Conference on Computer Vision (ECCV), 2024
Lin Zhang
Shentong Mo
Yijing Zhang
Pedro Morgado
DiffM
180
30
0
08 Mar 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces
  from Disentangled Audio
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Chao Xu
Yang Liu
Jiazheng Xing
Weida Wang
Mingze Sun
...
Tianxin Huang
Siyuan Li
Zhi-Qi Cheng
Ying Tai
Baigui Sun
CVBM
332
18
0
04 Mar 2024
G4G:A Generic Framework for High Fidelity Talking Face Generation with
  Fine-grained Intra-modal Alignment
G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment
Juan Zhang
Jiahao Chen
Cheng Wang
Zhi-Yang Yu
Tangquan Qi
Di Wu
CVBM
202
0
0
28 Feb 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
Zicheng Zhang
Ruobing Zheng
Ziwen Liu
Congying Han
Tianqi Li
Meng Wang
Tiande Guo
Jingdong Chen
Bonan li
Ming Yang
3DH
158
12
0
27 Feb 2024
Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial
  Animation
Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation
Hui Fu
Zeqing Wang
Ke Gong
Keze Wang
Tianshui Chen
Haojie Li
Haifeng Zeng
Xiandong Li
169
21
0
18 Dec 2023
12345
Next