Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1807.07860
Cited By
v1
v2 (latest)
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
AAAI Conference on Artificial Intelligence (AAAI), 2018
20 July 2018
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Talking Face Generation by Adversarially Disentangled Audio-Visual Representation"
50 / 242 papers shown
Title
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Yifeng Ma
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yingya Zhang
Zhidong Deng
DiffM
285
23
0
15 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Computer Vision and Pattern Recognition (CVPR), 2023
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
347
50
0
13 Dec 2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Haiming Zhang
Zhihao Yuan
Chaoda Zheng
Xu Yan
Baoyuan Wang
Guanbin Li
Song Wu
Shuguang Cui
Zhen Li
CVBM
137
1
0
12 Dec 2023
GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
Yibo Xia
Lizhen Wang
Xiang Deng
Xiaoyan Luo
Yunhong Wang
Yebin Liu
VGen
243
2
0
12 Dec 2023
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers
Aaron Mir
Eduardo Alonso
Esther Mondragón
DiffM
220
4
0
11 Dec 2023
R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning
Zhiling Ye
LiangGuo Zhang
Dingheng Zeng
Quan Lu
Ning Jiang
167
2
0
09 Dec 2023
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ziqiao Peng
Wentao Hu
Yue Shi
Xiangyu Zhu
Xiaomei Zhang
Hao Zhao
Jun He
Hongyan Liu
Zhaoxin Fan
202
87
0
29 Nov 2023
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding
Jianzong Wang
Yimin Deng
Ziqi Liang
Xulong Zhang
Ning Cheng
Jing Xiao
CVBM
119
2
0
15 Nov 2023
STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment
International Conference on Machine Learning (ICML), 2023
Jaewoo Lee
Jaehong Yoon
Wonjae Kim
Yunji Kim
Sung Ju Hwang
CLL
223
1
0
12 Oct 2023
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
110
5
0
28 Sep 2023
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment
ACM Multimedia (ACM MM), 2023
Zheng-Yan Sheng
Yang Ai
Yan-Nian Chen
Zhenhua Ling
CVBM
129
10
0
18 Sep 2023
Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent
Alice Delbosc
M. Ochs
Nicolas Sabouret
Brian Ravenet
Stéphane Ayache
216
11
0
15 Sep 2023
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Yongyuan Li
Xiuyuan Qin
Chao Liang
Mingqiang Wei
153
5
0
14 Sep 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuan Gan
Zongxin Yang
Xihang Yue
Lingyun Sun
Yezhou Yang
179
90
0
10 Sep 2023
Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
IEEE International Conference on Computer Vision (ICCV), 2023
Xiuzhe Wu
Pengfei Hu
Yang Wu
Xiaoyang Lyu
Yan-Pei Cao
Ying Shan
Wenming Yang
Zhongqian Sun
Xiaojuan Qi
94
15
0
09 Sep 2023
ReliTalk: Relightable Talking Portrait Generation from a Single Video
International Journal of Computer Vision (IJCV), 2023
Haonan Qiu
Zhaoxi Chen
Yuming Jiang
Hang Zhou
Xiangyu Fan
Lei Yang
Wayne Wu
Ziwei Liu
DiffM
VGen
186
14
0
05 Sep 2023
RADIO: Reference-Agnostic Dubbing Video Synthesis
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Dongyeun Lee
Chaewon Kim
Sangjoon Yu
Jaejun Yoo
Gyeong-Moon Park
VGen
DiffM
214
2
0
05 Sep 2023
Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
Linsen Song
Wayne Wu
Chaoyou Fu
Chen Change Loy
Xiao-Yu Zhang
214
15
0
31 Aug 2023
MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model
ACM Multimedia (ACM MM), 2023
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
DiffM
127
16
0
31 Aug 2023
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications
Shreyank N. Gowda
Dheeraj Pandey
Shashank Narayana Gowda
183
5
0
30 Aug 2023
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park
Joanna Hong
Minsu Kim
Y. Ro
162
4
0
23 Aug 2023
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
172
9
0
17 Aug 2023
Controlling Character Motions without Observable Driving Source
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Weiyuan Li
Bin Dai
Ziyi Zhou
Qi Yao
Baoyuan Wang
VGen
98
3
0
11 Aug 2023
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements
ACM Multimedia (ACM MM), 2023
Haozhe Wu
Songtao Zhou
Jia Jia
Junliang Xing
Qi Wen
Xiang Wen
CVBM
216
21
0
10 Aug 2023
Rethinking Voice-Face Correlation: A Geometry View
ACM Multimedia (ACM MM), 2023
Xiang Li
Yandong Wen
Muqiao Yang
Jinglu Wang
Rita Singh
Bhiksha Raj
CVBM
3DH
99
7
0
26 Jul 2023
Audio-driven Talking Face Generation with Stabilized Synchronization Loss
European Conference on Computer Vision (ECCV), 2023
Dogucan Yaman
Fevziye Irem Eyiokur
Leonard Barmann
H. K. Ekenel
Alexander Waibel
CVBM
356
10
0
18 Jul 2023
AltFreezing for More General Video Face Forgery Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Zhendong Wang
Jianmin Bao
Wen-gang Zhou
Weilun Wang
Houqiang Li
ViT
CVBM
190
101
0
17 Jul 2023
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
Louis Airale
Dominique Vaufreydaz
Xavier Alameda-Pineda
135
1
0
04 Jul 2023
Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
J. Choi
Minsu Kim
Se Jin Park
Y. Ro
CVBM
119
6
0
28 Jun 2023
Audio-Driven 3D Facial Animation from In-the-Wild Videos
Liying Lu
Tianke Zhang
Yunfei Liu
Xuangeng Chu
Yu Li
VGen
128
6
0
20 Jun 2023
Align, Adapt and Inject: Sound-guided Unified Image Generation
Yue Yang
Kaipeng Zhang
Yuying Ge
Wenqi Shao
Zeyue Xue
Yu Qiao
Ping Luo
DiffM
261
7
0
20 Jun 2023
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment
Computer Vision and Pattern Recognition (CVPR), 2023
Ricong Huang
Puxiang Lai
Yipeng Qin
Guanbin Li
CVBM
DiffM
217
16
0
13 Jun 2023
IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map
Han-Lim Lee
Yu-Te Ku
Eunseok Kim
Seungryul Baek
3DH
101
0
0
08 Jun 2023
MyStyle++: A Controllable Personalized Generative Prior
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Libing Zeng
Lele Chen
Yinghao Xu
N. Kalantari
236
8
0
08 Jun 2023
Exploring Phonetic Context-Aware Lip-Sync For Talking Face Generation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Se Jin Park
Minsu Kim
J. Choi
Y. Ro
CVBM
131
7
0
31 May 2023
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars
Neural Information Processing Systems (NeurIPS), 2023
Dongwei Pan
Long Zhuo
Jingtan Piao
Huiwen Luo
Wei Cheng
...
Chen Change Loy
Chao Qian
Wayne Wu
Dahua Lin
Kwan-Yee Lin
199
37
0
22 May 2023
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fa-Ting Hong
Li Shen
Dan Xu
3DH
CVBM
186
31
0
10 May 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Computer Vision and Pattern Recognition (CVPR), 2023
Jiazhi Guan
Zhanwang Zhang
Hang Zhou
Tianshu Hu
Kaisiyuan Wang
...
Haocheng Feng
Jingtuo Liu
Errui Ding
Ziwei Liu
Jingdong Wang
249
91
0
09 May 2023
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Chao Xu
Sijun Tan
Jibang Wu
Yue Han
Wenqing Chu
Xiaohui Bei
Chengjie Wang
Haifeng Xu
Yong Liu
CVBM
128
44
0
04 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
183
53
0
01 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Pattern Recognition (Pattern Recogn.), 2023
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
146
13
0
18 Apr 2023
That's What I Said: Fully-Controllable Talking Face Generation
ACM Multimedia (ACM MM), 2023
Youngjoon Jang
Kyeongha Rho
Jong-Bin Woo
Hyeongkeun Lee
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Joon Son Chung
CVBM
105
10
0
06 Apr 2023
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles
IEEE transactions on multimedia (IEEE TMM), 2023
Yifeng Ma
Suzhe Wang
Yu-qiong Ding
Lincheng Li
Bowen Ma
Tangjie Lv
Changjie Fan
Zhipeng Hu
Zhidong Deng
Xin Yu
CLIP
164
32
0
01 Apr 2023
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions
IEEE International Conference on Multimedia and Expo (ICME), 2023
Jin Liu
Xi Wang
Xiaomeng Fu
Yesheng Chai
Cai Yu
Jiao Dai
Jizhong Han
243
7
0
31 Mar 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Computer Vision and Pattern Recognition (CVPR), 2023
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
Maja Pantic
Christian Fuegen
340
24
0
30 Mar 2023
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
Computer Vision and Pattern Recognition (CVPR), 2023
Jiadong Wang
Xinyuan Qian
Malu Zhang
R. Tan
Haizhou Li
EGVM
159
133
0
29 Mar 2023
OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering
Computer Vision and Pattern Recognition (CVPR), 2023
Zhiyuan Ma
Xiangyu Zhu
Guojun Qi
Zhen Lei
Guang Dai
CVBM
212
72
0
26 Mar 2023
MusicFace: Music-driven Expressive Singing Face Synthesis
Computational Visual Media (CVM), 2023
Peng Liu
W. Deng
Hengda Li
Jintai Wang
Yinglin Zheng
Yiwei Ding
Xiaohu Guo
Ming Zeng
CVBM
139
13
0
24 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
267
190
0
21 Mar 2023
Style Transfer for 2D Talking Head Animation
Trong-Thang Pham
Nhat Le
Tuong Khanh Long Do
Hung Nguyen
Erman Tjiputra
Quang-Dieu Tran
A. Nguyen
252
3
0
17 Mar 2023
Previous
1
2
3
4
5
Next