Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1807.07860
Cited By
v1
v2 (latest)
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
AAAI Conference on Artificial Intelligence (AAAI), 2018
20 July 2018
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Talking Face Generation by Adversarially Disentangled Audio-Visual Representation"
50 / 242 papers shown
Title
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering
Shunyu Yao
Ruizhe Zhong
Manwen Liao
Guangtao Zhai
Xiaokang Yang
CVBM
139
112
0
03 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
510
77
0
27 Dec 2021
VirtualCube: An Immersive 3D Video Communication System
Yizhong Zhang
Jiaolong Yang
Zhen Liu
Ruicheng Wang
Guojun Chen
Xin Tong
B. Guo
171
57
0
13 Dec 2021
FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Yingruo Fan
Mohammad Kachuee
Jun Saito
Wenping Wang
Taku Komura
CVBM
564
254
0
10 Dec 2021
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
214
131
0
06 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Proceedings of the ACM on Computer Graphics and Interactive Techniques (PACMCGIT), 2021
Yingruo Fan
Mohammad Kachuee
Jun Saito
Wenping Wang
Taku Komura
162
27
0
04 Dec 2021
Geometry-Aware Multi-Task Learning for Binaural Audio Generation from Video
British Machine Vision Conference (BMVC), 2021
Rishabh Garg
Ruohan Gao
Kristen Grauman
140
30
0
21 Nov 2021
LiMuSE: Lightweight Multi-modal Speaker Extraction
Qinghua Liu
Yating Huang
Yunzhe Hao
Jiaming Xu
Bo Xu
125
6
0
07 Nov 2021
Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face Synthesis
ACM Multimedia (ACM MM), 2021
Haozhe Wu
Jia Jia
Haoyu Wang
Yishun Dou
Chao Duan
Qingshan Deng
CVBM
137
82
0
30 Oct 2021
LARNet: Latent Action Representation for Human Action Synthesis
British Machine Vision Conference (BMVC), 2021
Naman Biyani
A. J. Rana
Shruti Vyas
Yogesh S Rawat
116
4
0
21 Oct 2021
Talking Head Generation with Audio and Speech Related Facial Action Units
Sen Chen
Zhilei Liu
Jiaxing Liu
Zhengxiang Yan
Longbiao Wang
CVBM
111
18
0
19 Oct 2021
Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor
Anchit Gupta
Faizan Farooq Khan
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
CVBM
173
6
0
16 Oct 2021
Neural Dubber: Dubbing for Videos According to Scripts
Chenxu Hu
Qiao Tian
Tingle Li
Yuping Wang
Yuxuan Wang
Hang Zhao
DiffM
VGen
218
50
0
15 Oct 2021
A review of Generative Adversarial Networks (GANs) and its applications in a wide variety of disciplines -- From Medical to Remote Sensing
Ankan Dash
J. Ye
Guiling Wang
MedIm
AI4CE
148
132
0
01 Oct 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
ACM Transactions on Graphics (TOG), 2021
Yuanxun Lu
Jinxiang Chai
Xun Cao
173
96
0
22 Sep 2021
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
Yurui Ren
Gezhong Li
Yuanqi Chen
Thomas H. Li
Shan Liu
DiffM
VGen
194
256
0
17 Sep 2021
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis
Tong Sha
Wei Zhang
T. Shen
Zhoujun Li
Tao Mei
152
45
0
05 Sep 2021
Sparse to Dense Motion Transfer for Face Image Animation
Ruiqi Zhao
Tianyi Wu
Guodong Guo
3DH
CVBM
181
30
0
01 Sep 2021
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning
Chenxu Zhang
Yifan Zhao
Yifei Huang
Ming Zeng
Saifeng Ni
M. Budagavi
Xiaohu Guo
CVBM
139
142
0
18 Aug 2021
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person
IEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Xinsheng Wang
Qicong Xie
Jihua Zhu
Lei Xie
O. Scharenborg
140
27
0
09 Aug 2021
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
Zhaofeng Shi
129
11
0
01 Aug 2021
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Suzhe Wang
Lincheng Li
Yu-qiong Ding
Changjie Fan
Xin Yu
VGen
207
202
0
20 Jul 2021
Parallel and High-Fidelity Text-to-Lip Generation
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jinglin Liu
Zhiying Zhu
Yi Ren
Wencan Huang
Baoxing Huai
N. Yuan
Zhou Zhao
130
10
0
14 Jul 2021
Speech2Video: Cross-Modal Distillation for Speech to Video Generation
Interspeech (Interspeech), 2021
Shijing Si
Jianzong Wang
Xiaoyang Qu
Ning Cheng
Wenqi Wei
Xinghua Zhu
Jing Xiao
VGen
106
15
0
10 Jul 2021
Deep Image Synthesis from Intuitive User Input: A Review and Perspectives
Computational Visual Media (CVM), 2021
Yuan Xue
Yuanchen Guo
Han Zhang
Tao Xu
Song-Hai Zhang
Xiaolei Huang
EGVM
3DV
180
22
0
09 Jul 2021
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Xi Zhang
Xiaolin Wu
CVBM
166
20
0
05 Jul 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
Computer Vision and Pattern Recognition (CVPR), 2021
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
174
110
0
08 Jun 2021
PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback
International Conference on Human Factors in Computing Systems (CHI), 2021
Yaohua Bu
Tianyi Ma
Weijun Li
Hang Zhou
Jia Jia
...
Kun Li
Zhiyong Wu
Yuanchun Shi
Xiaobo Lu
Ziwei Liu
63
10
0
11 May 2021
Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sibo Zhang
Jiahong Yuan
Miao Liao
Liangjun Zhang
161
39
0
29 Apr 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
Computer Vision and Pattern Recognition (CVPR), 2021
Hang Zhou
Yasheng Sun
Wayne Wu
Chen Change Loy
Xiaogang Wang
Ziwei Liu
CVBM
279
422
0
22 Apr 2021
Voice2Mesh: Cross-Modal 3D Face Model Generation from Voices
Cho-Ying Wu
Ke Xu
Chin-Cheng Hsu
Ulrich Neumann
CVBM
3DH
122
5
0
21 Apr 2021
Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Xiangjie Sui
Esa Rahtu
136
30
0
17 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
IEEE International Conference on Computer Vision (ICCV), 2021
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
252
241
0
16 Apr 2021
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
AAAI Conference on Artificial Intelligence (AAAI), 2021
Lilin Cheng
Suzhe Wang
Zhimeng Zhang
Yu-qiong Ding
Yixing Zheng
Xin Yu
Changjie Fan
VGen
174
78
0
16 Apr 2021
Visually Informed Binaural Audio Generation without Binaural Audios
Computer Vision and Pattern Recognition (CVPR), 2021
Xudong Xu
Hang Zhou
Ziwei Liu
Bo Dai
Xiaogang Wang
Dahua Lin
DiffM
79
67
0
13 Apr 2021
Can audio-visual integration strengthen robustness under multimodal attacks?
Computer Vision and Pattern Recognition (CVPR), 2021
Yapeng Tian
Chenliang Xu
AAML
252
40
0
05 Apr 2021
Audio Description from Image by Modal Translation Network
Neurocomputing (Neurocomputing), 2021
Hailong Ning
Xiangtao Zheng
Yuan Yuan
Xiaoqiang Lu
DiffM
91
18
0
18 Mar 2021
Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward
Momina Masood
M. Nawaz
K. Malik
A. Javed
Aun Irtaza
AAML
457
392
0
25 Feb 2021
Video Reenactment as Inductive Bias for Content-Motion Disentanglement
IEEE Transactions on Image Processing (TIP), 2021
Juan Felipe Hernandez Albarracin
Adín Ramirez Rivera
314
3
0
30 Jan 2021
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
Computer Vision and Pattern Recognition (CVPR), 2021
Ruohan Gao
Kristen Grauman
CVBM
386
233
0
08 Jan 2021
Weakly-Supervised Multi-Face 3D Reconstruction
Jialiang Zhang
Lixiang Lin
Jianke Zhu
Guosheng Lin
CVBM
3DH
168
22
0
06 Jan 2021
AudioViewer: Learning to Visualize Sounds
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Chunjin Song
Yuchi Zhang
Willis Peng
Parmis Mohaghegh
Bastian Wandt
Helge Rhodin
237
3
0
22 Dec 2020
Multi Modal Adaptive Normalization for Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
VGen
DiffM
80
0
0
14 Dec 2020
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
Computer Vision and Pattern Recognition (CVPR), 2020
Ting-Chun Wang
Arun Mallya
Xuan Li
3DH
448
571
0
30 Nov 2020
Audio-visual Speech Separation with Adversarially Disentangled Visual Representation
Peng Zhang
Jiaming Xu
Jing Shi
Yunzhe Hao
Bo Xu
747
7
0
29 Nov 2020
Stochastic Talking Face Generation Using Latent Distribution Matching
Interspeech (Interspeech), 2020
Ravindra Yadav
Ashish Sardana
Vinay P. Namboodiri
R. Hegde
DiffM
CVBM
91
4
0
21 Nov 2020
Iterative Text-based Editing of Talking-heads Using Neural Retargeting
ACM Transactions on Graphics (TOG), 2020
Xinwei Yao
Ohad Fried
Kayvon Fatahalian
Maneesh Agrawala
VGen
121
37
0
21 Nov 2020
Large-scale multilingual audio visual dubbing
Yi Yang
Brendan Shillingford
Yannis Assael
Miaosen Wang
Wendi Liu
...
Eren Sezener
Luis C. Cobo
Misha Denil
Y. Aytar
Nando de Freitas
139
24
0
06 Nov 2020
AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection
Hao Zhu
Chaoyou Fu
Qianyi Wu
Wayne Wu
Chao Qian
Ran He
142
32
0
05 Nov 2020
Lets Play Music: Audio-driven Performance Video Generation
Hao Zhu
Yi Li
Feixia Zhu
A. Zheng
Ran He
161
7
0
05 Nov 2020
Previous
1
2
3
4
5
Next