ResearchTrend.AI
End-to-End Speech-Driven Facial Animation with Temporal GANs

23 May 2018
Konstantinos Vougioukas
Stavros Petridis
M. Pantic
CVBM

Papers citing "End-to-End Speech-Driven Facial Animation with Temporal GANs"

50 / 64 papers shown
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
52
0
0
01 May 2025
Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation
Jiadong Liang
Feng Lu
CVBM
29
0
0
12 Jun 2024
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
26
29
0
13 Dec 2023
Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Carson Yu Liu
Gelareh Mohammadi
Yang Song
W. Johal
13
2
0
17 Sep 2023
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Soumik Mukhopadhyay
Saksham Suri
R. Gadde
Abhinav Shrivastava
DiffM
33
20
0
18 Aug 2023
IFaceUV: Intuitive Motion Facial Image Generation by Identity Preservation via UV map
Han-Lim Lee
Yu-Te Ku
Eunseok Kim
Seungryul Baek
3DH
22
0
0
08 Jun 2023
Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Antoni Bigata Casademunt
Rodrigo Mira
Nikita Drobyshev
Konstantinos Vougioukas
Stavros Petridis
M. Pantic
DiffM
59
2
0
15 May 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
42
19
0
30 Mar 2023
READ Avatars: Realistic Emotion-controllable Audio Driven Avatars
Jack D. Saunders
Vinay P. Namboodiri
VGen
21
11
0
01 Mar 2023
Speech Driven Video Editing via an Audio-Conditioned Diffusion Model
Dan Bigioi
Shubhajit Basak
Michał Stypułkowski
Maciej Ziȩba
H. Jordan
R. Mcdonnell
Peter Corcoran
DiffM
VGen
19
34
0
10 Jan 2023
Face Generation and Editing with StyleGAN: A Survey
Andrew Melnik
Maksim Miasayedzenkau
Dzianis Makaravets
Dzianis Pirshtuk
Eren Akbulut
Dennis Holzmann
Tarek Renusch
Gustav Reichert
Helge J. Ritter
CVBM
19
39
0
18 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
35
0
0
09 Dec 2022
SPACE: Speech-driven Portrait Animation with Controllable Expression
Siddharth Gururani
Arun Mallya
Ting-Chun Wang
Rafael Valle
Ming-Yu Liu
VGen
20
45
0
17 Nov 2022
Talking Head from Speech Audio using a Pre-trained Image Generator
M. M. Alghamdi
He-Nan Wang
A. Bulpitt
David C. Hogg
70
21
0
09 Sep 2022
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection
Joanna Hong
Minsu Kim
Y. Ro
CVBM
DiffM
28
8
0
15 Jun 2022
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
50
157
0
30 May 2022
Deep Learning for Visual Speech Analysis: A Survey
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
21
33
0
22 May 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
35
109
0
06 May 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
40
214
0
07 Apr 2022
Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels
Zipeng Ye
Mengfei Xia
Ran Yi
Juyong Zhang
Yu-Kun Lai
Xuanteng Huang
Guoxin Zhang
Yong-jin Liu
CVBM
22
39
0
16 Jan 2022
Talking Head Generation with Audio and Speech Related Facial Action Units
Sen Chen
Zhilei Liu
Jiaxing Liu
Zhengxiang Yan
Longbiao Wang
CVBM
8
14
0
19 Oct 2021
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Yuanxun Lu
Jinxiang Chai
Xun Cao
24
82
0
22 Sep 2021
PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
Yurui Ren
Gezhong Li
Yuanqi Chen
Thomas H. Li
Shan Liu
DiffM
VGen
49
224
0
17 Sep 2021
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis
Tong Sha
Wei Zhang
T. Shen
Zhoujun Li
Tao Mei
27
38
0
05 Sep 2021
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person
Xinsheng Wang
Qicong Xie
Jihua Zhu
Lei Xie
O. Scharenborg
20
16
0
09 Aug 2021
CCVS: Context-aware Controllable Video Synthesis
G. L. Moing
Jean Ponce
Cordelia Schmid
10
78
0
16 Jul 2021
Multi-modality Deep Restoration of Extremely Compressed Face Videos
Xi Zhang
Xiaolin Wu
CVBM
16
13
0
05 Jul 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando De la Torre
Yaser Sheikh
CVBM
29
193
0
16 Apr 2021
One Shot Audio to Animated Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
H. Mujtaba
Pranshu Agarwal
D. Sarkar
VGen
18
0
0
19 Feb 2021
AudioViewer: Learning to Visualize Sounds
Chunjin Song
Yuchi Zhang
Willis Peng
Parmis Mohaghegh
Bastian Wandt
Helge Rhodin
22
1
0
22 Dec 2020
Robust One Shot Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
H. Mujtaba
VGen
22
13
0
14 Dec 2020
Multi Modal Adaptive Normalization for Audio to Video Generation
Neeraj Kumar
Srishti Goel
Ankur Narang
Brejesh Lall
VGen
DiffM
19
0
0
14 Dec 2020
Stochastic Talking Face Generation Using Latent Distribution Matching
Ravindra Yadav
Ashish Sardana
Vinay P. Namboodiri
R. Hegde
DiffM
CVBM
11
4
0
21 Nov 2020
Iterative Text-based Editing of Talking-heads Using Neural Retargeting
Xinwei Yao
Ohad Fried
Kayvon Fatahalian
Maneesh Agrawala
VGen
11
33
0
21 Nov 2020
Speech Prediction in Silent Videos using Variational Autoencoders
Ravindra Yadav
Ashish Sardana
Vinay P. Namboodiri
R. Hegde
VGen
DRL
13
23
0
14 Nov 2020
Personality-Driven Gaze Animation with Conditional Generative Adversarial Networks
Funda Durupinar
CVBM
GAN
20
2
0
11 Nov 2020
Video Generative Adversarial Networks: A Review
Nuha Aldausari
Arcot Sowmya
Nadine Marcus
Gelareh Mohammadi
EGVM
13
102
0
04 Nov 2020
Facial Keypoint Sequence Generation from Audio
Prateek Manocha
Prithwijit Guha
3DH
VGen
10
0
0
02 Nov 2020
Generative Adversarial Networks in Human Emotion Synthesis: A Review
Noushin Hajarolasvadi
M. A. Ramírez
H. Demirel
GAN
11
20
0
28 Oct 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando De la Torre
Yaser Sheikh
CVBM
6
81
0
11 Aug 2020
Speech Driven Talking Face Generation from a Single Image and an Emotion Condition
Sefik Emre Eskimez
You Zhang
Z. Duan
EGVM
CVBM
12
86
0
08 Aug 2020
Sound2Sight: Generating Visual Dynamics from Sound and Context
A. Cherian
Moitreya Chatterjee
N. Ahuja
VGen
69
35
0
23 Jul 2020
Modality Dropout for Improved Performance-driven Talking Faces
Ahmed Hussen Abdelaziz
B. Theobald
Paul Dixon
Reinhard Knothe
N. Apostoloff
Sachin Kajareker
19
36
0
27 May 2020
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?
Abhinav Shukla
Stavros Petridis
M. Pantic
SSL
27
28
0
04 May 2020
Everybody's Talkin': Let Me Talk as You Want
Linsen Song
Wayne Wu
Chao Qian
R. He
Chen Change Loy
DiffM
VGen
23
142
0
15 Jan 2020
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
R. He
24
156
0
14 Jan 2020
Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
M. Pantic
SSL
14
24
0
13 Jan 2020
Detecting Adversarial Attacks On Audiovisual Speech Recognition
Pingchuan Ma
Stavros Petridis
M. Pantic
AAML
8
19
0
18 Dec 2019
Music-oriented Dance Video Synthesis with Pose Perceptual Loss
Xuanchi Ren
Haoran Li
Zijian Huang
Qifeng Chen
15
17
0
13 Dec 2019
Speech-driven facial animation using polynomial fusion of features
Triantafyllos Kefalas
Konstantinos Vougioukas
Yannis Panagakis
Stavros Petridis
Jean Kossaifi
M. Pantic
14
6
0
12 Dec 2019