Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis

Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis

17 May 2020

Rudrabha Mukhopadhyay

Vinay P. Namboodiri

Papers citing "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

17 / 67 papers shown

Title
Visual Speech Recognition for Multiple Languages in the Wild Pingchuan Ma Stavros Petridis M. Pantic VLM 130 145 0 26 Feb 2022
VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Disong Wang Shan Yang Dan Su Xunying Liu Dong Yu Helen Meng 17 11 0 18 Feb 2022
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading Leyuan Qu C. Weber S. Wermter 38 23 0 09 Dec 2021
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech Michael Hassid Michelle Tadmor Ramanovich Brendan Shillingford Miaosen Wang Ye Jia Tal Remez DiffM 25 16 0 19 Nov 2021
Personalized One-Shot Lipreading for an ALS Patient Bipasha Sen Aditya Agarwal Rudrabha Mukhopadhyay Vinay P. Namboodiri C. V. Jawahar LM&MA 11 3 0 02 Nov 2021
Neural Dubber: Dubbing for Videos According to Scripts Chenxu Hu Qiao Tian Tingle Li Yuping Wang Yuxuan Wang Hang Zhao DiffM VGen 36 39 0 15 Oct 2021
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over Junchen Lu Berrak Sisman Rui Liu Mingyang Zhang Haizhou Li DiffM 36 19 0 07 Oct 2021
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading Shahd Elashmawy Marian M. Ramsis Hesham M. Eraqi Farah Eldeshnawy Hadeel Mabrouk Omar Abugabal Nourhan Sakr 35 1 0 07 Aug 2021
Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations Seyun Um Jihyun Kim Jihyun Lee Hong-Goo Kang CVBM 13 4 0 26 Jul 2021
Speaker disentanglement in video-to-speech conversion Dan Oneaţă Adriana Stan H. Cucu 24 9 0 20 May 2021
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Rodrigo Mira Konstantinos Vougioukas Pingchuan Ma Stavros Petridis Björn W. Schuller M. Pantic 32 43 0 27 Apr 2021
GAN Inversion: A Survey Weihao Xia Yulun Zhang Yujiu Yang Jing-Hao Xue Bolei Zhou Ming-Hsuan Yang DiffM 70 507 0 14 Jan 2021
Visual Speech Enhancement Without A Real Visual Stream Sindhu B. Hegde Prajwal K R Rudrabha Mukhopadhyay Vinay P. Namboodiri C. V. Jawahar DiffM 20 17 0 20 Dec 2020
Speech Prediction in Silent Videos using Variational Autoencoders Ravindra Yadav Ashish Sardana Vinay P. Namboodiri R. Hegde VGen DRL 29 23 0 14 Nov 2020
An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments Shrishti Saha Shetu Soumitro Chakrabarty Emanuel Habets 14 2 0 09 Nov 2020
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Ye Jia Yu Zhang Ron J. Weiss Quan Wang Jonathan Shen ... Zhehuai Chen Patrick Nguyen Ruoming Pang Ignacio López Moreno Yonghui Wu 207 820 0 12 Jun 2018
Lip Reading Sentences in the Wild Joon Son Chung A. Senior Oriol Vinyals Andrew Zisserman 185 784 0 16 Nov 2016