Dynamic Temporal Alignment of Speech to Lips

19 August 2018

Papers citing "Dynamic Temporal Alignment of Speech to Lips"

Conditional Deep Canonical Time Warping

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

275

10 Jan 2025

Synchformer: Efficient Synchronization from Sparse Cues

Vladimir E. Iashin

Weidi Xie

Esa Rahtu

Andrew Zisserman

233

29 Jan 2024

GestSync: Determining who is speaking without a talking head

British Machine Vision Conference (BMVC), 2023

Sindhu B. Hegde

Andrew Zisserman

157

08 Oct 2023

DF-TransFusion: Multimodal Deepfake Detection via Lip-Audio Cross-Attention and Facial Self-Attention

Aaditya Kharel

Manas Paranjape

Aniket Bera

160

12 Sep 2023

Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

International Conference on Machine Learning (ICML), 2023

Xinlei Niu

Christian J. Walder

J. Zhang

Charles Patrick Martin

BDL

283

05 Jun 2023

Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors

British Machine Vision Conference (BMVC), 2022

Vladimir E. Iashin

Weidi Xie

Esa Rahtu

Andrew Zisserman

146

13 Oct 2022

Deep Learning for Visual Speech Analysis: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

315

22 May 2022

End to End Lip Synchronization with a Temporal AutoEncoder

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

Yoav Shalev

Lior Wolf

30 Mar 2022

Audio-Visual Fusion Layers for Event Type Aware Video Recognition

In So Kweon

132

12 Feb 2022

Audio-Visual Synchronisation in the wild

Honglie Chen

Weidi Xie

Triantafyllos Afouras

Arsha Nagrani

Andrea Vedaldi

Andrew Zisserman

198

08 Dec 2021

Neural Dubber: Dubbing for Videos According to Scripts

Yuxuan Wang

Hang Zhao

DiffM

VGen

313

15 Oct 2021

Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers

Neural Information Processing Systems (NeurIPS), 2021

Nikita Dvornik

Isma Hadji

Konstantinos G. Derpanis

Animesh Garg

Allan D. Jepson

162

26 Aug 2021

Improved Lite Audio-Visual Speech Enhancement

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Shang-Yi Chuang

Hsin-Min Wang

Yu Tsao

299

30 Aug 2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning

Rui Feng

275

117

13 Aug 2020

End-to-End Lip Synchronisation Based on Pattern Classification

164

18 May 2020

AlignNet: A Unifying Approach to Audio-Visual Alignment

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

Jianren Wang

Zhaoyuan Fang

Hang Zhao

148

12 Feb 2020

Perfect match: Improved cross-modal embeddings for audio-visual synchronisation

Soo-Whan Chung

Joon Son Chung

Hong-Goo Kang

199

129

21 Sep 2018