v1v2 (latest)

Perfect match: Improved cross-modal embeddings for audio-visual synchronisation

21 September 2018

Soo-Whan Chung

Joon Son Chung

Hong-Goo Kang

ArXiv (abs)PDF HTML

Papers citing "Perfect match: Improved cross-modal embeddings for audio-visual synchronisation"

28 / 78 papers shown

Automatic audiovisual synchronisation for ultrasound tongue imagingSpeech Communication (Speech Commun.), 2021

31 May 2021

Divide and Contrast: Self-supervised Learning from Uncurated DataIEEE International Conference on Computer Vision (ICCV), 2021

335

110

17 May 2021

Representation Learning via Global Temporal Alignment and Cycle-ConsistencyComputer Vision and Pattern Recognition (CVPR), 2021

Isma Hadji

Konstantinos G. Derpanis

Allan D. Jepson

AI4TS

300

11 May 2021

Self-supervised object detection from audio-visual correspondenceComputer Vision and Pattern Recognition (CVPR), 2021

Triantafyllos Afouras

Yuki M. Asano

Francois Fagan

Andrea Vedaldi

Florian Metze

SSL

326

13 Apr 2021

Contrastive Learning of Global-Local Video Representations

203

07 Apr 2021

Composable Augmentation Encoding for Video Representation LearningIEEE International Conference on Computer Vision (ICCV), 2021

254

01 Apr 2021

Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech SeparationComputer Vision and Pattern Recognition (CVPR), 2021

178

25 Mar 2021

Automated Video Labelling: Identifying Faces by Corroborative EvidenceConference on Multimedia Information Processing and Retrieval (MIPR), 2021

Andrew Brown

Ernesto Coto

Andrew Zisserman

CVBM

108

10 Feb 2021

Cross-Modal Contrastive Learning for Text-to-Image GenerationComputer Vision and Pattern Recognition (CVPR), 2021

583

423

12 Jan 2021

MAAS: Multi-modal Assignation for Active Speaker DetectionIEEE International Conference on Computer Vision (ICCV), 2021

Juan Carlos León Alcázar

Fabian Caba Heilbron

Ali K. Thabet

Guohao Li

354

11 Jan 2021

VisualVoice: Audio-Visual Speech Separation with Cross-Modal ConsistencyComputer Vision and Pattern Recognition (CVPR), 2021

Ruohan Gao

Kristen Grauman

CVBM

453

239

08 Jan 2021

Playing a Part: Speaker Verification at the MoviesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Joon Son Chung

221

29 Oct 2020

Themes Informed Audio-visual Correspondence Learning

198

14 Sep 2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning

Rui Feng

295

118

13 Aug 2020

Self-Supervised Learning of Audio-Visual Objects from VideoEuropean Conference on Computer Vision (ECCV), 2020

Triantafyllos Afouras

Andrew Owens

Joon Son Chung

Andrew Zisserman

SSL

243

278

10 Aug 2020

Spot the conversation: speaker diarisation in the wild

Joon Son Chung

Jaesung Huh

Arsha Nagrani

Triantafyllos Afouras

Andrew Zisserman

VGen

296

180

02 Jul 2020

Modality Dropout for Improved Performance-driven Talking FacesInternational Conference on Multimodal Interaction (ICMI), 2020

Ahmed Hussen Abdelaziz

211

27 May 2020

What Makes for Good Views for Contrastive Learning?

434

1,495

20 May 2020

Active Speakers in Context

Juan Carlos León Alcázar

134

20 May 2020

End-to-End Lip Synchronisation Based on Pattern Classification

164

18 May 2020

Disentangled Speech Embeddings using Cross-modal Self-supervisionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Joon Son Chung

189

20 Feb 2020

AlignNet: A Unifying Approach to Audio-Visual AlignmentIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

Jianren Wang

Zhaoyuan Fang

Hang Zhao

152

12 Feb 2020

Deep Audio-Visual Learning: A SurveyInternational Journal of Automation and Computing (IJAC), 2020

223

178

14 Jan 2020

Detecting Adversarial Attacks On Audiovisual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

171

18 Dec 2019

The sound of my voice: speaker representation loss for target voice separationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Seongkyu Mun

Soyeon Choe

Jaesung Huh

Joon Son Chung

195

06 Nov 2019

Synchronising audio and ultrasound by learning cross-modal embeddingsInterspeech (Interspeech), 2019

125

01 Jul 2019

Naver at ActivityNet Challenge 2019 -- Task B Active Speaker Detection (AVA)

Joon Son Chung

104

25 Jun 2019

Who said that?: Audio-visual speaker diarisation of real-world meetingsInterspeech (Interspeech), 2019

Joon Son Chung

Bong-Jin Lee

Icksang Han

24 Jun 2019