Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.17200
Cited By
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
30 March 2023
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
Rui-Cang Xie
Morrie Doulaty
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision"
17 / 17 papers shown
Title
VALLR: Visual ASR Language Model for Lip Reading
Marshall Thomas
Edward Fish
Richard Bowden
29
0
0
27 Mar 2025
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition
Bowen Hao
Dongliang Zhou
Xiaojie Li
Xingyu Zhang
Liang Xie
Jianlong Wu
Erwei Yin
28
1
0
08 Jan 2025
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
M. Pantic
SSL
27
0
0
04 Nov 2024
Tailored Design of Audio-Visual Speech Recognition Models using Branchformers
David Gimeno-Gómez
Carlos David Martínez Hinarejos
86
2
0
09 Jul 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Manuel Cherep
Nikhil Singh
14
1
0
09 Jun 2024
Exploring the Impact of Synthetic Data for Aerial-view Human Detection
Hyungtae Lee
Yan Zhang
Yingzhe Shen
Heesung Kwon
Shuvra S. Bhattacharyya
27
1
0
24 May 2024
BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition
A. Haliassos
Andreas Zinonos
Rodrigo Mira
Stavros Petridis
Maja Pantic
VLM
SSL
AI4TS
20
11
0
02 Apr 2024
LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data
Hendrik Laux
Emil Mededovic
Ahmed Hallawa
Lukas Martin
A. Peine
Anke Schmeink
VLM
13
4
0
15 Dec 2023
Do VSR Models Generalize Beyond LRS3?
Y. A. D. Djilali
Sanath Narayan
Eustache Le Bihan
Haithem Boussaid
Ebtesam Almazrouei
Merouane Debbah
19
4
0
23 Nov 2023
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model
Jeong Hun Yeo
Minsu Kim
J. Choi
Dae Hoe Kim
Y. Ro
11
17
0
15 Aug 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
Xubo Liu
Qiushi Huang
Xinhao Mei
Haohe Liu
Qiuqiang Kong
...
Yu Zhang
Lilian H. Y. Tang
Mark D. Plumbley
Volkan Kilicc
Wenwu Wang
36
18
0
28 Oct 2022
Visual Speech Recognition for Multiple Languages in the Wild
Pingchuan Ma
Stavros Petridis
M. Pantic
VLM
112
95
0
26 Feb 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
80
40
0
25 Jan 2022
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021
Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Zhonghao Wang
Mo Yu
Yunchao Wei
Rogerio Feris
Jinjun Xiong
Wen-mei W. Hwu
Thomas S. Huang
Humphrey Shi
OOD
179
232
0
18 Mar 2020
StyleGAN2 Distillation for Feed-forward Image Manipulation
Yuri Viazovetskyi
V. Ivashkin
Evgenii Kashin
153
134
0
07 Mar 2020
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
214
2,224
0
14 Jun 2018
1