Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.06606
Cited By
Tailored Design of Audio-Visual Speech Recognition Models using Branchformers
9 July 2024
David Gimeno-Gómez
Carlos David Martínez Hinarejos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
4 / 4 papers shown
Title
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition
Andrew Rouditchenko
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
90
1
0
03 Feb 2025
Visual Speech Recognition for Multiple Languages in the Wild
Pingchuan Ma
Stavros Petridis
M. Pantic
VLM
112
95
0
26 Feb 2022
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
Jaesong Lee
Shinji Watanabe
111
135
0
05 Feb 2021
1