arXiv:2411.02256
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
4 November 2024
A. Haliassos, Rodrigo Mira, Honglie Chen, Zoe Landgraf, Stavros Petridis, M. Pantic
SSL
Papers citing "Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs"
1 / 1 papers shown
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition
Andrew Rouditchenko, Saurabhchand Bhati, Samuel Thomas, Hilde Kuehne, Rogerio Feris
03 Feb 2025