Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs

4 November 2024
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
M. Pantic
    SSL

Papers citing "Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs"

1 / 1 papers shown
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition
Andrew Rouditchenko
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
03 Feb 2025