Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions

30 June 2019
Tejas Srinivasan
Ramon Sanabria
Florian Metze

Papers citing "Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions"

6 / 6 papers shown
Multimodal Speech Recognition for Language-Guided Embodied Agents
Interspeech, 2023
Allen Chang
Xiaoyuan Zhu
Aarav Monga
Seoho Ahn
Tejas Srinivasan
Jesse Thomason
27 Feb 2023
AVATAR: Unconstrained Audiovisual Speech Recognition
Interspeech, 2022
Valentin Gabeur
Paul Hongsuck Seo
Arsha Nagrani
Chen Sun
Karteek Alahari
Cordelia Schmid
15 Jun 2022
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
Dan Oneaţă
H. Cucu
27 Apr 2022
Listen, Look and Deliberate: Visual context-aware speech recognition using pre-trained text-video representations
Shahram Ghorbani
Yashesh Gaur
Yu Shi
Jinyu Li
08 Nov 2020
Fine-Grained Grounding for Multimodal Speech Recognition
Findings, 2020
Tejas Srinivasan
Ramon Sanabria
Florian Metze
Desmond Elliott
05 Oct 2020
Looking Enhances Listening: Recovering Missing Speech Using Images
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Tejas Srinivasan
Ramon Sanabria
Florian Metze
13 Feb 2020