1804.09713
End-to-End Multimodal Speech Recognition
25 April 2018
Shruti Palaskar
Ramon Sanabria
Florian Metze
Papers citing "End-to-End Multimodal Speech Recognition"
7 / 7 papers shown
Title
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR
Paul Hongsuck Seo
Arsha Nagrani
Cordelia Schmid
29
15
0
29 Mar 2023
Multimodal Speech Recognition for Language-Guided Embodied Agents
Allen Chang
Xiaoyuan Zhu
Aarav Monga
Seoho Ahn
Tejas Srinivasan
Jesse Thomason
AuLLM
24
3
0
27 Feb 2023
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations
Dan Oneaţă
H. Cucu
19
19
0
27 Apr 2022
Fine-Grained Grounding for Multimodal Speech Recognition
Tejas Srinivasan
Ramon Sanabria
Florian Metze
Desmond Elliott
23
11
0
05 Oct 2020
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization
Paul Pu Liang
Zhun Liu
Yao-Hung Hubert Tsai
Qibin Zhao
Ruslan Salakhutdinov
Louis-Philippe Morency
AI4TS
30
81
0
01 Jul 2019
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
259
1,896
0
10 Jan 2017
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015