Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.08250
Cited By
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
17 April 2020
George Sterpu
Christian Saam
N. Harte
Re-assign community
ArXiv
PDF
HTML
Papers citing
"How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition"
14 / 14 papers shown
Title
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Yusheng Dai
Hang Chen
Jun Du
Ruoyu Wang
Shihao Chen
Jie Ma
Haotian Wang
Chin-Hui Lee
38
4
0
07 Mar 2024
Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English
Xiaoming Ren
Chao Li
Shenjian Wang
Biao Li
17
0
0
28 Feb 2023
Streaming Audio-Visual Speech Recognition with Alignment Regularization
Pingchuan Ma
Niko Moritz
Stavros Petridis
Christian Fuegen
M. Pantic
26
2
0
03 Nov 2022
Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception
Jiadong Wang
Xinyuan Qian
Haizhou Li
17
14
0
05 Sep 2022
Visual Speech Recognition for Multiple Languages in the Wild
Pingchuan Ma
Stavros Petridis
M. Pantic
VLM
112
95
0
26 Feb 2022
Large-vocabulary Audio-visual Speech Recognition in Noisy Environments
Wentao Yu
Steffen Zeiler
D. Kolossa
46
3
0
10 Sep 2021
Fusing information streams in end-to-end audio-visual speech recognition
Wentao Yu
Steffen Zeiler
D. Kolossa
68
12
0
19 Apr 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021
AV Taris: Online Audio-Visual Speech Recognition
George Sterpu
N. Harte
15
1
0
14 Dec 2020
Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition
Wentao Yu
Steffen Zeiler
D. Kolossa
10
10
0
28 Jul 2020
Learning to Count Words in Fluent Speech enables Online Speech Recognition
George Sterpu
Christian Saam
N. Harte
6
4
0
08 Jun 2020
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
George Sterpu
Christian Saam
N. Harte
6
7
0
19 May 2020
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
782
0
16 Nov 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
214
7,687
0
17 Aug 2015
1