2211.02133
Streaming Audio-Visual Speech Recognition with Alignment Regularization
3 November 2022
Pingchuan Ma
Niko Moritz
Stavros Petridis
Christian Fuegen
M. Pantic
Papers citing "Streaming Audio-Visual Speech Recognition with Alignment Regularization"
5 / 5 papers shown
Title
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Jeff Hwang
Moto Hira
Caroline Chen
Xiaohui Zhang
Zhaoheng Ni
...
Yumeng Tao
Robin Scheibler
Samuele Cornell
Sean Kim
Stavros Petridis
25
22
0
27 Oct 2023
Visual Speech Recognition for Multiple Languages in the Wild
Pingchuan Ma
Stavros Petridis
M. Pantic
VLM
112
95
0
26 Feb 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
80
40
0
25 Jan 2022
Fusing information streams in end-to-end audio-visual speech recognition
Wentao Yu
Steffen Zeiler
D. Kolossa
68
12
0
19 Apr 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021