Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.01768
Cited By
Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception
5 September 2022
Jiadong Wang
Xinyuan Qian
Haizhou Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception"
5 / 5 papers shown
Title
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
Jiadong Wang
Xinyuan Qian
Malu Zhang
R. Tan
Haizhou Li
EGVM
17
92
0
29 Mar 2023
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
89
40
0
25 Jan 2022
Fusing information streams in end-to-end audio-visual speech recognition
Wentao Yu
Steffen Zeiler
D. Kolossa
73
12
0
19 Apr 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
224
0
12 Feb 2021
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
783
0
16 Nov 2016
1