Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.03468
Cited By
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation
7 January 2024
Qiu-shi Zhu
Jie Zhang
Yu Gu
Yuli Hu
Lirong Dai
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation"
7 / 7 papers shown
Title
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
Sungnyun Kim
Sungwoo Cho
Sangmin Bae
Kangwook Jang
Se-Young Yun
SSL
68
1
0
23 Jan 2025
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Yufeng Yang
Desh Raj
Ju Lin
Niko Moritz
J. Jia
...
Egor Lakomkin
Yiteng Huang
Jacob Donley
Jay Mahadeokar
Ozlem Kalinli
19
2
0
17 Sep 2024
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
61
57
0
07 Oct 2022
Visual Speech Recognition for Multiple Languages in the Wild
Pingchuan Ma
Stavros Petridis
M. Pantic
VLM
112
144
0
26 Feb 2022
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
224
0
12 Feb 2021
Robust Multi-channel Speech Recognition using Frequency Aligned Network
Taejin Park
K. Kumatani
Minhua Wu
Shiva Sundaram
29
6
0
06 Feb 2020
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
M. Pantic
168
238
0
23 Jan 2020
1