2212.05301
Cited By
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
10 December 2022
Chen Chen
Yuchen Hu
Qiang Zhang
Heqing Zou
Beier Zhu
E. Chng
Papers citing "Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning"
6 / 6 papers shown
Title
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
Sungnyun Kim
Sungwoo Cho
Sangmin Bae
Kangwook Jang
Se-Young Yun
SSL
68
1
0
23 Jan 2025
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen
Ruizhe Li
Yuchen Hu
Sabato Marco Siniscalchi
Pin-Yu Chen
Ensiong Chng
Chao-Han Huck Yang
24
19
0
08 Feb 2024
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
Chen Chen
Yuchen Hu
Weiwei Weng
Chng Eng Siong
DiffM
30
19
0
23 Feb 2023
Unsupervised Noise adaptation using Data Simulation
Chen Chen
Yuchen Hu
Heqing Zou
Linhui Sun
Chng Eng Siong
23
13
0
23 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
E. Chng
26
15
0
22 Feb 2023
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
224
0
12 Feb 2021