arXiv: 2401.03468
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation

7 January 2024
Qiu-shi Zhu
Jie Zhang
Yu Gu
Yuli Hu
Lirong Dai
SSL

Papers citing "Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation"

3 / 3 papers shown
Unify Variables in Neural Scaling Laws for General Audio Representations via Embedding Effective Rank
Xuyao Deng
Yanjie Sun
Yong Dou
Kele Xu
13 Oct 2025
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
International Conference on Learning Representations (ICLR), 2025
Sungnyun Kim
Sungwoo Cho
Sangmin Bae
Kangwook Jang
Se-Young Yun
SSL
23 Jan 2025
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Yuchen Hu
Cheng Chen
Qiu-shi Zhu
Eng Siong Chng
11 Apr 2023