arXiv: 2401.03468

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation

7 January 2024

Papers citing "Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation"

3 / 3 papers shown

Unify Variables in Neural Scaling Laws for General Audio Representations via Embedding Effective Rank

157

0

0

13 Oct 2025

Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation

International Conference on Learning Representations (ICLR), 2025

556

5

0

23 Jan 2025

Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Yuchen Hu

360

19

0

11 Apr 2023
