Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

11 September 2022

Papers citing "Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset"

6 / 6 papers shown

Title
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART Aniket Tathe Anand Kamble Suyash Kumbharkar Atharva Bhandare Anirban C. Mitra 32 1 0 01 Mar 2024
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 Aniket Tathe Anand Kamble Suyash Kumbharkar Atharva Bhandare Anirban C. Mitra 30 1 0 11 Jan 2024
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking Fazle Rakib Souhardya Saha Dip Samiul Alam Nazia Tasnim Md. Istiak Hossain Shihab ... Farig Sadeque Sayma Sultana Chowdhury Tahsin Reasat Asif Sushmit Ahmed Imtiaz Humayun 11 6 0 15 May 2023
Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla Ahnaf Mozib Samin M. Kobir Md. Mushtaq Shahriyar Rafee M. F. Ahmed Mehedi Hasan Partha Ghosh Shafkat Kibria M. S. Rahman SSL 18 0 0 24 Oct 2022
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition Qiantong Xu Alexei Baevski Michael Auli VLM 27 77 0 23 Sep 2021
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Yu Zhang James Qin Daniel S. Park Wei Han Chung-Cheng Chiu Ruoming Pang Quoc V. Le Yonghui Wu VLM SSL 146 308 0 20 Oct 2020