ResearchTrend.AI
Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

11 September 2022
Haz Sameen Shahgir
Khondker Salman Sayeed
Tanjeem Azwad Zaman

Papers citing "Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset"

6 / 6 papers shown
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
Aniket Tathe
Anand Kamble
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
32
1
0
01 Mar 2024
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2
Aniket Tathe
Anand Kamble
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
30
1
0
11 Jan 2024
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking
Fazle Rakib
Souhardya Saha Dip
Samiul Alam
Nazia Tasnim
Md. Istiak Hossain Shihab
...
Farig Sadeque
Sayma Sultana Chowdhury
Tahsin Reasat
Asif Sushmit
Ahmed Imtiaz Humayun
11
6
0
15 May 2023
Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla
Ahnaf Mozib Samin
M. Kobir
Md. Mushtaq Shahriyar Rafee
M. F. Ahmed
Mehedi Hasan
Partha Ghosh
Shafkat Kibria
M. S. Rahman
SSL
18
0
0
24 Oct 2022
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
27
77
0
23 Sep 2021
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
James Qin
Daniel S. Park
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Quoc V. Le
Yonghui Wu
VLM
SSL
146
308
0
20 Oct 2020