v1v2 (latest)

Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages

22 December 2020

Papers citing "Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages"

33 / 33 papers shown

Poem Meter Classification of Recited Arabic Poetry: Integrating High-Resource Systems for a Low-Resource Task

Maged S. Al-Shaibani

Zaid Alyafeai

Irfan Ahmad

243

16 Apr 2025

Efficient Finetuning for Dimensional Speech Emotion Recognition in the Age of TransformersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

Aneesha Sampath

James Tavernor

E. Provost

373

17 Feb 2025

Semantically Corrected Amharic Automatic Speech Recognition

Samuael Adnew

Paul Pu Liang

164

20 Apr 2024

Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean childrenClinical Linguistics & Phonetics (Clin Linguist Phon), 2024

...

191

13 Mar 2024

Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART

169

01 Mar 2024

End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2

169

11 Jan 2024

Sparsely Shared LoRA on Whisper for Child Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

334

21 Sep 2023

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. AdultsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

290

12 Sep 2023

A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023

Satwinder Singh

Feng Hou

Ruili Wang

246

10 Aug 2023

Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case StudiesAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023

Yuya Yamamoto

233

22 Jun 2023

Unsupervised speech intelligibility assessment with utterance level alignment distance between teacher and learner Wav2Vec-2.0 representations

Nayan Anand

Meenakshi Sirigiraju

Chiranjeevi Yarra

155

15 Jun 2023

DistilXLSR: A Light Weight Cross-Lingual Speech Representation ModelInterspeech (Interspeech), 2023

186

02 Jun 2023

AfriNames: Most ASR models "butcher" African Names

Tobi Olatunji

Tejumade Afonja

Bonaventure F. P. Dossou

A. Tonja

Chris C. Emezue

Amina Mardiyyah Rufai

Sahib Singh

201

01 Jun 2023

Political corpus creation through automatic speech recognition on EU debates

Hugo De Vos

Suzan Verberne

159

17 Apr 2023

WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech InteractionsInternational Conference on Human Factors in Computing Systems (CHI), 2023

Jun Rekimoto

291

03 Mar 2023

Phoneme Segmentation Using Self-Supervised Speech ModelsSpoken Language Technology Workshop (SLT), 2022

Luke Strgar

David Harwath

SSL

235

02 Nov 2022

Combining Contrastive and Non-Contrastive Losses for Fine-Tuning Pretrained Models in Speech AnalysisSpoken Language Technology Workshop (SLT), 2022

Florian Lux

Ching-Yi Chen

Ngoc Thang Vu

130

21 Oct 2022

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic ModelsInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

189

13 Oct 2022

Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

Haz Sameen Shahgir

Khondker Salman Sayeed

Tanjeem Azwad Zaman

190

11 Sep 2022

DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice InputACM Symposium on User Interface Software and Technology (UIST), 2022

Jun Rekimoto

152

22 Aug 2022

Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech RecognitionInternational Conference on Language Resources and Evaluation (LREC), 2022

Rodolfo Zevallos

Luis Camacho

Nelsi Melgarejo

142

12 Jul 2022

Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge DistillationInterspeech (Interspeech), 2022

171

02 Jul 2022

FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

315

01 Jul 2022

DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASRInterspeech (Interspeech), 2022

Ruchao Fan

Abeer Alwan

271

16 Jun 2022

Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and TranslationInterspeech (Interspeech), 2022

Dan Berrebbi

Jiatong Shi

Brian Yan

Osbel López-Francisco

Jonathan D. Amith

Shinji Watanabe

258

05 Apr 2022

SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel ModelingInterspeech (Interspeech), 2022

Hiroshi Saruwatari

220

24 Mar 2022

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Pengyuan Zhang

241

25 Jan 2022

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

167

14 Dec 2021

Decoupling recognition and transcription in Mandarin ASR

235

02 Aug 2021

Automatic recognition of suprasegmentals in speech

225

02 Aug 2021

End-to-end Speech Translation via Cross-modal Progressive TrainingInterspeech (Interspeech), 2021

Rong Ye

Mingxuan Wang

Lei Li

270

21 Apr 2021

Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers

Yusuke Kida

Tatsuya Komatsu

M. Togami

172

21 Apr 2021

Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021

Cheng Yi

Shiyu Zhou

Bo Xu

242

17 Jan 2021