ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.12121
  4. Cited By
Applying Wav2vec2.0 to Speech Recognition in Various Low-resource
  Languages
v1v2 (latest)

Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages

22 December 2020
Cheng Yi
Jianzhong Wang
Ning Cheng
Shiyu Zhou
Bo Xu
    SSLVLM
ArXiv (abs)PDFHTML

Papers citing "Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages"

33 / 33 papers shown
Poem Meter Classification of Recited Arabic Poetry: Integrating High-Resource Systems for a Low-Resource Task
Poem Meter Classification of Recited Arabic Poetry: Integrating High-Resource Systems for a Low-Resource Task
Maged S. Al-Shaibani
Zaid Alyafeai
Irfan Ahmad
243
0
0
16 Apr 2025
Efficient Finetuning for Dimensional Speech Emotion Recognition in the Age of Transformers
Efficient Finetuning for Dimensional Speech Emotion Recognition in the Age of TransformersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Aneesha Sampath
James Tavernor
E. Provost
373
6
0
17 Feb 2025
Semantically Corrected Amharic Automatic Speech Recognition
Semantically Corrected Amharic Automatic Speech Recognition
Samuael Adnew
Paul Pu Liang
164
3
0
20 Apr 2024
Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of
  Speech Sound Disorders in Korean children
Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean childrenClinical Linguistics & Phonetics (Clin Linguist Phon), 2024
Taekyung Ahn
Yeonjung Hong
Younggon Im
Do Hyung Kim
Dayoung Kang
...
Jae Won Kim
Min Jung Kim
Ah-ra Cho
Dae-Hyun Jang
Hosung Nam
191
6
0
13 Mar 2024
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2
  on custom dataset and mBART
Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART
Aniket Tathe
Anand Kamble
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
169
3
0
01 Mar 2024
End to end Hindi to English speech conversion using Bark, mBART and a
  finetuned XLSR Wav2Vec2
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2
Aniket Tathe
Anand Kamble
Suyash Kumbharkar
Atharva Bhandare
Anirban C. Mitra
169
5
0
11 Jan 2024
Sparsely Shared LoRA on Whisper for Child Speech Recognition
Sparsely Shared LoRA on Whisper for Child Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
W. Liu
Ying Qin
Zhiyuan Peng
Tan Lee
334
32
0
21 Sep 2023
Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech
  Recognition for Children VS. Adults
Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. AdultsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
Ahmed Adel Attia
Jing Liu
Wei Ai
Dorottya Demszky
Carol Espy-Wilson
290
40
0
12 Sep 2023
A Novel Self-training Approach for Low-resource Speech Recognition
A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023
Satwinder Singh
Feng Hou
Ruili Wang
246
13
0
10 Aug 2023
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic
  Singing Voice Understanding Tasks: Three Case Studies
Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case StudiesAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2023
Yuya Yamamoto
233
3
0
22 Jun 2023
Unsupervised speech intelligibility assessment with utterance level
  alignment distance between teacher and learner Wav2Vec-2.0 representations
Unsupervised speech intelligibility assessment with utterance level alignment distance between teacher and learner Wav2Vec-2.0 representations
Nayan Anand
Meenakshi Sirigiraju
Chiranjeevi Yarra
155
1
0
15 Jun 2023
DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
DistilXLSR: A Light Weight Cross-Lingual Speech Representation ModelInterspeech (Interspeech), 2023
Haoyu Wang
Siyuan Wang
Weiqiang Zhang
Jinfeng Bai
186
2
0
02 Jun 2023
AfriNames: Most ASR models "butcher" African Names
AfriNames: Most ASR models "butcher" African Names
Tobi Olatunji
Tejumade Afonja
Bonaventure F. P. Dossou
A. Tonja
Chris C. Emezue
Amina Mardiyyah Rufai
Sahib Singh
201
8
0
01 Jun 2023
Political corpus creation through automatic speech recognition on EU
  debates
Political corpus creation through automatic speech recognition on EU debates
Hugo De Vos
Suzan Verberne
159
2
0
17 Apr 2023
WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for
  Whisper-based Speech Interactions
WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech InteractionsInternational Conference on Human Factors in Computing Systems (CHI), 2023
Jun Rekimoto
291
37
0
03 Mar 2023
Phoneme Segmentation Using Self-Supervised Speech Models
Phoneme Segmentation Using Self-Supervised Speech ModelsSpoken Language Technology Workshop (SLT), 2022
Luke Strgar
David Harwath
SSL
235
13
0
02 Nov 2022
Combining Contrastive and Non-Contrastive Losses for Fine-Tuning
  Pretrained Models in Speech Analysis
Combining Contrastive and Non-Contrastive Losses for Fine-Tuning Pretrained Models in Speech AnalysisSpoken Language Technology Workshop (SLT), 2022
Florian Lux
Ching-Yi Chen
Ngoc Thang Vu
130
1
0
21 Oct 2022
Multilingual Zero Resource Speech Recognition Base on Self-Supervise
  Pre-Trained Acoustic Models
Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic ModelsInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Haoyu Wang
Weiqiang Zhang
Hongbin Suo
Yulong Wan
189
1
0
13 Oct 2022
Applying wav2vec2 for Speech Recognition on Bengali Common Voices
  Dataset
Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Haz Sameen Shahgir
Khondker Salman Sayeed
Tanjeem Azwad Zaman
190
10
0
11 Sep 2022
DualVoice: Speech Interaction that Discriminates between Normal and
  Whispered Voice Input
DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice InputACM Symposium on User Interface Software and Technology (UIST), 2022
Jun Rekimoto
152
8
0
22 Aug 2022
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for
  Speech Recognition
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech RecognitionInternational Conference on Language Resources and Evaluation (LREC), 2022
Rodolfo Zevallos
Luis Camacho
Nelsi Melgarejo
142
5
0
12 Jul 2022
Speech Emotion: Investigating Model Representations, Multi-Task Learning
  and Knowledge Distillation
Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge DistillationInterspeech (Interspeech), 2022
Vikramjit Mitra
H. Chien
Vasudha Kowtha
Joseph Y. Cheng
Erdrin Azemi
171
8
0
02 Jul 2022
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech
  Self-Supervised Learning
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning
Yeonghyeon Lee
Kangwook Jang
Jahyun Goo
Youngmoon Jung
Hoi-Rim Kim
315
40
0
01 Jul 2022
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised
  Learning and Its Application to Children's ASR
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASRInterspeech (Interspeech), 2022
Ruchao Fan
Abeer Alwan
271
39
0
16 Jun 2022
Combining Spectral and Self-Supervised Features for Low Resource Speech
  Recognition and Translation
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and TranslationInterspeech (Interspeech), 2022
Dan Berrebbi
Jiatong Shi
Brian Yan
Osbel López-Francisco
Jonathan D. Amith
Shinji Watanabe
258
32
0
05 Apr 2022
SelfRemaster: Self-Supervised Speech Restoration with
  Analysis-by-Synthesis Approach Using Channel Modeling
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel ModelingInterspeech (Interspeech), 2022
Takaaki Saeki
Shinnosuke Takamichi
Tomohiko Nakamura
Naoko Tanji
Hiroshi Saruwatari
220
10
0
24 Mar 2022
Improving non-autoregressive end-to-end speech recognition with
  pre-trained acoustic and language models
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
241
30
0
25 Jan 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with
  Pretrained Acoustic and Language Model
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
167
32
0
14 Dec 2021
Decoupling recognition and transcription in Mandarin ASR
Decoupling recognition and transcription in Mandarin ASR
Jiahong Yuan
Xingyu Cai
Dongji Gao
Renjie Zheng
Liang Huang
Kenneth Church
235
13
0
02 Aug 2021
Automatic recognition of suprasegmentals in speech
Automatic recognition of suprasegmentals in speech
Jiahong Yuan
Neville Ryant
Xingyu Cai
Kenneth Church
M. Liberman
225
14
0
02 Aug 2021
End-to-end Speech Translation via Cross-modal Progressive Training
End-to-end Speech Translation via Cross-modal Progressive TrainingInterspeech (Interspeech), 2021
Rong Ye
Mingxuan Wang
Lei Li
270
79
0
21 Apr 2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and
  Backward Transformers
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Yusuke Kida
Tatsuya Komatsu
M. Togami
172
1
0
21 Apr 2021
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for
  Low-resource Speech Recognition
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021
Cheng Yi
Shiyu Zhou
Bo Xu
242
44
0
17 Jan 2021
1
Page 1 of 1