ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.18283
  4. Cited By
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent
  Classification Based on Common Voice

CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice

Interspeech (Interspeech), 2023
29 May 2023
Juan Pablo Zuluaga
Sara Ahmed
Danielius Visockas
Cem Subakan
    VLM
ArXiv (abs)PDFHTMLGithub (9888★)

Papers citing "CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice"

9 / 9 papers shown
Title
FAC-FACodec: Controllable Zero-Shot Foreign Accent Conversion with Factorized Speech Codec
FAC-FACodec: Controllable Zero-Shot Foreign Accent Conversion with Factorized Speech Codec
Yurii Halychanskyi
Cameron Churchwell
Yutong Wen
Volodymyr Kindratenko
52
0
0
12 Oct 2025
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances
Junhyeok Lee
Helin Wang
Yaohan Guan
Thomas Thebaud
Laureano Moro-Velazquez
Jesus Villalba
Najim Dehak
68
0
0
21 Sep 2025
Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe
Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe
Tiantian Feng
Kevin Huang
Anfeng Xu
Xuan Shi
Thanathai Lertpetchpun
Jihwan Lee
Yoonjeong Lee
D. Byrd
Shrikanth Narayanan
110
0
0
03 Aug 2025
The Prosody of Emojis
The Prosody of Emojis
Giulio Zhou
Tsz Kin Lam
Alexandra Birch
Barry Haddow
68
0
0
01 Aug 2025
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
Lixu Wang
Kaixiang Yao
Xinfeng Li
Dong Yang
Haoyang Li
Xiaofeng Wang
Wei Dong
AuLLM
172
4
0
14 Jul 2025
AccentBox: Towards High-Fidelity Zero-Shot Accent Generation
AccentBox: Towards High-Fidelity Zero-Shot Accent GenerationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jinzuomu Zhong
Korin Richmond
Zhiba Su
Siqi Sun
257
8
0
10 Jan 2025
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent ConversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Sho Inoue
Shuai Wang
Wanxing Wang
Pengcheng Zhu
Mengxiao Bi
Haizhou Li
242
3
0
14 Sep 2024
Big model only for hard audios: Sample dependent Whisper model selection
  for efficient inferences
Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences
Hugo Malard
Salah Zaiem
Robin Algayres
254
2
0
22 Sep 2023
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The WildIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
164
360
0
26 Feb 2019
1