2410.04292
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset
5 October 2024
Farhan Samir
Emily P. Ahn
Shreya Prakash
Márton Soskuthy
Vered Shwartz
Jian Zhu
Papers citing "Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset"
No papers