Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.04986
Cited By
Investigating an approach for low resource language dataset creation, curation and classification: Setswana and Sepedi
18 February 2020
Vukosi Marivate
T. Sefara
Vongani Chabalala
Keamogetswe Makhaya
T. Mokgonyane
Rethabile Mokoena
Abiodun Modupe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Investigating an approach for low resource language dataset creation, curation and classification: Setswana and Sepedi"
7 / 7 papers shown
Title
Socially Responsible Data for Large Multilingual Language Models
Andrew Smart
Ben Hutchinson
L. M. Amugongo
Suzanne Dikker
Alex Zito
...
Seyi Olojo
Stanley Uwakwe
Edem Wornyo
Sonja Schmer-Galunder
Jamila Smith-Loud
39
3
0
08 Sep 2024
Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati
Andani Madodonga
Vukosi Marivate
Matthew Adendorff
16
1
0
12 Jun 2023
LR-Sum: Summarization for Less-Resourced Languages
Chester Palen-Michel
Constantine Lignos
9
4
0
19 Dec 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Z. Chen
Yonghui Wu
Macduff Hughes
54
98
0
09 May 2022
Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments
Antonis Maronikolakis
Axel Wisiorek
Leah Nann
Haris Jabbar
Sahana Udupa
Hinrich Schütze
22
24
0
22 Mar 2022
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi
Andre Niyongabo Rubungo
Hong Qu
Julia Kreutzer
Li Huang
21
38
0
23 Oct 2020
Low-resource Languages: A Review of Past Work and Future Challenges
Alexandre Magueresse
Vincent Carles
Evan Heetderks
20
164
0
12 Jun 2020
1