MT-Adapted Datasheets for Datasets: Template and Repository
Marta R. Costa-jussá
Roger Creus
Oriol Domingo
A. Domínguez
Miquel Escobar
Cayetana López
Marina Garcia
Margarita Geleta

Abstract
In this report we are taking the standardized model proposed by Gebru et al. (2018) for documenting the popular machine translation datasets of the EuroParl (Koehn, 2005) and News-Commentary (Barrault et al., 2019). Within this documentation process, we have adapted the original datasheet to the particular case of data consumers within the Machine Translation area. We are also proposing a repository for collecting the adapted datasheets in this research area
View on arXivComments on this paper