2
0

Study of scaling laws in language families

Abstract

This article investigates scaling laws within language families using data from over six thousand languages and analyzing emergent patterns observed in Zipf-like classification graphs. Both macroscopic (based on number of languages by family) and microscopic (based on numbers of speakers by language on a family) aspects of these classifications are examined. Particularly noteworthy is the discovery of a distinct division among the fourteen largest contemporary language families, excluding Afro-Asiatic and Nilo-Saharan languages. These families are found to be distributed across three language family quadruplets, each characterized by significantly different exponents in the Zipf graphs. This finding sheds light on the underlying structure and organization of major language families, revealing intriguing insights into the nature of linguistic diversity and distribution.

View on arXiv
@article{santos2025_2504.01681,
  title={ Study of scaling laws in language families },
  author={ Maelyson R. F. Santos and Marcelo A. F. Gomes },
  journal={arXiv preprint arXiv:2504.01681},
  year={ 2025 }
}
Comments on this paper