Inducing lexicons of in-group language with socio-temporal context

In-group language is an important signifier of group dynamics. This paper proposes a novel method for inducing lexicons of in-group language that incorporates socio-temporal context. Existing methods for lexicon induction capture neither the evolving nature of in-group language nor the social structure of the community. Using dynamic word and user embeddings trained on conversations from online anti-women communities, our approach outperforms prior methods for lexicon induction. We develop a test set for the task of lexicon induction and a new lexicon of manosphere language, validated by human experts, which quantifies the relevance of each term to a specific sub-community at a given point in time. Finally, we present novel insights into in-group language that illustrate the utility of this approach.
@article{kock2025_2409.19257,
  title={Inducing lexicons of in-group language with socio-temporal context},
  author={Christine de Kock},
  journal={arXiv preprint arXiv:2409.19257},
  year={2025}
}