Bilingual Lexicon Induction for Low-Resource Languages using Graph
Matching via Optimal Transport
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- OT
Abstract
Bilingual lexicons form a critical component of various natural language processing applications, including unsupervised and semisupervised machine translation and crosslingual information retrieval. We improve bilingual lexicon induction performance across 40 language pairs with a graph-matching method based on optimal transport. The method is especially strong with low amounts of supervision.
View on arXivComments on this paper
