Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport

Abstract
Bilingual lexicons form a critical component of various natural language processing applications, including unsupervised and semisupervised machine translation and crosslingual information retrieval. We improve bilingual lexicon induction performance across 40 language pairs with a graph-matching method based on optimal transport. The method is especially strong with low amounts of supervision.
View on arXivComments on this paper