
Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training Objectives

Abstract

Capturing symmetric (e.g., one country bordering another) and antisymmetric (e.g., parent_of) relations is crucial for a variety of applications. This paper tackles this challenge by introducing a novel Wikidata-derived natural language inference dataset designed to evaluate large language models (LLMs). Our findings reveal that LLMs perform comparably to random chance on this benchmark, highlighting a gap in relational understanding. To address this, we explore encoder retraining via contrastive learning with k-nearest neighbors. The retrained encoder matches the performance of fine-tuned classification heads while offering additional benefits, including greater efficiency in few-shot learning and improved mitigation of catastrophic forgetting.
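The abstract does not spell out the training objective, so the following is only a minimal sketch of one plausible reading: a supervised, InfoNCE-style contrastive loss in which each anchor's positives are restricted to its k nearest same-label neighbors within the batch. The function name knn_contrastive_loss, the PyTorch framing, the temperature value, and the in-batch neighbor selection are all assumptions for illustration, not the authors' implementation.

import torch
import torch.nn.functional as F


def knn_contrastive_loss(embeddings: torch.Tensor,
                         labels: torch.Tensor,
                         k: int = 5,
                         temperature: float = 0.1) -> torch.Tensor:
    """Contrastive loss where each anchor's positives are its k most similar
    in-batch examples that share the anchor's label; every other example in
    the batch acts as a negative."""
    z = F.normalize(embeddings, dim=-1)           # (B, d) unit vectors
    sim = z @ z.t() / temperature                 # (B, B) scaled cosine similarities
    B = z.size(0)
    eye = torch.eye(B, dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(eye, float('-inf'))     # exclude self-similarity

    # Positive candidates: same label, excluding the anchor itself.
    same_label = labels.unsqueeze(0) == labels.unsqueeze(1)
    pos_candidates = same_label & ~eye

    # Keep only the k most similar positive candidates per anchor.
    pos_sim = sim.masked_fill(~pos_candidates, float('-inf'))
    topk = pos_sim.topk(min(k, B - 1), dim=1).indices
    pos_mask = torch.zeros_like(pos_candidates)
    pos_mask.scatter_(1, topk, True)
    pos_mask &= pos_candidates                    # drop -inf fillers picked by topk

    # InfoNCE-style loss: row-wise log-softmax, averaged over the positives.
    log_prob = sim - sim.logsumexp(dim=1, keepdim=True)
    pos_counts = pos_mask.sum(dim=1).clamp(min=1)
    loss_per_anchor = -log_prob.masked_fill(~pos_mask, 0.0).sum(dim=1) / pos_counts

    # Ignore anchors that have no in-batch positive.
    has_pos = pos_mask.any(dim=1)
    if not has_pos.any():
        return embeddings.new_zeros(())
    return loss_per_anchor[has_pos].mean()


# Toy usage: random "encoder outputs" with binary labels standing in for
# symmetric vs. antisymmetric relation instances (illustrative only).
emb = torch.randn(16, 128, requires_grad=True)
labels = torch.randint(0, 2, (16,))
loss = knn_contrastive_loss(emb, labels, k=3)
loss.backward()

Restricting positives to the k nearest same-label neighbors, rather than all same-label examples, is one way such an objective could pull together only the most confusable instances of a relation type; whether the paper selects neighbors this way is an assumption here.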

@article{yuan2025_2504.16312,
  title={Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training Objectives},
  author={Zhangdie Yuan and Andreas Vlachos},
  journal={arXiv preprint arXiv:2504.16312},
  year={2025}
}