ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.00923
24
1

Preserving Multilingual Quality While Tuning Query Encoder on English Only

1 July 2024
Oleg V. Vasilyev
Randy Sawaya
John Bohannon
ArXivPDFHTML
Abstract

A query encoder of a dual passage retrieval system can be tuned for specific types of queries or domains, while the precomputed and stored documents representations are kept intact. Switching from one query encoder to another when needed is easily feasible, unlike overhauling the embeddings of a whole knowledge base. In this work we raise a question: Can the generic, original qualities of the encoder be preserved or at least left not too degraded when it is tuned on a narrow domain? We conducted experiments on a high quality multilingual embedding model: Tuning it on a single English-only dataset, we observe that the tuning not only preserves the multilingual qualities, but even improves them. The embedding qualities on distinctly different data are also improved or at least preserved. Drawing on our observations, we suggest a more general hypothesis: Tuning with intentionally low learning rate can preserve or improve a system's properties acquired in training, but not specifically targeted by tuning. We call this adiabatic tuning and provide tentative explanations.

View on arXiv
@article{vasilyev2025_2407.00923,
  title={ Preserving Multilingual Quality While Tuning Query Encoder on English Only },
  author={ Oleg Vasilyev and Randy Sawaya and John Bohannon },
  journal={arXiv preprint arXiv:2407.00923},
  year={ 2025 }
}
Comments on this paper