ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.04796
  4. Cited By
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
v1v2 (latest)

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

6 August 2025
Negar Foroutan
Clara Meister
Debjit Paul
Joel Niklaus
Sina Ahmadi
Antoine Bosselut
Rico Sennrich
ArXiv (abs)PDFHTML

Papers citing "Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization"

2 / 2 papers shown
Title
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
Saumitra Yadav
Manish Shrivastava
28
0
0
05 Nov 2025
Back to Bytes: Revisiting Tokenization Through UTF-8
Back to Bytes: Revisiting Tokenization Through UTF-8
Amit Moryossef
Clara Meister
Pavel Stepachev
Desmond Elliott
36
0
0
19 Oct 2025
1