2508.04796
Cited By
v1
v2 (latest)
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
6 August 2025
Negar Foroutan
Clara Meister
Debjit Paul
Joel Niklaus
Sina Ahmadi
Antoine Bosselut
Rico Sennrich
Papers citing "Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization"
2 / 2 papers shown
Title
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
Saumitra Yadav
Manish Shrivastava
28
0
0
05 Nov 2025
Back to Bytes: Revisiting Tokenization Through UTF-8
Amit Moryossef
Clara Meister
Pavel Stepachev
Desmond Elliott
36
0
0
19 Oct 2025