arXiv:2508.06533
The Art of Breaking Words: Rethinking Multilingual Tokenizer Design
3 August 2025
Aamod Thakur
Ajay Nagpal
Atharva Savarkar
Kundeshwar Pundalik
Siddhesh Dosi
Piyush Sawarkar
Viraj Thakur
Rohit Saluja
Maunendra Sankar Desarkar
Ganesh Ramakrishnan
Papers citing "The Art of Breaking Words: Rethinking Multilingual Tokenizer Design"
Alternatives To Next Token Prediction In Text Generation - A Survey
Charlie Wyatt
Aditya Joshi
Flora D. Salim
29 Sep 2025