Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.01589
Cited By
Morphological Word Segmentation on Agglutinative Languages for Neural Machine Translation
2 January 2020
Yirong Pan
Xiao Li
Yating Yang
Rui Dong
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Morphological Word Segmentation on Agglutinative Languages for Neural Machine Translation"
10 / 10 papers shown
Title
From Smør-re-brød to Subwords: Training LLMs on Danish, One Morpheme at a Time
Mikkel Wildner Kildeberg
Emil Allerslev Schledermann
Nicolaj Larsen
Rob van der Goot
35
0
0
02 Apr 2025
Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That!
Niyati Bafna
Philipp Koehn
David Yarowsky
40
1
0
16 Mar 2024
How Important Is Tokenization in French Medical Masked Language Models?
Yanis Labrak
Adrien Bazoge
B. Daille
Mickael Rouvier
Richard Dufour
41
1
0
22 Feb 2024
MorphPiece : A Linguistic Tokenizer for Large Language Models
Jeffrey Hsu
32
3
0
14 Jul 2023
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
Benjamin Minixhofer
Jonas Pfeiffer
Ivan Vulić
32
6
0
23 May 2023
Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Marwa Gaser
Manuel Mager
Injy Hamed
Nizar Habash
Slim Abdennadher
Ngoc Thang Vu
39
6
0
11 Oct 2022
How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in Neural Machine Translation?
Ali Araabi
Christof Monz
Vlad Niculae
28
10
0
10 Aug 2022
Linguistically inspired roadmap for building biologically reliable protein language models
Mai Ha Vu
Rahmad Akbar
Philippe A. Robert
B. Swiatczak
Victor Greiff
G. K. Sandve
Dag Trygve Tryslew Haug
46
35
0
03 Jul 2022
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
32
142
0
20 Dec 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,929
0
17 Aug 2015
1