Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.11189
Cited By
The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
20 March 2021
Jonne Saleva
Constantine Lignos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation"
12 / 12 papers shown
Title
Morphological Typology in BPE Subword Productivity and Language Modeling
Iñigo Parra
36
0
0
31 Oct 2024
Unsupervised Morphological Tree Tokenizer
Qingyang Zhu
Xiang Hu
Pengyu Ji
Wei Wu
Kewei Tu
36
0
0
21 Jun 2024
Revisiting subword tokenization: A case study on affixal negation in large language models
Thinh Hung Truong
Yulia Otmakhova
Karin Verspoor
Trevor Cohn
Timothy Baldwin
47
2
0
03 Apr 2024
MorphPiece : A Linguistic Tokenizer for Large Language Models
Jeffrey Hsu
32
3
0
14 Jul 2023
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
30
10
0
11 Jun 2023
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
Benjamin Minixhofer
Jonas Pfeiffer
Ivan Vulić
32
6
0
23 May 2023
Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Marwa Gaser
Manuel Mager
Injy Hamed
Nizar Habash
Slim Abdennadher
Ngoc Thang Vu
31
6
0
11 Oct 2022
Benchmarking Azerbaijani Neural Machine Translation
Chih-Chen Chen
William Chen
21
0
0
29 Jul 2022
The SIGMORPHON 2022 Shared Task on Morpheme Segmentation
Khuyagbaatar Batsuren
Gábor Bella
Aryaman Arora
Viktor Martinović
Kyle Gorman
...
Magda vSevvcíková
Katevrina Pelegrinová
Fausto Giunchiglia
Ryan Cotterell
Ekaterina Vylomova
31
39
0
15 Jun 2022
BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
Manuel Mager
Arturo Oncevay
Elisabeth Mager
Katharina Kann
Ngoc Thang Vu
43
19
0
16 Mar 2022
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
32
142
0
20 Dec 2021
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
31
148
0
01 Sep 2021
1