ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.01589
  4. Cited By
Morphological Word Segmentation on Agglutinative Languages for Neural
  Machine Translation

Morphological Word Segmentation on Agglutinative Languages for Neural Machine Translation

2 January 2020
Yirong Pan
Xiao Li
Yating Yang
Rui Dong
ArXivPDFHTML

Papers citing "Morphological Word Segmentation on Agglutinative Languages for Neural Machine Translation"

10 / 10 papers shown
Title
From Smør-re-brød to Subwords: Training LLMs on Danish, One Morpheme at a Time
From Smør-re-brød to Subwords: Training LLMs on Danish, One Morpheme at a Time
Mikkel Wildner Kildeberg
Emil Allerslev Schledermann
Nicolaj Larsen
Rob van der Goot
35
0
0
02 Apr 2025
Pointer-Generator Networks for Low-Resource Machine Translation: Don't
  Copy That!
Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That!
Niyati Bafna
Philipp Koehn
David Yarowsky
40
1
0
16 Mar 2024
How Important Is Tokenization in French Medical Masked Language Models?
How Important Is Tokenization in French Medical Masked Language Models?
Yanis Labrak
Adrien Bazoge
B. Daille
Mickael Rouvier
Richard Dufour
41
1
0
22 Feb 2024
MorphPiece : A Linguistic Tokenizer for Large Language Models
MorphPiece : A Linguistic Tokenizer for Large Language Models
Jeffrey Hsu
32
3
0
14 Jul 2023
CompoundPiece: Evaluating and Improving Decompounding Performance of
  Language Models
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models
Benjamin Minixhofer
Jonas Pfeiffer
Ivan Vulić
32
6
0
23 May 2023
Exploring Segmentation Approaches for Neural Machine Translation of
  Code-Switched Egyptian Arabic-English Text
Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Marwa Gaser
Manuel Mager
Injy Hamed
Nizar Habash
Slim Abdennadher
Ngoc Thang Vu
39
6
0
11 Oct 2022
How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in
  Neural Machine Translation?
How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in Neural Machine Translation?
Ali Araabi
Christof Monz
Vlad Niculae
28
10
0
10 Aug 2022
Linguistically inspired roadmap for building biologically reliable
  protein language models
Linguistically inspired roadmap for building biologically reliable protein language models
Mai Ha Vu
Rahmad Akbar
Philippe A. Robert
B. Swiatczak
Victor Greiff
G. K. Sandve
Dag Trygve Tryslew Haug
46
35
0
03 Jul 2022
Between words and characters: A Brief History of Open-Vocabulary
  Modeling and Tokenization in NLP
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
32
142
0
20 Dec 2021
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,929
0
17 Aug 2015
1