Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.09943
Cited By
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
29 August 2018
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting Character-Based Neural Machine Translation with Capacity and Compression"
21 / 21 papers shown
Title
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
37
2
0
28 Oct 2024
Smirk: An Atomically Complete Tokenizer for Molecular Foundation Models
Alexius Wadell
Anoushka Bhutani
Venkatasubramanian Viswanathan
121
0
0
19 Sep 2024
Does Character-level Information Always Improve DRS-based Semantic Parsing?
Tomoya Kurosawa
Hitomi Yanaka
21
0
0
04 Jun 2023
TranSFormer: Slow-Fast Transformer for Machine Translation
Bei Li
Yi Jing
Xu Tan
Zhen Xing
Tong Xiao
Jingbo Zhu
41
7
0
26 May 2023
Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages
Sonal Sannigrahi
Rachel Bawden
29
0
0
04 May 2023
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
30
140
0
20 Dec 2021
Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching
P. Chopra
Sai Krishna Rallabandi
A. Black
Khyathi Raghavi Chandu
22
6
0
01 Nov 2021
Why don't people use character-level machine translation?
Jindrich Libovický
Helmut Schmid
Alexander M. Fraser
65
28
0
15 Oct 2021
Improving Arabic Diacritization by Learning to Diacritize and Translate
Brian Thompson
A. Alshehri
32
10
0
29 Sep 2021
Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
Danielle Saunders
AI4CE
17
85
0
14 Apr 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
15
105
0
31 Dec 2020
Char2Subword: Extending the Subword Embedding Space Using Robust Character Compositionality
Gustavo Aguilar
Bryan McCann
Tong Niu
Nazneen Rajani
N. Keskar
Thamar Solorio
47
12
0
24 Oct 2020
On Target Segmentation for Direct Speech Translation
Mattia Antonino Di Gangi
Marco Gaido
Matteo Negri
Marco Turchi
34
14
0
10 Sep 2020
Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
19
45
0
03 May 2020
Adapting Multilingual Neural Machine Translation to Unseen Languages
Surafel Melaku Lakew
Alina Karakanta
Marcello Federico
Matteo Negri
Marco Turchi
31
20
0
30 Oct 2019
A Latent Morphology Model for Open-Vocabulary Neural Machine Translation
Duygu Ataman
Wilker Aziz
Alexandra Birch
13
16
0
30 Oct 2019
BPE-Dropout: Simple and Effective Subword Regularization
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
24
276
0
29 Oct 2019
Localization of Fake News Detection via Multitask Transfer Learning
Jan Christian Blaise Cruz
Julianne Agatha Tan
C. Cheng
23
33
0
21 Oct 2019
Lattice-Based Transformer Encoder for Neural Machine Translation
Fengshun Xiao
Jiangtong Li
Zhao Hai
Rui Wang
Kehai Chen
21
42
0
04 Jun 2019
Character-Aware Decoder for Translation into Morphologically Rich Languages
Adithya Renduchintala
Pamela Shapiro
Kevin Duh
Philipp Koehn
AI4CE
8
4
0
06 Sep 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1