SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Taku Kudo

John Richardson

ArXiv (abs)PDF HTML Github (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown

AutoNMT: A Framework to Streamline the Research of Seq2Seq Models

Salvador Carrión

F. Casacuberta

09 Feb 2023

Measuring The Impact Of Programming Language DistributionInternational Conference on Machine Learning (ICML), 2023

449

03 Feb 2023

The unreasonable effectiveness of few-shot learning for machine translationInternational Conference on Machine Learning (ICML), 2023

Colin Cherry

320

125

02 Feb 2023

KNNs of Semantic Encodings for Rating PredictionInternational Conference on Communications in Computing (ICCC), 2023

01 Feb 2023

Adaptive Machine Translation with Large Language ModelsEuropean Association for Machine Translation Conferences/Workshops (EAMT), 2023

289

109

30 Jan 2023

Adaptive Computation with Elastic Input SequenceInternational Conference on Machine Learning (ICML), 2023

Fuzhao Xue

Valerii Likhosherstov

Anurag Arnab

N. Houlsby

Mostafa Dehghani

Yang You

244

30 Jan 2023

Pre-training for Speech Translation: CTC Meets Optimal TransportInternational Conference on Machine Learning (ICML), 2023

379

27 Jan 2023

Open Problems in Applied Deep Learning

M. Raissi

AI4CE

233

26 Jan 2023

XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Luke Zettlemoyer

Madian Khabsa

268

101

25 Jan 2023

Ensemble Transfer Learning for Multilingual Coreference Resolution

T. Lai

Heng Ji

160

22 Jan 2023

REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion DetectionKnowledge and Information Systems (KAIS), 2023

277

21 Jan 2023

Language Agnostic Data-Driven Inverse Text NormalizationInterspeech (Interspeech), 2023

101

20 Jan 2023

BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition

Will Rieger

BDL UQCV

125

16 Jan 2023

Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition

204

06 Jan 2023

HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language Understanding

153

05 Jan 2023

Audio-Visual Efficient Conformer for Robust Speech RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Maxime Burchi

Radu Timofte

VLM

214

04 Jan 2023

Cramming: Training a Language Model on a Single GPU in One DayInternational Conference on Machine Learning (ICML), 2022

Jonas Geiping

Tom Goldstein

MoE

270

103

28 Dec 2022

Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation

272

24 Dec 2022

Pushing the performances of ASR models on English and Spanish accents

209

22 Dec 2022

Uncontrolled Lexical Exposure Leads to Overestimation of Compositional Generalization in Pretrained Models

Najoung Kim

Tal Linzen

P. Smolensky

220

21 Dec 2022

ORCA: A Challenging Benchmark for Arabic Language UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

AbdelRahim Elmadany

El Moatez Billah Nagoudi

Muhammad Abdul-Mageed

ELM

298

21 Dec 2022

Beyond Contrastive Learning: A Variational Generative Model for Multilingual RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

John Wieting

J. Clark

William W. Cohen

Graham Neubig

Taylor Berg-Kirkpatrick

284

21 Dec 2022

Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Kelly Marchisio

Patrick Lewis

Yihong Chen

Mikel Artetxe

265

20 Dec 2022

ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Jonas Belouadi

Steffen Eger

322

20 Dec 2022

Little Red Riding Hood Goes Around the Globe:Crosslingual Story Planning and Generation with Large Language ModelsInternational Conference on Language Resources and Evaluation (LREC), 2022

220

20 Dec 2022

SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers

Hongyi Yuan

Zheng Yuan

Chuanqi Tan

Fei Huang

Songfang Huang

DiffM

241

20 Dec 2022

GanLM: Encoder-Decoder Pre-training with an Auxiliary DiscriminatorAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Jian Yang

Yuwei Yin

Liqun Yang

Zhoujun Li

188

20 Dec 2022

A Survey on Pretrained Language Models for Neural Code Intelligence

Yichen Xu

Yanqiao Zhu

159

20 Dec 2022

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

231

20 Dec 2022

Tokenization Consistency Matters for Generative Models on Extractive NLP TasksConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

163

19 Dec 2022

Synthetic Pre-Training Tasks for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

248

19 Dec 2022

(Psycho-)Linguistic Features Meet Transformer Models for Improved Explainable and Controllable Text Simplification

195

19 Dec 2022

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based AugmentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ioannis Tsiamas

José A. R. Fonollosa

Marta R. Costa-jussá

288

19 Dec 2022

A Natural Bias for Language Generation ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Wojciech Stokowiec

183

19 Dec 2022

Large Language Models Meet NL2Code: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Daoguang Zan

240

239

19 Dec 2022

WACO: Word-Aligned Contrastive Learning for Speech TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Siqi Ouyang

Rong Ye

Lei Li

336

19 Dec 2022

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Xingshan Zeng

Liangyou Li

Qun Liu

157

17 Dec 2022

Controlling Styles in Neural Machine Translation with Activation PromptAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

240

17 Dec 2022

Planting and Mitigating Memorized Content in Predictive-Text Language Models

16 Dec 2022

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

320

15 Dec 2022

CLIPPO: Image-and-Language Understanding from Pixels OnlyComputer Vision and Pattern Recognition (CVPR), 2022

343

15 Dec 2022

Advancing Multilingual Pre-training: TRIP Triangular Document-level Pre-training for Multilingual Language Models

205

15 Dec 2022

Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

167

15 Dec 2022

Causes and Cures for Interference in Multilingual TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

308

14 Dec 2022

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

238

13 Dec 2022

Jointly Learning Visual and Auditory Speech Representations from Raw DataInternational Conference on Learning Representations (ICLR), 2022

309

12 Dec 2022

M3ST: Mix at Three Levels for Speech TranslationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

293

07 Dec 2022

Rethinking the Objectives of Vector-Quantized Tokenizers for Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2022

Ying Shan

219

06 Dec 2022

Document-Level Abstractive Summarization

Gonçalo Raposo

Afonso Raposo

Ana Sofia Carmo

126

06 Dec 2022

LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition

Yuguang Yang

Yu Pan

Jingjing Yin

Heng Lu

252

05 Dec 2022