SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Taku Kudo

John Richardson

ArXiv (abs)PDF HTML Github (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown

BloombergGPT: A Large Language Model for Finance

685

1,157

30 Mar 2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

...

248

30 Mar 2023

TreePiece: Faster Semantic Parsing via Tree TokenizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Sida I. Wang

Akshat Shrivastava

S. Livshits

131

30 Mar 2023

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLPAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

494

28 Mar 2023

Sigmoid Loss for Language Image Pre-TrainingIEEE International Conference on Computer Vision (ICCV), 2023

1.8K

2,253

27 Mar 2023

Cross-utterance ASR Rescoring with Graph-based Label PropagationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Venkatesh Ravichandran

117

27 Mar 2023

An Information Extraction Study: Take In Mind the Tokenization!

Christos Theodoropoulos

Marie-Francine Moens

128

27 Mar 2023

Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization

Bashar Al-Rfooh

Gheith A. Abandah

Rami Al-Rfou

149

25 Mar 2023

Neuro-Symbolic Execution of Generic Source Code

Yaojie Hu

Jin Tian

NAI

222

23 Mar 2023

SwissBERT: The Multilingual Language Model for SwitzerlandSwiss Text Analytics Conference (SwissText), 2023

Jannis Vamvas

Johannes Graen

Rico Sennrich

269

23 Mar 2023

A Gold Standard Dataset for the Reviewer Assignment Problem

294

23 Mar 2023

JaCoText: A Pretrained Model for Java Code-Text Generation

Jessica Nayeli López Espejel

Mahaman Sanoussi Yahaya Alassan

Walid Dahhane

E. Ettifouri

131

22 Mar 2023

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition

Xiaoyu Yang

Qiujia Li

Chuxu Zhang

P. Woodland

206

20 Mar 2023

Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models

Hui Huang

Muyun Yang

Zhoujun Li

Chao Bian

VLM

124

20 Mar 2023

HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanismIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jiangyu Han

Heng Lu

127

15 Mar 2023

Learning Cross-lingual Visual Speech RepresentationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

163

14 Mar 2023

Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and InferenceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

169

14 Mar 2023

Scaling Vision-Language Models with Sparse Mixture of ExpertsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yuxiong He

329

13 Mar 2023

Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation DatasetAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023

Arun Tejasvi Chaganty

410

13 Mar 2023

Proactive Prioritization of App Issues via Contrastive Learning

228

12 Mar 2023

Unsupervised Language agnostic WER Standardization

09 Mar 2023

Spelling convention sensitivity in neural language modelsFindings (Findings), 2023

Elizabeth Nielsen

Christo Kirov

Brian Roark

115

06 Mar 2023

Exploiting Language Relatedness in Machine Translation Through Domain Adaptation Techniques

109

03 Mar 2023

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

145

01 Mar 2023

How to DP-fy ML: A Practical Guide to Machine Learning with Differential PrivacyJournal of Artificial Intelligence Research (JAIR), 2023

504

240

01 Mar 2023

Are More Layers Beneficial to Graph Transformers?International Conference on Learning Representations (ICLR), 2023

202

01 Mar 2023

EvoPrompting: Language Models for Code-Level Neural Architecture SearchNeural Information Processing Systems (NeurIPS), 2023

466

124

28 Feb 2023

A Token-Wise Beam Search Algorithm for RNN-TAutomatic Speech Recognition & Understanding (ASRU), 2023

Gil Keren

261

28 Feb 2023

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video CaptioningComputer Vision and Pattern Recognition (CVPR), 2023

506

326

27 Feb 2023

Language Is Not All You Need: Aligning Perception with Language ModelsNeural Information Processing Systems (NeurIPS), 2023

...

Xia Song

345

680

27 Feb 2023

LLaMA: Open and Efficient Foundation Language Models

...

7.3K

17,868

27 Feb 2023

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yoohwan Kwon

Soo-Whan Chung

MoE

188

27 Feb 2023

Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face VideoAAAI Conference on Artificial Intelligence (AAAI), 2023

Minsu Kim

Chae Won Kim

Y. Ro

CVBM DiffM

144

27 Feb 2023

Elementwise Language Representation

Du-Yeong Kim

Jeeeun Kim

205

27 Feb 2023

Improving Massively Multilingual ASR With Auxiliary CTC ObjectivesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jiatong Shi

263

24 Feb 2023

Cross-Lingual Transfer of Cognitive Processing ComplexityFindings (Findings), 2023

C. Pouw

Nora Hollenstein

Lisa Beinborn

275

24 Feb 2023

Impact of Subword Pooling Strategy on Cross-lingual Event Detection

232

22 Feb 2023

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yuchen Hu

Chen Chen

Ruizhe Li

Qiu-shi Zhu

Eng Siong Chng

326

22 Feb 2023

Topic-switch adapted Japanese Dialogue System based on PLATO-2

178

22 Feb 2023

Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning

Weichen Li

R. Devidze

Sophie Fellenz

317

21 Feb 2023

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal PropagationInternational Conference on Learning Representations (ICLR), 2023

231

20 Feb 2023

Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled RepresentationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

M. Moradshahi

Sina J. Semnani

M. Lam

207

18 Feb 2023

RETVec: Resilient and Efficient Text VectorizerNeural Information Processing Systems (NeurIPS), 2023

152

18 Feb 2023

Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories

154

17 Feb 2023

Lip-to-Speech Synthesis in the Wild with Multi-task LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Minsu Kim

Joanna Hong

Y. Ro

219

17 Feb 2023

E2E Spoken Entity Extraction for Virtual AgentsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Karan Singla

Yeon-Jun Kim

S. Bangalore

454

16 Feb 2023

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Gustavo A. Giménez-Lugo

Rolando A. Coto Solano

Katharina Kann

186

15 Feb 2023

Scaling Vision Transformers to 22 Billion ParametersInternational Conference on Machine Learning (ICML), 2023

...

407

774

10 Feb 2023

PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error CorrectionInterspeech (Interspeech), 2023

137

10 Feb 2023

Language-Aware Multilingual Machine Translation with Self-Supervised LearningFindings (Findings), 2023

197

10 Feb 2023