SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown

The Linear Representation Hypothesis and the Geometry of Large Language Models

International Conference on Machine Learning (ICML), 2023

467

322

07 Nov 2023

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE

International Conference on Learning Representations (ICLR), 2023

Wanli Ouyang

Yu Qiao

Jing Shao

MoE

268

05 Nov 2023

Too Much Information: Keeping Training Simple for BabyLMs

Lukas Edman

Lisa Bylinina

192

03 Nov 2023

Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants

International Journal of Speech Technology (IJST), 2023

259

02 Nov 2023

ACES: Translation Accuracy Challenge Sets at WMT 2023

Conference on Machine Translation (WMT), 2023

186

02 Nov 2023

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities

Information Fusion (Inf. Fusion), 2023

Md Farhan Ishmam

Md Sakib Hossain Shovon

M. F. Mridha

Nilanjan Dey

399

01 Nov 2023

The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Evgeniia Tokarchuk

Vlad Niculae

183

31 Oct 2023

Towards a Deep Understanding of Multilingual End-to-End Speech Translation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yikun Lei

209

31 Oct 2023

Is Robustness Transferable across Languages in Multilingual Neural Machine Translation?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

287

31 Oct 2023

CreoleVal: Multilingual Multitask Benchmarks for Creoles

Transactions of the Association for Computational Linguistics (TACL), 2023

Marcell Richard Fekete

...

Daniel Hershcovich

352

30 Oct 2023

Skywork: A More Open Bilingual Foundation Model

...

275

121

30 Oct 2023

Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

357

29 Oct 2023

Probing LLMs for Joint Encoding of Linguistic Categories

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Giulio Starace

Konstantinos Papakostas

Rochelle Choenni

Apostolos Panagiotopoulos

Matteo Rosati

Alina Leidinger

Ekaterina Shutova

258

28 Oct 2023

Unified Segment-to-Segment Framework for Simultaneous Sequence Generation

Neural Information Processing Systems (NeurIPS), 2023

Shaolei Zhang

Yang Feng

260

27 Oct 2023

Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways

Venkata S Govindarajan

Juan Diego Rodriguez

Kaj Bostrom

Kyle Mahowald

299

26 Oct 2023

Learning to Abstract with Nonparametric Variational Information Bottleneck

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

221

26 Oct 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning

Neural Information Processing Systems (NeurIPS), 2023

Dayiheng Liu

Fei Huang

Jun Xie

250

26 Oct 2023

CL-MASR: A Continual Learning Benchmark for Multilingual ASR

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Mirco Ravanelli

266

25 Oct 2023

Enhanced Simultaneous Machine Translation with Word-level Policies

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Kang Kim

Hankyu Cho

234

25 Oct 2023

Samsung R&D Institute Philippines at WMT 2023

Conference on Machine Translation (WMT), 2023

Jan Christian Blaise Cruz

151

25 Oct 2023

MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications

187

24 Oct 2023

A Joint Matrix Factorization Analysis of Multilingual Representations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

254

24 Oct 2023

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

142

23 Oct 2023

Code-Switching with Word Senses for Pretraining in Neural Machine Translation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

243

21 Oct 2023

Ask Language Model to Clean Your Noisy Translation Data

Quinten Bolding

Baohao Liao

Brandon James Denis

Jun Luo

Christof Monz

219

20 Oct 2023

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

T. Park

He Huang

Ante Jukić

Kunal Dhawan

Krishna C. Puvvada

Nithin Rao Koluguri

Nikolay Karpov

A. Laptev

Jagadeesh Balam

Boris Ginsburg

200

18 Oct 2023

Direct Neural Machine Translation with Task-level Mixture of Experts models

Isidora Chara Tourni

Subhajit Naskar

MoE

220

18 Oct 2023

SPEED: Speculative Pipelined Execution for Efficient Decoding

Coleman Hooper

Sehoon Kim

204

18 Oct 2023

BUT CHiME-7 system description

135

18 Oct 2023

ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation

Jaap Jumelet

Michael Hanna

Marianne de Heer Kloots

Anna Langedijk

Charlotte Pouw

Oskar van der Wal

199

17 Oct 2023

ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

226

17 Oct 2023

IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xu Huang

111

17 Oct 2023

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

152

17 Oct 2023

Approximating Two-Layer Feedforward Networks for Efficient Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

414

16 Oct 2023

Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Shaomu Tan

Christof Monz

353

16 Oct 2023

Optimized Tokenization for Transcribed Error Correction

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Tomer Wullach

Shlomo E. Chazan

198

16 Oct 2023

Prediction of Arabic Legal Rulings using Large Language Models

218

16 Oct 2023

End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis

167

16 Oct 2023

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

...

211

16 Oct 2023

UvA-MT's Participation in the WMT23 General Translation Shared Task

Christof Monz

216

15 Oct 2023

Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling

Tiberiu Boros

Stefan Daniel Dumitrescu

Ionut Mironica

Radu Chivereanu

GAN

152

14 Oct 2023

Embarrassingly Simple Text Watermarks

322

13 Oct 2023

Tokenizer Choice For LLM Training: Negligible or Crucial?

...

562

102

12 Oct 2023

Toward Joint Language Modeling for Speech Units and Text

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

233

12 Oct 2023

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

International Conference on Machine Learning (ICML), 2023

468

11 Oct 2023

MatFormer: Nested Transformer for Elastic Inference

Neural Information Processing Systems (NeurIPS), 2023

Tim Dettmers

...

255

11 Oct 2023

An Empirical Study of Instruction-tuning Large Language Models in Chinese

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zheng Lin

199

11 Oct 2023

On the Impact of Cross-Domain Data on German Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

...

Daniel Truhn

Jan Egger

Jiang Bian

Jens Kleesiek

Yonghui Wu

188

11 Oct 2023

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Wei Zhang

Rui Yan

321

106

11 Oct 2023

Acoustic Model Fusion for End-to-end Speech Recognition

Automatic Speech Recognition & Understanding (ASRU), 2023

...

209

10 Oct 2023