SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown

No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation

Automatic Speech Recognition & Understanding (ASRU), 2023

189

10 Oct 2023

Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

386

09 Oct 2023

Neural Language Model Pruning for Automatic Speech Recognition

222

05 Oct 2023

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

International Conference on Learning Representations (ICLR), 2023

551

04 Oct 2023

ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer Layers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yiming Wang

Jinyu Li

195

03 Oct 2023

Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns

International Conference on Learning Representations (ICLR), 2023

Brian DuSell

David Chiang

394

03 Oct 2023

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Shinji Watanabe

250

02 Oct 2023

Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation

Junjie Yang

Liang Ding

Li Shen

266

28 Sep 2023

Transformer-VQ: Linear-Time Transformers via Vector Quantization

International Conference on Learning Representations (ICLR), 2023

Albert Mohwald

249

28 Sep 2023

Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts

Bipin Rajendran

Bashir M. Al-Hashimi

MLLM VLM

253

27 Sep 2023

Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

B. Grimstad

Xuankai Chang

Antonios Anastasopoulos

Yuya Fujita

Shinji Watanabe

288

27 Sep 2023

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

A. Hussein

Brian Yan

Antonios Anastasopoulos

Shinji Watanabe

Sanjeev Khudanpur

177

27 Sep 2023

Speech collage: code-switched audio generation by collaging monolingual corpora

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Sanjeev Khudanpur

210

27 Sep 2023

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

International Workshop on Spoken Language Translation (IWSLT), 2023

Sara Papi

Marco Gaido

Matteo Negri

243

27 Sep 2023

Segmentation-Free Streaming Machine Translation

Transactions of the Association for Computational Linguistics (TACL), 2023

Javier Iranzo-Sánchez

242

26 Sep 2023

Small-scale proxies for large-scale Transformer training instabilities

International Conference on Learning Representations (ICLR), 2023

...

Jascha Narain Sohl-Dickstein

Kelvin Xu

Jaehoon Lee

Justin Gilmer

Simon Kornblith

319

135

25 Sep 2023

Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Automatic Speech Recognition & Understanding (ASRU), 2023

...

347

25 Sep 2023

Importance of Smoothness Induced by Optimizers in FL4ASR: Towards Understanding Federated Learning for End-to-End ASR

Automatic Speech Recognition & Understanding (ASRU), 2023

185

22 Sep 2023

Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts

Emad A. Alghamdi

Jezia Zakraoui

Fares A. Abanmy

281

22 Sep 2023

JCoLA: Japanese Corpus of Linguistic Acceptability

International Conference on Language Resources and Evaluation (LREC), 2023

Taiga Someya

Yushi Sugimoto

Yohei Oseki

212

22 Sep 2023

Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation

International Joint Conference on Natural Language Processing (IJCNLP), 2023

Bar Iluz

Tomasz Limisiewicz

Gabriel Stanovsky

David Mareček

195

21 Sep 2023

Kosmos-2.5: A Multimodal Literate Model

...

260

20 Sep 2023

Sequence-to-Sequence Spanish Pre-trained Language Models

International Conference on Language Resources and Evaluation (LREC), 2023

373

20 Sep 2023

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

217

20 Sep 2023

MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods

International Conference on Learning Representations (ICLR), 2023

413

19 Sep 2023

A Family of Pretrained Transformer Language Models for Russian

International Conference on Language Resources and Evaluation (LREC), 2023

...

Alena Fenogenova

318

19 Sep 2023

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Krishna C. Puvvada

Nithin Rao Koluguri

Kunal Dhawan

Jagadeesh Balam

Boris Ginsburg

137

19 Sep 2023

Language Modeling Is Compression

International Conference on Learning Representations (ICLR), 2023

Grégoire Delétang

Anian Ruoss

Paul-Ambroise Duquenne

...

Marcus Hutter

418

201

19 Sep 2023

Nebula: Self-Attention for Dynamic Malware Analysis

IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023

Dmitrijs Trizna

Christian Scano

Battista Biggio

Fabio Roli

269

19 Sep 2023

Baichuan 2: Open Large-scale Language Models

...

803

927

19 Sep 2023

Adapting Large Language Models via Reading Comprehension

348

18 Sep 2023

Improved Factorized Neural Transducer Model For text-only Domain Adaptation

Interspeech (Interspeech), 2023

Jing Liu

Jianwei Yu

Xie Chen

330

18 Sep 2023

How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models?

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Danni Liu

Jan Niehues

215

15 Sep 2023

Visual Speech Recognition for Languages with Limited Labeled Data using Automatic Labels from Whisper

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jeong Hun Yeo

269

15 Sep 2023

CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

141

15 Sep 2023

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

162

14 Sep 2023

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Interspeech (Interspeech), 2023

Yifan Yang

Xie Chen

206

14 Sep 2023

Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

360

14 Sep 2023

The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models

Dimitris Spathis

F. Kawsar

AI4TS

191

12 Sep 2023

AstroLLaMA: Towards Specialized Foundation Models in Astronomy

...

173

12 Sep 2023

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

Computer Speech and Language (CSL), 2023

...

261

11 Sep 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset

Neural Information Processing Systems (NeurIPS), 2023

Christopher A. Choquette-Choo

...

285

200

09 Sep 2023

Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition

European Signal Processing Conference (EUSIPCO), 2023

195

09 Sep 2023

Data-Juicer: A One-Stop Data Processing System for Large Language Models

...

Jingren Zhou

297

05 Sep 2023

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

261

05 Sep 2023

One Wide Feedforward is All You Need

Conference on Machine Translation (WMT), 2023

243

04 Sep 2023

Towards Foundational AI Models for Additive Manufacturing: Language Models for G-Code Debugging, Manipulation, and Comprehension

Anushrut Jignasu

Kelly O. Marshall

Baskar Ganapathysubramanian

Aditya Balu

Chinmay Hegde

A. Krishnamurthy

ELM AI4CE

133

04 Sep 2023

Multilingual Text Representation

Fahim Faisal

203

02 Sep 2023

RepCodec: A Speech Representation Codec for Speech Tokenization

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Zhichao Huang

Chutong Meng

Tom Ko

212

31 Aug 2023

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Luke Zettlemoyer

Madian Khabsa

360

237

31 Aug 2023