SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts

The European Symposium on Artificial Neural Networks (ESANN), 2023

Thanh Thi Nguyen

Campbell Wilson

Janis Dalins

116

28 Aug 2023

ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach

Internet, Multimedia Systems and Applications (IMSA), 2023

Abdelrahman Boda Sadallah

28 Aug 2023

An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

204

28 Aug 2023

Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level

Conference on Machine Translation (WMT), 2023

310

25 Aug 2023

Code Llama: Open Foundation Models for Code

Baptiste Rozière

...

Louis Martin

458

2,786

24 Aug 2023

Cabrita: closing the gap for foreign languages

Vinicius Fernandes Caridá

CLL

108

23 Aug 2023

Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge

IEEE International Conference on Computer Vision (ICCV), 2023

Minsu Kim

Jeong Hun Yeo

J. Choi

Y. Ro

210

18 Aug 2023

Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?

International Conference on Automated Software Engineering (ASE), 2023

A. Mastropaolo

M. D. Penta

Gabriele Bavota

158

17 Aug 2023

Lightweight Adaptation of Neural Language Models via Subspace Embedding

International Conference on Information and Knowledge Management (CIKM), 2023

Amit Kumar Jaiswal

Haiming Liu

150

16 Aug 2023

BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition

Workshop on Biomedical Natural Language Processing (BioNLP), 2023

Vera Pavlova

M. Makhlouf

221

16 Aug 2023

Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2023

245

16 Aug 2023

SOTASTREAM: A Streaming Approach to Machine Translation Training

Marcin Junczys-Dowmunt

152

14 Aug 2023

O-1: Self-training with Oracle and 1-best Hypothesis

Interspeech (Interspeech), 2023

M. Baskar

Andrew Rosenberg

Bhuvana Ramabhadran

Kartik Audhkhasi

VLM

174

14 Aug 2023

A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings

H. Wen

Jie Wang

Xiaodong Qiao

166

14 Aug 2023

A Case Study on Context Encoding in Multi-Encoder based Document-Level Neural Machine Translation

Machine Translation Summit (MT Summit), 2023

Ramakrishna Appicharla

Baban Gain

Santanu Pal

Asif Ekbal

180

11 Aug 2023

Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT

182

11 Aug 2023

IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer

International Conference on Neural Information Processing (ICONIP), 2023

127

10 Aug 2023

Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages

102

10 Aug 2023

Negative Lexical Constraints in Neural Machine Translation

Machine Translation Summit (MT Summit), 2023

120

07 Aug 2023

Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining

IAES International Journal of Artificial Intelligence (IJ-AI), 2023

177

07 Aug 2023

Spanish Pre-trained BERT Model and Evaluation Data

225

743

06 Aug 2023

N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets

200

04 Aug 2023

Federated Representation Learning for Automatic Speech Recognition

203

03 Aug 2023

Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

195

03 Aug 2023

ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders

...

249

02 Aug 2023

CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code

International Conference on Learning Representations (ICLR), 2023

Nadezhda Chirkova

Sergey Troshin

242

01 Aug 2023

SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation

Sadao Kurohashi

147

31 Jul 2023

BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering

International Conference on Multimedia Analysis and Pattern Recognition (ICMAPR), 2023

153

28 Jul 2023

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

International Conference on Learning Representations (ICLR), 2023

Hiroki Furuta

575

315

24 Jul 2023

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Interspeech (Interspeech), 2023

Ozlem Kalinli

206

22 Jul 2023

Incorporating Human Translator Style into English-Turkish Literary Machine Translation

European Association for Machine Translation Conferences/Workshops (EAMT), 2023

Zeynep Yirmibeşoğlu

164

21 Jul 2023

Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information

European Signal Processing Conference (EUSIPCO), 2023

Dejan Porjazovski

Tamás Grósz

M. Kurimo

155

21 Jul 2023

Prompting Large Language Models with Speech Recognition Abilities

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

Ozlem Kalinli

236

190

21 Jul 2023

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

Bo Wang

329

20 Jul 2023

Gradient Sparsification For Masked Fine-Tuning of Transformers

IEEE International Joint Conference on Neural Network (IJCNN), 2023

J. Ó. Neill

Sourav Dutta

163

19 Jul 2023

Llama 2: Open Foundation and Fine-Tuned Chat Models

Louis Martin

...

Sharan Narang

Sergey Edunov

8.2K

15,302

18 Jul 2023

Gloss Attention for Gloss-free Sign Language Translation

Computer Vision and Pattern Recognition (CVPR), 2023

Tianyun Zhong

Zhou Zhao

212

14 Jul 2023

Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling

Interspeech (Interspeech), 2023

Hengguan Huang

Jagadeesh Balam

Boris Ginsburg

181

13 Jul 2023

Copy Is All You Need

International Conference on Learning Representations (ICLR), 2023

244

13 Jul 2023

No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models

Neural Information Processing Systems (NeurIPS), 2023

427

12 Jul 2023

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Neural Information Processing Systems (NeurIPS), 2023

...

385

186

12 Jul 2023

PolyLM: An Open Source Polyglot Large Language Model

...

Dayiheng Liu

Fei Huang

235

12 Jul 2023

Large Language Models as General Pattern Machines

Conference on Robot Learning (CoRL), 2023

Montse Gonzalez Arenas

Kanishka Rao

Dorsa Sadigh

Andy Zeng

LLMAG

308

256

10 Jul 2023

Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing

Transactions of the Association for Computational Linguistics (TACL), 2023

Tom Sherborne

Tom Hosking

Mirella Lapata

271

09 Jul 2023

On decoder-only architecture for speech-to-text and large language model integration

Automatic Speech Recognition & Understanding (ASRU), 2023

...

534

186

08 Jul 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments

Automatic Speech Recognition & Understanding (ASRU), 2023

333

07 Jul 2023

Vision Language Transformers: A Survey

Clayton Fields

C. Kennington

VLM

182

06 Jul 2023

Focused Transformer: Contrastive Training for Context Scaling

Neural Information Processing Systems (NeurIPS), 2023

Henryk Michalewski

235

165

06 Jul 2023

Improving Language Plasticity via Pretraining with Active Forgetting

Neural Information Processing Systems (NeurIPS), 2023

Yihong Chen

Kelly Marchisio

Roberta Raileanu

David Ifeoluwa Adelani

431

03 Jul 2023

Challenges in Domain-Specific Abstractive Summarization and How to Overcome them

International Conference on Agents and Artificial Intelligence (ICAART), 2023

183

03 Jul 2023