SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXiv (abs) | PDF | HTML | GitHub (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown
Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts
The European Symposium on Artificial Neural Networks (ESANN), 2023
Thanh Thi Nguyen
Campbell Wilson
Janis Dalins
116
35
0
28 Aug 2023
ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach
Internet, Multimedia Systems and Applications (IMSA), 2023
Abdelrahman Boda Sadallah
Omar Ahmed
Shimaa S. Mohamed
Omar Hatem
Doaa Hesham
A. Yousef
66
5
0
28 Aug 2023
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Pengzhi Gao
Ruiqing Zhang
Zhongjun He
Hua Wu
Haifeng Wang
204
7
0
28 Aug 2023
Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level
Conference on Machine Translation (WMT), 2023
Daniel Deutsch
Juraj Juraska
M. Finkelstein
Markus Freitag
310
13
0
25 Aug 2023
Code Llama: Open Foundation Models for Code
Baptiste Rozière
Jonas Gehring
Fabian Gloeckle
Sten Sootla
Itai Gat
...
Hugo Touvron
Louis Martin
Nicolas Usunier
Thomas Scialom
Gabriel Synnaeve
ELM, ALM
458
2,786
0
24 Aug 2023
Cabrita: closing the gap for foreign languages
Celio H. N. Larcher
Marcos Piau
Paulo Finardi
P. Gengo
P. Esposito
Vinicius Fernandes Caridá
CLL
108
35
0
23 Aug 2023
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge
IEEE International Conference on Computer Vision (ICCV), 2023
Minsu Kim
Jeong Hun Yeo
J. Choi
Y. Ro
210
27
0
18 Aug 2023
Towards Automatically Addressing Self-Admitted Technical Debt: How Far Are We?
International Conference on Automated Software Engineering (ASE), 2023
A. Mastropaolo
M. D. Penta
Gabriele Bavota
158
13
0
17 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
International Conference on Information and Knowledge Management (CIKM), 2023
Amit Kumar Jaiswal
Haiming Liu
150
3
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Workshop on Biomedical Natural Language Processing (BioNLP), 2023
Vera Pavlova
M. Makhlouf
221
3
0
16 Aug 2023
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2023
Running Zhao
Jiang-Tao Luca Yu
Haiying Zhao
Edith C.H. Ngai
245
10
0
16 Aug 2023
SOTASTREAM: A Streaming Approach to Machine Translation Training
Matt Post
Thamme Gowda
Roman Grundkiewicz
Huda Khayrallah
Rohit Jain
Marcin Junczys-Dowmunt
152
6
0
14 Aug 2023
O-1: Self-training with Oracle and 1-best Hypothesis
Interspeech (Interspeech), 2023
M. Baskar
Andrew Rosenberg
Bhuvana Ramabhadran
Kartik Audhkhasi
VLM
174
0
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
166
0
0
14 Aug 2023
A Case Study on Context Encoding in Multi-Encoder based Document-Level Neural Machine Translation
Machine Translation Summit (MT Summit), 2023
Ramakrishna Appicharla
Baban Gain
Santanu Pal
Asif Ekbal
180
1
0
11 Aug 2023
Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT
Jing Yang
Cong Liu
Wendy Deng
Dangwei Wu
Chunhua Weng
Yunyun Zhou
Kai Wang
182
31
0
11 Aug 2023
IIHT: Medical Report Generation with Image-to-Indicator Hierarchical Transformer
International Conference on Neural Information Processing (ICONIP), 2023
Keqi Fan
Xiaohao Cai
M. Niranjan
MedIm, ViT
127
7
0
10 Aug 2023
Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages
Danish Ebadulla
Rahul Raman
S. Natarajan
Hridhay Kiran Shetty
A. Shenoy
102
1
0
10 Aug 2023
Negative Lexical Constraints in Neural Machine Translation
Machine Translation Summit (MT Summit), 2023
Josef Jon
Dušan Variš
Michal Novák
João Paulo Aires
Ondrej Bojar
120
2
0
07 Aug 2023
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining
IAES International Journal of Artificial Intelligence (IJ-AI), 2023
Nour Eddine Zekaoui
Siham Yousfi
Maryem Rhanoui
M. Mikram
177
4
0
07 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
225
743
0
06 Aug 2023
N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets
Wang Yau Li
Shreekantha Nadig
K. Chang
Zafarullah Mahmood
Riqiang Wang
Simon Vandieken
Jonas Robertson
Frederic Mailhot
200
0
0
04 Aug 2023
Federated Representation Learning for Automatic Speech Recognition
Guruprasad V Ramesh
Gopinath Chennupati
Milind Rao
Anit Kumar Sahu
Ariya Rastrow
J. Droppo
203
0
0
03 Aug 2023
Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Minsu Kim
J. Choi
Dahun Kim
Y. Ro
195
10
0
03 Aug 2023
ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders
Shawn Xu
Ling Yang
Christopher J. Kelly
M. Sieniek
Timo Kohlberger
...
Shruthi Prabhakara
Daniel Golden
Rory Pilgrim
Krish Eswaran
Andrew Sellergren
LM&MA, MedIm
249
69
0
02 Aug 2023
CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code
International Conference on Learning Representations (ICLR), 2023
Nadezhda Chirkova
Sergey Troshin
242
9
0
01 Aug 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation
Israfel Salazar
Mary Dabre
Chenhui Chu
Sadao Kurohashi
Eiichiro Sumita
147
5
0
31 Jul 2023
BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering
International Conference on Multimedia Analysis and Pattern Recognition (ICMAPR), 2023
Khiem Vinh Tran
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ViT
153
3
0
28 Jul 2023
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
International Conference on Learning Representations (ICLR), 2023
Izzeddin Gur
Hiroki Furuta
Austin Huang
Mustafa Safdari
Yutaka Matsuo
Douglas Eck
Aleksandra Faust
LM&Ro, LLMAG
575
315
0
24 Jul 2023
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding
Interspeech (Interspeech), 2023
Suyoun Kim
Akshat Shrivastava
Duc Le
Ju Lin
Ozlem Kalinli
M. Seltzer
AuLLM
206
3
0
22 Jul 2023
Incorporating Human Translator Style into English-Turkish Literary Machine Translation
European Association for Machine Translation Conferences/Workshops (EAMT), 2023
Zeynep Yirmibeşoğlu
Olgun Dursun
Harun Dalli
Mehmet Şahin
Ena Hodzik
Sabri Gürses
Tunga Güngör
164
0
0
21 Jul 2023
Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information
European Signal Processing Conference (EUSIPCO), 2023
Dejan Porjazovski
Tamás Grósz
M. Kurimo
155
1
0
21 Jul 2023
Prompting Large Language Models with Speech Recognition Abilities
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yassir Fathullah
Chunyang Wu
Egor Lakomkin
Junteng Jia
Yuan Shangguan
...
Wenhan Xiong
Jay Mahadeokar
Ozlem Kalinli
Christian Fuegen
M. Seltzer
AuLLM
236
190
0
21 Jul 2023
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models
Michael Gunther
Louis Milliken
Jonathan Geuter
Georgios Mastrapas
Bo Wang
Han Xiao
RALM
329
44
0
20 Jul 2023
Gradient Sparsification For Masked Fine-Tuning of Transformers
IEEE International Joint Conference on Neural Network (IJCNN), 2023
J. O'Neill
Sourav Dutta
163
1
0
19 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH, ALM
8.2K
15,302
0
18 Jul 2023
Gloss Attention for Gloss-free Sign Language Translation
Computer Vision and Pattern Recognition (CVPR), 2023
Aoxiong Yin
Tianyun Zhong
Lilian H. Y. Tang
Weike Jin
Tao Jin
Zhou Zhao
SLR
212
61
0
14 Jul 2023
Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
Interspeech (Interspeech), 2023
Hengguan Huang
Jagadeesh Balam
Boris Ginsburg
181
6
0
13 Jul 2023
Copy Is All You Need
International Conference on Learning Representations (ICLR), 2023
Tian Lan
Deng Cai
Yan Wang
Heyan Huang
Xian-Ling Mao
244
32
0
13 Jul 2023
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models
Neural Information Processing Systems (NeurIPS), 2023
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
427
58
0
12 Jul 2023
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution
Neural Information Processing Systems (NeurIPS), 2023
Mostafa Dehghani
Basil Mustafa
Josip Djolonga
Jonathan Heek
Matthias Minderer
...
Avital Oliver
Piotr Padlewski
A. Gritsenko
Mario Lučić
N. Houlsby
ViT
385
186
0
12 Jul 2023
PolyLM: An Open Source Polyglot Large Language Model
Xiangpeng Wei
Hao-Ran Wei
Huan Lin
Tianhao Li
Pei Zhang
...
Yu Bowen
Dayiheng Liu
Baosong Yang
Fei Huang
Jun Xie
LRM
235
70
0
12 Jul 2023
Large Language Models as General Pattern Machines
Conference on Robot Learning (CoRL), 2023
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
308
256
0
10 Jul 2023
Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing
Transactions of the Association for Computational Linguistics (TACL), 2023
Tom Sherborne
Tom Hosking
Mirella Lapata
OT
271
6
0
09 Jul 2023
On decoder-only architecture for speech-to-text and large language model integration
Automatic Speech Recognition & Understanding (ASRU), 2023
Jian Wu
Yashesh Gaur
Zhuo Chen
Long Zhou
Yilun Zhu
...
Jinyu Li
Shujie Liu
Bo Ren
Linquan Liu
Yu-Huan Wu
AuLLM
534
186
0
08 Jul 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments
Automatic Speech Recognition & Understanding (ASRU), 2023
Sara Papi
Peidong Wang
Junkun Chen
Jian Xue
Jinyu Li
Yashesh Gaur
333
8
0
07 Jul 2023
Vision Language Transformers: A Survey
Clayton Fields
C. Kennington
VLM
182
7
0
06 Jul 2023
Focused Transformer: Contrastive Training for Context Scaling
Neural Information Processing Systems (NeurIPS), 2023
Szymon Tworkowski
Konrad Staniszewski
Mikolaj Pacek
Yuhuai Wu
Henryk Michalewski
Piotr Miłoś
235
165
0
06 Jul 2023
Improving Language Plasticity via Pretraining with Active Forgetting
Neural Information Processing Systems (NeurIPS), 2023
Yihong Chen
Kelly Marchisio
Roberta Raileanu
David Ifeoluwa Adelani
Pontus Stenetorp
Sebastian Riedel
Mikel Artetxe
KELM, AI4CE, CLL
431
39
0
03 Jul 2023
Challenges in Domain-Specific Abstractive Summarization and How to Overcome them
International Conference on Agents and Artificial Intelligence (ICAART), 2023
Anum Afzal
Juraj Vladika
Daniel Braun
Florian Matthes
HILM
183
15
0
03 Jul 2023