SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing

19 August 2018
Taku Kudo
John Richardson
ArXiv (abs) | PDF | HTML | Github (10925★)

Papers citing "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"

50 / 2,064 papers shown
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
685
1,157
0
30 Mar 2023
A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision
Lucas Beyer
Bo Wan
Gagan Madan
Filip Pavetić
Andreas Steiner
...
Emanuele Bugliarello
Tianlin Li
Qihang Yu
Liang-Chieh Chen
Xiaohua Zhai
248
9
0
30 Mar 2023
TreePiece: Faster Semantic Parsing via Tree Tokenization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sida I. Wang
Akshat Shrivastava
S. Livshits
131
5
0
30 Mar 2023
When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sara Papi
Marco Gaido
Andrea Pilzer
Matteo Negri
494
16
0
28 Mar 2023
Sigmoid Loss for Language Image Pre-Training
IEEE International Conference on Computer Vision (ICCV), 2023
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP, VLM
1.8K
2,253
0
27 Mar 2023
Cross-utterance ASR Rescoring with Graph-based Label Propagation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Srinath Tankasala
Long Chen
A. Stolcke
A. Raju
Qianli Deng
Chander Chandak
Aparna Khare
Roland Maas
Venkatesh Ravichandran
117
2
0
27 Mar 2023
An Information Extraction Study: Take In Mind the Tokenization!
Christos Theodoropoulos
Marie-Francine Moens
128
8
0
27 Mar 2023
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization
Bashar Al-Rfooh
Gheith A. Abandah
Rami Al-Rfou
149
8
0
25 Mar 2023
Neuro-Symbolic Execution of Generic Source Code
Yaojie Hu
Jin Tian
NAI
222
0
0
23 Mar 2023
SwissBERT: The Multilingual Language Model for Switzerland
Swiss Text Analytics Conference (SwissText), 2023
Jannis Vamvas
Johannes Graen
Rico Sennrich
269
13
0
23 Mar 2023
A Gold Standard Dataset for the Reviewer Assignment Problem
Ivan Stelmakh
John Wieting
Sarina Xi
Graham Neubig
Nihar B. Shah
294
20
0
23 Mar 2023
JaCoText: A Pretrained Model for Java Code-Text Generation
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
Walid Dahhane
E. Ettifouri
131
4
0
22 Mar 2023
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition
Xiaoyu Yang
Qiujia Li
Chuxu Zhang
P. Woodland
206
11
0
20 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
124
3
0
20 Mar 2023
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuguang Yang
Yu Pan
Jingjing Yin
Jiangyu Han
Lei Ma
Heng Lu
127
14
0
15 Mar 2023
Learning Cross-lingual Visual Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Andreas Zinonos
A. Haliassos
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
163
10
0
14 Mar 2023
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Biao Fu
Minpeng Liao
Kai Fan
Zhongqiang Huang
Boxing Chen
Yidong Chen
Xiaodong Shi
169
8
0
14 Mar 2023
Scaling Vision-Language Models with Sparse Mixture of Experts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sheng Shen
Z. Yao
Chunyuan Li
Trevor Darrell
Kurt Keutzer
Yuxiong He
VLM, MoE
329
98
0
13 Mar 2023
Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation Dataset
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Arun Tejasvi Chaganty
Megan Leszczynski
Shu Zhen Zhang
Ravi Ganti
K. Balog
Filip Radlinski
410
13
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A.B. Siddique
228
7
0
12 Mar 2023
Unsupervised Language agnostic WER Standardization
Satarupa Guha
Rahul Ambavat
Ankur Gupta
Manish Gupta
R. Mehta
76
0
0
09 Mar 2023
Spelling convention sensitivity in neural language models
Findings (Findings), 2023
Elizabeth Nielsen
Christo Kirov
Brian Roark
115
1
0
06 Mar 2023
Exploiting Language Relatedness in Machine Translation Through Domain Adaptation Techniques
Amit Kumar
Rupjyoti Baruah
A. Pratap
Mayank Swarnkar
Anil Kumar Singh
109
1
0
03 Mar 2023
Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
P. Klumpp
Pooja Chitkara
Leda Sari
Prashant Serai
Jilong Wu
Irina-Elena Veliche
Rongqing Huang
Qing He
145
6
0
01 Mar 2023
How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy
Journal of Artificial Intelligence Research (JAIR), 2023
Natalia Ponomareva
Hussein Hazimeh
Alexey Kurakin
Zheng Xu
Carson E. Denison
H. B. McMahan
Sergei Vassilvitskii
Steve Chien
Abhradeep Thakurta
504
240
0
01 Mar 2023
Are More Layers Beneficial to Graph Transformers?
International Conference on Learning Representations (ICLR), 2023
Haiteng Zhao
Shuming Ma
Dongdong Zhang
Zhi-Hong Deng
Furu Wei
202
17
0
01 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search
Neural Information Processing Systems (NeurIPS), 2023
Angelica Chen
David Dohan
David R. So
VLM, LRM
466
124
0
28 Feb 2023
A Token-Wise Beam Search Algorithm for RNN-T
Automatic Speech Recognition & Understanding (ASRU), 2023
Gil Keren
261
4
0
28 Feb 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Computer Vision and Pattern Recognition (CVPR), 2023
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS, VLM
506
326
0
27 Feb 2023
Language Is Not All You Need: Aligning Perception with Language Models
Neural Information Processing Systems (NeurIPS), 2023
Shaohan Huang
Li Dong
Wenhui Wang
Y. Hao
Saksham Singhal
...
Johan Bjorck
Vishrav Chaudhary
Subhojit Som
Xia Song
Furu Wei
VLM, LRM, MLLM
345
680
0
27 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM, PILM
7.3K
17,868
0
27 Feb 2023
MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yoohwan Kwon
Soo-Whan Chung
MoE
188
28
0
27 Feb 2023
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video
AAAI Conference on Artificial Intelligence (AAAI), 2023
Minsu Kim
Chae Won Kim
Y. Ro
CVBM, DiffM
144
4
0
27 Feb 2023
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
205
0
0
27 Feb 2023
Improving Massively Multilingual ASR With Auxiliary CTC Objectives
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
William Chen
Brian Yan
Jiatong Shi
Yifan Peng
Soumi Maiti
Shinji Watanabe
263
49
0
24 Feb 2023
Cross-Lingual Transfer of Cognitive Processing Complexity
Findings (Findings), 2023
C. Pouw
Nora Hollenstein
Lisa Beinborn
275
3
0
24 Feb 2023
Impact of Subword Pooling Strategy on Cross-lingual Event Detection
Shantanu Agarwal
Steven Fincke
Chris Jenkins
Scott Miller
Elizabeth Boschee
232
2
0
22 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
326
15
0
22 Feb 2023
Topic-switch adapted Japanese Dialogue System based on PLATO-2
Donghuo Zeng
Jianming Wu
Yanan Wang
Kazunori Matsumoto
Gen Hattori
K. Ikeda
178
0
0
22 Feb 2023
Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning
Weichen Li
R. Devidze
Sophie Fellenz
317
5
0
21 Feb 2023
Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
International Conference on Learning Representations (ICLR), 2023
Bobby He
James Martens
Guodong Zhang
Aleksandar Botev
Andy Brock
Samuel L. Smith
Yee Whye Teh
231
40
0
20 Feb 2023
Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
M. Moradshahi
Sina J. Semnani
M. Lam
207
9
0
18 Feb 2023
RETVec: Resilient and Efficient Text Vectorizer
Neural Information Processing Systems (NeurIPS), 2023
Elie Bursztein
Marina Zhang
Owen Vallis
Xinyu Jia
Alexey Kurakin
VLM
152
6
0
18 Feb 2023
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
154
1
0
17 Feb 2023
Lip-to-Speech Synthesis in the Wild with Multi-task Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Minsu Kim
Joanna Hong
Y. Ro
219
28
0
17 Feb 2023
E2E Spoken Entity Extraction for Virtual Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Karan Singla
Yeon-Jun Kim
S. Bangalore
454
1
0
16 Feb 2023
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Abteen Ebrahimi
Arya D. McCarthy
Arturo Oncevay
Luis Chiruzzo
J. Ortega
Gustavo A. Giménez-Lugo
Rolando A. Coto Solano
Katharina Kann
186
7
0
15 Feb 2023
Scaling Vision Transformers to 22 Billion Parameters
International Conference on Machine Learning (ICML), 2023
Mostafa Dehghani
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Jonathan Heek
...
Mario Lučić
Xiaohua Zhai
Daniel Keysers
Jeremiah Harmsen
N. Houlsby
MLLM
407
774
0
10 Feb 2023
PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction
Interspeech (Interspeech), 2023
Zi Xuan Zhang
Zhehui Wang
R. Kamma
S. Eswaran
Narayanan Sadagopan
KELM
137
7
0
10 Feb 2023
Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Findings (Findings), 2023
Haoran Xu
Jean Maillard
Vedanuj Goswami
LRM
197
4
0
10 Feb 2023