1808.06226
Cited By
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
19 August 2018
Taku Kudo
John Richardson
Github (10925★)
Papers citing
"SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"
50 / 2,064 papers shown
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
685
1,157
0
30 Mar 2023
A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision
Lucas Beyer
Bo Wan
Gagan Madan
Filip Pavetić
Andreas Steiner
...
Emanuele Bugliarello
Tianlin Li
Qihang Yu
Liang-Chieh Chen
Xiaohua Zhai
248
9
0
30 Mar 2023
TreePiece: Faster Semantic Parsing via Tree Tokenization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sida I. Wang
Akshat Shrivastava
S. Livshits
131
5
0
30 Mar 2023
When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sara Papi
Marco Gaido
Andrea Pilzer
Matteo Negri
494
16
0
28 Mar 2023
Sigmoid Loss for Language Image Pre-Training
IEEE International Conference on Computer Vision (ICCV), 2023
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
1.8K
2,253
0
27 Mar 2023
Cross-utterance ASR Rescoring with Graph-based Label Propagation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Srinath Tankasala
Long Chen
A. Stolcke
A. Raju
Qianli Deng
Chander Chandak
Aparna Khare
Roland Maas
Venkatesh Ravichandran
117
2
0
27 Mar 2023
An Information Extraction Study: Take In Mind the Tokenization!
Christos Theodoropoulos
Marie-Francine Moens
128
8
0
27 Mar 2023
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization
Bashar Al-Rfooh
Gheith A. Abandah
Rami Al-Rfou
149
8
0
25 Mar 2023
Neuro-Symbolic Execution of Generic Source Code
Yaojie Hu
Jin Tian
NAI
222
0
0
23 Mar 2023
SwissBERT: The Multilingual Language Model for Switzerland
Swiss Text Analytics Conference (SwissText), 2023
Jannis Vamvas
Johannes Graen
Rico Sennrich
269
13
0
23 Mar 2023
A Gold Standard Dataset for the Reviewer Assignment Problem
Ivan Stelmakh
John Wieting
Sarina Xi
Graham Neubig
Nihar B. Shah
294
20
0
23 Mar 2023
JaCoText: A Pretrained Model for Java Code-Text Generation
Jessica Nayeli López Espejel
Mahaman Sanoussi Yahaya Alassan
Walid Dahhane
E. Ettifouri
131
4
0
22 Mar 2023
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition
Xiaoyu Yang
Qiujia Li
Chuxu Zhang
P. Woodland
206
11
0
20 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
124
3
0
20 Mar 2023
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuguang Yang
Yu Pan
Jingjing Yin
Jiangyu Han
Lei Ma
Heng Lu
127
14
0
15 Mar 2023
Learning Cross-lingual Visual Speech Representations
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Andreas Zinonos
A. Haliassos
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
163
10
0
14 Mar 2023
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Biao Fu
Minpeng Liao
Kai Fan
Zhongqiang Huang
Boxing Chen
Yidong Chen
Xiaodong Shi
169
8
0
14 Mar 2023
Scaling Vision-Language Models with Sparse Mixture of Experts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sheng Shen
Z. Yao
Chunyuan Li
Trevor Darrell
Kurt Keutzer
Yuxiong He
VLM
MoE
329
98
0
13 Mar 2023
Beyond Single Items: Exploring User Preferences in Item Sets with the Conversational Playlist Curation Dataset
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Arun Tejasvi Chaganty
Megan Leszczynski
Shu Zhen Zhang
Ravi Ganti
K. Balog
Filip Radlinski
410
13
0
13 Mar 2023
Proactive Prioritization of App Issues via Contrastive Learning
Moghis Fereidouni
A. Mosharrof
Umar Farooq
A.B. Siddique
228
7
0
12 Mar 2023
Unsupervised Language agnostic WER Standardization
Satarupa Guha
Rahul Ambavat
Ankur Gupta
Manish Gupta
R. Mehta
76
0
0
09 Mar 2023
Spelling convention sensitivity in neural language models
Findings (Findings), 2023
Elizabeth Nielsen
Christo Kirov
Brian Roark
115
1
0
06 Mar 2023
Exploiting Language Relatedness in Machine Translation Through Domain Adaptation Techniques
Amit Kumar
Rupjyoti Baruah
A. Pratap
Mayank Swarnkar
Anil Kumar Singh
109
1
0
03 Mar 2023
Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
P. Klumpp
Pooja Chitkara
Leda Sari
Prashant Serai
Jilong Wu
Irina-Elena Veliche
Rongqing Huang
Qing He
145
6
0
01 Mar 2023
How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy
Journal of Artificial Intelligence Research (JAIR), 2023
Natalia Ponomareva
Hussein Hazimeh
Alexey Kurakin
Zheng Xu
Carson E. Denison
H. B. McMahan
Sergei Vassilvitskii
Steve Chien
Abhradeep Thakurta
504
240
0
01 Mar 2023
Are More Layers Beneficial to Graph Transformers?
International Conference on Learning Representations (ICLR), 2023
Haiteng Zhao
Shuming Ma
Dongdong Zhang
Zhi-Hong Deng
Furu Wei
202
17
0
01 Mar 2023
EvoPrompting: Language Models for Code-Level Neural Architecture Search
Neural Information Processing Systems (NeurIPS), 2023
Angelica Chen
David Dohan
David R. So
VLM
LRM
466
124
0
28 Feb 2023
A Token-Wise Beam Search Algorithm for RNN-T
Automatic Speech Recognition & Understanding (ASRU), 2023
Gil Keren
261
4
0
28 Feb 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Computer Vision and Pattern Recognition (CVPR), 2023
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS
VLM
506
326
0
27 Feb 2023
Language Is Not All You Need: Aligning Perception with Language Models
Neural Information Processing Systems (NeurIPS), 2023
Shaohan Huang
Li Dong
Wenhui Wang
Y. Hao
Saksham Singhal
...
Johan Bjorck
Vishrav Chaudhary
Subhojit Som
Xia Song
Furu Wei
VLM
LRM
MLLM
345
680
0
27 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
7.3K
17,868
0
27 Feb 2023
MoLE: Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yoohwan Kwon
Soo-Whan Chung
MoE
188
28
0
27 Feb 2023
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video
AAAI Conference on Artificial Intelligence (AAAI), 2023
Minsu Kim
Chae Won Kim
Y. Ro
CVBM
DiffM
144
4
0
27 Feb 2023
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
205
0
0
27 Feb 2023
Improving Massively Multilingual ASR With Auxiliary CTC Objectives
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
William Chen
Brian Yan
Jiatong Shi
Yifan Peng
Soumi Maiti
Shinji Watanabe
263
49
0
24 Feb 2023
Cross-Lingual Transfer of Cognitive Processing Complexity
Findings (Findings), 2023
C. Pouw
Nora Hollenstein
Lisa Beinborn
275
3
0
24 Feb 2023
Impact of Subword Pooling Strategy on Cross-lingual Event Detection
Shantanu Agarwal
Steven Fincke
Chris Jenkins
Scott Miller
Elizabeth Boschee
232
2
0
22 Feb 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuchen Hu
Chen Chen
Ruizhe Li
Qiu-shi Zhu
Eng Siong Chng
326
15
0
22 Feb 2023
Topic-switch adapted Japanese Dialogue System based on PLATO-2
Donghuo Zeng
Jianming Wu
Yanan Wang
Kazunori Matsumoto
Gen Hattori
K. Ikeda
178
0
0
22 Feb 2023
Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning
Weichen Li
R. Devidze
Sophie Fellenz
317
5
0
21 Feb 2023
Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
International Conference on Learning Representations (ICLR), 2023
Bobby He
James Martens
Guodong Zhang
Aleksandar Botev
Andy Brock
Samuel L. Smith
Yee Whye Teh
231
40
0
20 Feb 2023
Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
M. Moradshahi
Sina J. Semnani
M. Lam
207
9
0
18 Feb 2023
RETVec: Resilient and Efficient Text Vectorizer
Neural Information Processing Systems (NeurIPS), 2023
Elie Bursztein
Marina Zhang
Owen Vallis
Xinyu Jia
Alexey Kurakin
VLM
152
6
0
18 Feb 2023
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
Bertrand Duménieu
Edwin Carlinet
N. Abadie
Joseph Chazalon
154
1
0
17 Feb 2023
Lip-to-Speech Synthesis in the Wild with Multi-task Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Minsu Kim
Joanna Hong
Y. Ro
219
28
0
17 Feb 2023
E2E Spoken Entity Extraction for Virtual Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Karan Singla
Yeon-Jun Kim
S. Bangalore
454
1
0
16 Feb 2023
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Abteen Ebrahimi
Arya D. McCarthy
Arturo Oncevay
Luis Chiruzzo
J. Ortega
Gustavo A. Giménez-Lugo
Rolando A. Coto Solano
Katharina Kann
186
7
0
15 Feb 2023
Scaling Vision Transformers to 22 Billion Parameters
International Conference on Machine Learning (ICML), 2023
Mostafa Dehghani
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Jonathan Heek
...
Mario Lučić
Xiaohua Zhai
Daniel Keysers
Jeremiah Harmsen
N. Houlsby
MLLM
407
774
0
10 Feb 2023
PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction
Interspeech (Interspeech), 2023
Zi Xuan Zhang
Zhehui Wang
R. Kamma
S. Eswaran
Narayanan Sadagopan
KELM
137
7
0
10 Feb 2023
Language-Aware Multilingual Machine Translation with Self-Supervised Learning
Findings (Findings), 2023
Haoran Xu
Jean Maillard
Vedanuj Goswami
LRM
197
4
0
10 Feb 2023
Page 20 of 42