Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1808.06226
Cited By
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
19 August 2018
Taku Kudo
John Richardson
Re-assign community
ArXiv (abs)
PDF
HTML
Github (10925★)
Papers citing
"SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing"
50 / 2,064 papers shown
How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena
Marco Gaido
Sara Papi
Matteo Negri
L. Bentivogli
231
1
0
20 Feb 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
231
2
0
20 Feb 2024
Emergent Word Order Universals from Cognitively-Motivated Language Models
Tatsuki Kuribayashi
Ryo Ueda
Ryosuke Yoshida
Yohei Oseki
Ted Briscoe
Timothy Baldwin
306
5
0
19 Feb 2024
Pushing the Limits of Zero-shot End-to-End Speech Translation
Ioannis Tsiamas
Gerard I. Gállego
José A. R. Fonollosa
Marta R. Costa-jussá
268
15
0
16 Feb 2024
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
Yanis Labrak
Adrien Bazoge
Emmanuel Morin
P. Gourraud
Mickael Rouvier
Richard Dufour
487
367
0
15 Feb 2024
Fast Vocabulary Transfer for Language Model Compression
Leonidas Gee
Andrea Zugarini
Leonardo Rigutini
Paolo Torroni
183
41
0
15 Feb 2024
Multi-word Tokenization for Sequence Compression
Leonidas Gee
Leonardo Rigutini
Marco Ernandes
Andrea Zugarini
203
14
0
15 Feb 2024
Knowledge of Pretrained Language Models on Surface Information of Tokens
Tatsuya Hiraoka
Naoaki Okazaki
217
5
0
15 Feb 2024
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models
Ruchao Fan
Natarajan Balaji Shankar
Abeer Alwan
245
2
0
14 Feb 2024
Self-consistent context aware conformer transducer for speech recognition
Konstantin Kolokolov
Pavel Pekichev
Karthik Raghunathan
171
0
0
09 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
257
1
0
08 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Agrim Gupta
N. Heess
Martin Riedmiller
OffRL
LRM
216
33
0
08 Feb 2024
Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation
Luca Beurer-Kellner
Marc Fischer
Martin Vechev
342
77
0
07 Feb 2024
Lens: A Knowledge-Guided Foundation Model for Network Traffic
Qineng Wang
Chen Qian
Xiaochang Li
Ziyu Yao
Huajie Shao
Ziyu Yao
Bo Ji
Long Cheng
Gang Zhou
Huajie Shao
166
7
0
06 Feb 2024
Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens
Nay San
Georgios Paraskevopoulos
Aryaman Arora
Xiluo He
Prabhjot Kaur
Oliver Adams
Dan Jurafsky
174
14
0
03 Feb 2024
Towards Sustainable Workplace Mental Health: A Novel Approach to Early Intervention and Support
David W. Vinson
Mihael Arcan
Paul-David Niland
Fionn Delahunty
AI4MH
137
3
0
02 Feb 2024
Sequence Shortening for Context-Aware Machine Translation
Paweł Mąka
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
167
3
0
02 Feb 2024
IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition
Zi-Jian Leng
Amitrajit Bhattacharjee
Hrudhai Rajasekhar
Lizhe Zhang
Elizabeth Bruda
Hiroko H. Dodge
Thomas Plötz
VLM
260
44
0
01 Feb 2024
Getting the most out of your tokenizer for pre-training and domain adaptation
Gautier Dagan
Gabriele Synnaeve
Baptiste Rozière
353
57
0
01 Feb 2024
Disentangling the Roles of Target-Side Transfer and Regularization in Multilingual Machine Translation
Yan Meng
Christof Monz
LRM
215
2
0
01 Feb 2024
Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary
Takashi Morita
451
8
0
31 Jan 2024
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Wei Zhang
Miaoxin Cai
Tong Zhang
Zhuang Yin
Xuerui Mao
433
214
0
30 Jan 2024
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
Takaaki Saeki
Soumi Maiti
Shinnosuke Takamichi
Shinji Watanabe
Hiroshi Saruwatari
221
54
0
30 Jan 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
268
22
0
30 Jan 2024
Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition
Ahnaf Mozib Samin
221
1
0
28 Jan 2024
Modular Adaptation of Multilingual Encoders to Written Swiss German Dialect
Jannis Vamvas
Noëmi Aepli
Rico Sennrich
260
1
0
25 Jan 2024
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Gokcce Uludougan
Zeynep Yirmibecsouglu Balal
Furkan Akkurt
Melikcsah Turker
Onur Gungor
S. Uskudarli
214
20
0
25 Jan 2024
MambaByte: Token-free Selective State Space Model
Junxiong Wang
Tushaar Gangavarapu
Jing Nathan Yan
Alexander M. Rush
Mamba
311
54
0
24 Jan 2024
MaLA-500: Massive Language Adaptation of Large Language Models
Peiqin Lin
Shaoxiong Ji
Jörg Tiedemann
Marcely Zanon Boito
Hinrich Schütze
ELM
405
26
0
24 Jan 2024
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Michael Hentschel
Yuta Nishikawa
Tatsuya Komatsu
Yusuke Fujita
277
5
0
22 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
320
6
0
21 Jan 2024
Instructional Fingerprinting of Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Lyne Tchapmi
Fei Wang
Mingyu Derek Ma
Pang Wei Koh
Chaowei Xiao
Muhao Chen
WaLM
278
57
0
21 Jan 2024
Orion-14B: Open-source Multilingual Large Language Models
Du Chen
Yi Huang
Xiaopu Li
Yongqiang Li
Yongqiang Liu
Haihui Pan
Leichao Xu
Dacheng Zhang
Zhipeng Zhang
Kun Han
139
4
0
20 Jan 2024
Improving fine-grained understanding in image-text pre-training
Ioana Bica
Anastasija Ilić
Matthias Bauer
Goker Erdogan
Matko Bovsnjak
...
A. Gritsenko
Matthias Minderer
Charles Blundell
Razvan Pascanu
Jovana Mitrović
VLM
220
45
0
18 Jan 2024
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation
Minsu Kim
Jeong Hun Yeo
Se Jin Park
J. Choi
Y. Ro
290
8
0
18 Jan 2024
Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text
USENIX Security Symposium (USENIX Security), 2024
Mazal Bethany
Brandon Wherry
Emet Bethany
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
DeLMO
223
13
0
17 Jan 2024
A Generative Adversarial Attack for Multilingual Text Classifiers
Tom Roth
Inigo Jauregi Unanue
A. Abuadbba
Massimo Piccardi
AAML
122
0
0
16 Jan 2024
Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions
Yachao Li
Junhui Li
Jing Jiang
Min Zhang
301
12
0
16 Jan 2024
Cross-Attention Watermarking of Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Folco Bertini Baldassini
H. Nguyen
Ching-Chung Chang
Isao Echizen
WaLM
140
4
0
12 Jan 2024
Distilling Vision-Language Models on Millions of Videos
Computer Vision and Pattern Recognition (CVPR), 2024
Yue Zhao
Long Zhao
Xingyi Zhou
Jialin Wu
Chun-Te Chu
...
Hartwig Adam
Ting Liu
Boqing Gong
Philipp Krahenbuhl
Liangzhe Yuan
VLM
279
20
0
11 Jan 2024
A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars
European Conference on Computer Vision (ECCV), 2024
Ronglai Zuo
Fangyun Wei
Zenggui Chen
Brian Mak
Jiaolong Yang
Xin Tong
SLR
328
18
0
09 Jan 2024
Deep Learning in Physical Layer: Review on Data Driven End-to-End Communication Systems and their Enabling Semantic Applications
IEEE Open Journal of the Communications Society (OJ-COMSOC), 2024
Nazmul Islam
Seokjoo Shin
AI4CE
342
16
0
08 Jan 2024
An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks
Science of Computer Programming (SCP), 2024
Chen Yang
Peng Liang
Zinan Ma
223
0
0
08 Jan 2024
RoBERTurk: Adjusting RoBERTa for Turkish
Nuri Tas
113
5
0
07 Jan 2024
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Quan Wang
Yiling Huang
Guanlong Zhao
Evan Clark
Wei Xia
Hank Liao
AuLLM
630
19
0
07 Jan 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai
Xiyang Liao
Alessandro Suglia
Antonio Vergari
MLLM
345
13
0
06 Jan 2024
Cheetah: Natural Language Generation for 517 African Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
348
13
0
02 Jan 2024
An Empirical Study of Scaling Law for OCR
Miao Rang
Zhenni Bi
Chuanjian Liu
Yunhe Wang
Kai Han
430
12
0
29 Dec 2023
SentinelLMs: Encrypted Input Adaptation and Fine-tuning of Language Models for Private and Secure Inference
AAAI Conference on Artificial Intelligence (AAAI), 2023
Abhijit Mishra
Mingda Li
S. Deo
SILM
98
6
0
28 Dec 2023
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices
Xiangxiang Chu
Limeng Qiao
Xinyang Lin
Shuang Xu
Yang Yang
...
Fei Wei
Xinyu Zhang
Bo Zhang
Xiaolin Wei
Chunhua Shen
MLLM
306
70
0
28 Dec 2023
Previous
1
2
3
...
11
12
13
...
40
41
42
Next