Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.01513
Cited By
CharBERT: Character-aware Pre-trained Language Model
3 November 2020
Wentao Ma
Yiming Cui
Chenglei Si
Ting Liu
Shijin Wang
Guoping Hu
Re-assign community
ArXiv (abs)
PDF
HTML
Github (121★)
Papers citing
"CharBERT: Character-aware Pre-trained Language Model"
50 / 59 papers shown
Using External knowledge to Enhanced PLM for Semantic Matching
International Conference on Intelligent Computing (ICIC), 2025
Min Li
Chun Yuan
297
0
0
10 May 2025
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications
M. Bommarito
Daniel Martin Katz
Jillian Bommarito
210
6
0
21 Mar 2025
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
European Conference on Artificial Intelligence (ECAI), 2024
Bo Li
Di Liang
Zixin Zhang
288
11
0
10 Dec 2024
TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models
International Workshop on Information Forensics and Security (WIFS), 2024
Matheus Simão
Fabiano Prado
Omar Abdul Wahab
Anderson Avila
130
2
0
11 Nov 2024
From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Zébulon Goriely
Richard Diehl Martinez
Andrew Caines
Lisa Beinborn
P. Buttery
CLL
321
8
0
30 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
International Conference on Learning Representations (ICLR), 2024
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
447
20
0
28 Oct 2024
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
Nan Xu
Xuezhe Ma
LRM
425
5
0
18 Oct 2024
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shuhao Guan
Derek Greene
366
11
0
05 Aug 2024
Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions
Zhiwen You
Haejin Lee
Shubhanshu Mishra
Sullam Jeoung
Apratim Mishra
Jinseok Kim
Jana Diesner
248
19
0
07 Jul 2024
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
Dongyang Li
Taolin Zhang
Longtao Huang
Chengyu Wang
Xiaofeng He
Hui Xue
KELM
OffRL
216
0
0
24 Jun 2024
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
722
142
0
08 May 2024
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering
IEEE International Conference on Multimedia and Expo (ICME), 2024
Zhixuan Shen
Haonan Luo
Sijia Li
Tianrui Li
333
0
0
14 Mar 2024
Knowledge of Pretrained Language Models on Surface Information of Tokens
Tatsuya Hiraoka
Naoaki Okazaki
283
6
0
15 Feb 2024
MambaByte: Token-free Selective State Space Model
Junxiong Wang
Tushaar Gangavarapu
Jing Nathan Yan
Alexander M. Rush
Mamba
414
61
0
24 Jan 2024
TransURL: Improving malicious URL detection with multi-layer Transformer encoding and multi-scale pyramid features
Ruitong Liu
Yanbin Wang
Yifan Jia
Peiyue Li
Zhan Qin
Wenrui Ma
Fan Zhang
AI4TS
ViT
205
0
0
01 Dec 2023
Learning Mutually Informed Representations for Characters and Subwords
Yilin Wang
Xinyi Hu
Matthew R. Gormley
228
0
0
14 Nov 2023
Text Rendering Strategies for Pixel Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jonas F. Lotz
Elizabeth Salesky
Phillip Rust
Desmond Elliott
VLM
404
19
0
01 Nov 2023
Optimized Tokenization for Transcribed Error Correction
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Tomer Wullach
Shlomo E. Chazan
209
0
0
16 Oct 2023
Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction
Yung-Hsin Chen
Yuli Zhou
202
4
0
29 Aug 2023
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
J. Wu
Dit-Yan Yeung
SILM
349
1
0
04 Jul 2023
People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Vít Novotný
Kristýna Luger
Michal Štefánik
Tereza Vrabcová
Ales Horak
216
1
0
26 May 2023
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Li Sun
F. Luisier
Kayhan Batmanghelich
D. Florêncio
Changrong Zhang
VLM
222
7
0
23 May 2023
IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining
Chihaya Matsuhira
Marc A. Kastner
Takahiro Komamizu
Takatsugu Hirayama
Keisuke Doman
Yasutomo Kawanishi
Ichiro Ide
220
7
0
06 Mar 2023
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
232
0
0
27 Feb 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
298
1
0
14 Feb 2023
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Nathan Godey
Roman Castagné
Eric Villemonte de la Clergerie
Benoît Sagot
170
3
0
14 Dec 2022
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
227
1
0
23 Nov 2022
CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yinpei Dai
Wanwei He
Bowen Li
Yuchuan Wu
Zhen Cao
Zhongqi An
Jian Sun
Yongbin Li
ELM
ALM
329
13
0
21 Nov 2022
Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing
Yibo Wang
Congying Xia
Guan Wang
Philip Yu
192
6
0
04 Nov 2022
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
B. Bhavya
Jinjun Xiong
Chengxiang Zhai
LRM
188
50
0
09 Oct 2022
MockingBERT: A Method for Retroactively Adding Resilience to NLP Models
International Conference on Computational Linguistics (COLING), 2022
Jan Jezabek
A. Singh
SILM
KELM
130
0
0
21 Aug 2022
Language Modelling with Pixels
International Conference on Learning Representations (ICLR), 2022
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
391
60
0
14 Jul 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
377
82
0
21 Jun 2022
Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold
Findings (Findings), 2022
Sebastian Ruder
Ivan Vulić
Anders Søgaard
198
38
0
20 Jun 2022
Local Byte Fusion for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
564
7
0
23 May 2022
Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding
Abbas Ghaddar
Yimeng Wu
Sunyam Bagga
Ahmad Rashid
Khalil Bibi
...
Zhefeng Wang
Baoxing Huai
Xin Jiang
Qun Liu
Philippe Langlais
202
9
0
21 May 2022
Down and Across: Introducing Crossword-Solving as a New NLP Benchmark
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Saurabh Kulshreshtha
Olga Kovaleva
Namrata Shivagunde
Anna Rumshisky
ELM
LRM
390
5
0
20 May 2022
Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Mark Chu
Bhargav Srinivasa Desikan
E. Nadler
Ruggerio L. Sardo
Elise Darragh-Ford
Douglas Guilbeault
226
2
0
15 Mar 2022
Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
272
19
0
15 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
418
34
0
03 Mar 2022
Artificial Intelligence for the Metaverse: A Survey
Engineering applications of artificial intelligence (EAAI), 2022
Thien Huynh-The
Quoc-Viet Pham
Xuan-Qui Pham
Thanh Thi Nguyen
Zhu Han
Dong-Seong Kim
453
505
0
15 Feb 2022
An Assessment of the Impact of OCR Noise on Language Models
International Conference on Agents and Artificial Intelligence (ICAART), 2022
Konstantin Todorov
Giovanni Colavizza
365
10
0
26 Jan 2022
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
327
202
0
20 Dec 2021
Using Distributional Principles for the Semantic Study of Contextual Language Models
Olivier Ferret
149
1
0
23 Nov 2021
Character-level HyperNetworks for Hate Speech Detection
Expert systems with applications (ESWA), 2021
Tomer Wullach
A. Adler
Einat Minkov
193
19
0
11 Nov 2021
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
Arij Riabi
Benoît Sagot
Djamé Seddah
284
16
0
26 Oct 2021
BERT Cannot Align Characters
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
147
1
0
20 Sep 2021
Integrating Approaches to Word Representation
Yuval Pinter
NAI
251
5
0
10 Sep 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
328
319
0
12 Aug 2021
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Kun Zhang
Guangyi Lv
Le Wu
Enhong Chen
Qi Liu
Meng Wang
247
8
0
06 Aug 2021
1
2
Next
Page 1 of 2