CharBERT: Character-aware Pre-trained Language Model

3 November 2020

ArXiv (abs)PDF HTML Github (121★)

Papers citing "CharBERT: Character-aware Pre-trained Language Model"

50 / 59 papers shown

Using External knowledge to Enhanced PLM for Semantic MatchingInternational Conference on Intelligent Computing (ICIC), 2025

Min Li

Chun Yuan

297

10 May 2025

KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications

M. Bommarito

Daniel Martin Katz

Jillian Bommarito

210

21 Mar 2025

Comateformer: Combined Attention Transformer for Semantic Sentence MatchingEuropean Conference on Artificial Intelligence (ECAI), 2024

Bo Li

Di Liang

Zixin Zhang

288

10 Dec 2024

TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language ModelsInternational Workshop on Information Forensics and Security (WIFS), 2024

130

11 Nov 2024

From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes

Zébulon Goriely

Richard Diehl Martinez

321

30 Oct 2024

MrT5: Dynamic Token Merging for Efficient Byte-level Language ModelsInternational Conference on Learning Representations (ICLR), 2024

Julie Kallini

Shikhar Murty

Christopher D. Manning

Christopher Potts

Róbert Csordás

447

28 Oct 2024

LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems

Nan Xu

Xuezhe Ma

LRM

425

18 Oct 2024

Advancing Post-OCR Correction: A Comparative Study of Synthetic DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Shuhao Guan

Derek Greene

366

05 Aug 2024

Beyond Binary Gender Labels: Revealing Gender Biases in LLMs through Gender-Neutral Name Predictions

248

07 Jul 2024

KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning

Hui Xue

216

24 Jun 2024

Large Language Models for Cyber Security: A Systematic Literature Review

722

142

08 May 2024

Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question AnsweringIEEE International Conference on Multimedia and Expo (ICME), 2024

333

14 Mar 2024

Knowledge of Pretrained Language Models on Surface Information of Tokens

Tatsuya Hiraoka

Naoaki Okazaki

283

15 Feb 2024

MambaByte: Token-free Selective State Space Model

414

24 Jan 2024

TransURL: Improving malicious URL detection with multi-layer Transformer encoding and multi-scale pyramid features

Yanbin Wang

205

01 Dec 2023

Learning Mutually Informed Representations for Characters and Subwords

Yilin Wang

Xinyi Hu

Matthew R. Gormley

228

14 Nov 2023

Text Rendering Strategies for Pixel Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

404

01 Nov 2023

Optimized Tokenization for Transcribed Error CorrectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Tomer Wullach

Shlomo E. Chazan

209

16 Oct 2023

Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction

Yung-Hsin Chen

Yuli Zhou

202

29 Aug 2023

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

J. Wu

Dit-Yan Yeung

SILM

349

04 Jul 2023

People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval TextsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

216

26 May 2023

From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Kayhan Batmanghelich

222

23 May 2023

IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining

220

06 Mar 2023

Elementwise Language Representation

Du-Yeong Kim

Jeeeun Kim

232

27 Feb 2023

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input NoisesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Chenglei Si

Zhengyan Zhang

Yingfa Chen

Xiaozhi Wang

Zhiyuan Liu

Maosong Sun

AAML

298

14 Feb 2023

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling

Nathan Godey

Roman Castagné

Eric Villemonte de la Clergerie

Benoît Sagot

170

14 Dec 2022

Word-Level Representation From Bytes For Language Modeling

Chul Lee

Qipeng Guo

Xipeng Qiu

227

23 Nov 2022

CGoDial: A Large-Scale Benchmark for Chinese Goal-oriented Dialog EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

329

21 Nov 2022

Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing

Yibo Wang

Congying Xia

Guan Wang

Philip Yu

192

04 Nov 2022

Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT

B. Bhavya

Jinjun Xiong

Chengxiang Zhai

LRM

188

09 Oct 2022

MockingBERT: A Method for Retroactively Adding Resilience to NLP ModelsInternational Conference on Computational Linguistics (COLING), 2022

Jan Jezabek

A. Singh

SILM KELM

130

21 Aug 2022

Language Modelling with PixelsInternational Conference on Learning Representations (ICLR), 2022

391

14 Jul 2022

Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation

Guido Zuccon

377

21 Jun 2022

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research ManifoldFindings (Findings), 2022

Sebastian Ruder

Ivan Vulić

Anders Søgaard

198

20 Jun 2022

Local Byte Fusion for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Makesh Narsimhan Sreedhar

Xiangpeng Wan

Yu-Jie Cheng

Junjie Hu

564

23 May 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding

...

Xin Jiang

Qun Liu

Philippe Langlais

202

21 May 2022

Down and Across: Introducing Crossword-Solving as a New NLP BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

390

20 May 2022

Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Mark Chu

Bhargav Srinivasa Desikan

226

15 Mar 2022

Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little CostAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Lihu Chen

Gaël Varoquaux

Fabian M. Suchanek

272

15 Mar 2022

Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Vaidehi Patil

Partha P. Talukdar

Sunita Sarawagi

418

03 Mar 2022

Artificial Intelligence for the Metaverse: A SurveyEngineering applications of artificial intelligence (EAAI), 2022

453

505

15 Feb 2022

An Assessment of the Impact of OCR Noise on Language ModelsInternational Conference on Agents and Artificial Intelligence (ICAART), 2022

Konstantin Todorov

Giovanni Colavizza

365

26 Jan 2022

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

...

327

202

20 Dec 2021

Using Distributional Principles for the Semantic Study of Contextual Language Models

Olivier Ferret

149

23 Nov 2021

Character-level HyperNetworks for Hate Speech DetectionExpert systems with applications (ESWA), 2021

Tomer Wullach

A. Adler

Einat Minkov

193

11 Nov 2021

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?

Arij Riabi

Benoît Sagot

Djamé Seddah

284

26 Oct 2021

BERT Cannot Align Characters

Antonis Maronikolakis

Philipp Dufter

Hinrich Schütze

147

20 Sep 2021

Integrating Approaches to Word Representation

Yuval Pinter

NAI

251

10 Sep 2021

AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing

Katikapalli Subramanyam Kalyan

A. Rajasekharan

S. Sangeetha

VLM LM&MA

328

319

12 Aug 2021

LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic MatchingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

Kun Zhang

Guangyi Lv

Le Wu

Enhong Chen

Qi Liu

Meng Wang

247

06 Aug 2021