Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings
Abstract
We present Knesset-DictaBERT, a large Hebrew language model fine-tuned on the Knesset Corpus, which comprises Israeli parliamentary proceedings. The model is based on the DictaBERT architecture and, as measured on the masked language modeling (MLM) task, shows significant improvements in modeling parliamentary language. We provide a detailed evaluation of the model's performance, reporting improvements in both perplexity and accuracy over the baseline DictaBERT model.
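To make the two reported metrics concrete, the following is a minimal sketch of how perplexity and top-1 accuracy are commonly computed over masked positions in an MLM evaluation. It is not the authors' evaluation code: the function name `mlm_metrics`, the input layout (a gold token paired with a predicted probability distribution), and the toy tokens are all hypothetical, and the abstract does not state the exact protocol used.

```python
import math


def mlm_metrics(masked_predictions):
    """Compute perplexity and top-1 accuracy over masked positions.

    masked_predictions: list of (gold_token, {token: probability}) pairs,
    one per masked position. This layout is an assumption for illustration.
    """
    if not masked_predictions:
        raise ValueError("need at least one masked position")

    total_nll = 0.0
    correct = 0
    for gold, dist in masked_predictions:
        # Probability the model assigned to the gold token (floored to avoid log(0)).
        p_gold = max(dist.get(gold, 0.0), 1e-12)
        total_nll += -math.log(p_gold)
        # Top-1 prediction is the token with the highest probability.
        top1 = max(dist, key=dist.get)
        correct += int(top1 == gold)

    n = len(masked_predictions)
    return {
        # Perplexity is the exponential of the mean negative log-likelihood.
        "perplexity": math.exp(total_nll / n),
        "accuracy": correct / n,
    }


# Toy example with placeholder tokens (not real model output).
toy = [
    ("a", {"a": 0.5, "b": 0.3, "c": 0.2}),    # gold is the top-1 prediction
    ("c", {"a": 0.5, "b": 0.25, "c": 0.25}),  # gold is not the top-1 prediction
]
metrics = mlm_metrics(toy)
```

Under this definition, lower perplexity means the model assigns higher probability to the held-out tokens, and higher accuracy means its top prediction matches the held-out token more often, which is the direction of improvement the abstract describes for the fine-tuned model relative to the baseline.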
