A Survey of Knowledge Enhanced Pre-trained Models
Pre-trained models learn informative representations from large-scale training data through self-supervised or supervised learning, and have achieved promising performance in natural language processing (NLP), computer vision (CV), and cross-modal fields after fine-tuning. These models, however, suffer from poor robustness and a lack of interpretability. Pre-trained models with knowledge injection, which we call knowledge enhanced pre-trained models (KEPTMs), possess deeper understanding and logical reasoning capabilities and introduce interpretability. In this survey, we provide a comprehensive overview of KEPTMs in NLP and CV. We first introduce the progress of pre-trained models and knowledge representation learning. Then we systematically categorize existing KEPTMs from three different perspectives. Finally, we outline some potential directions of KEPTMs for future research.