Pre-Trained Models: Past, Present and FutureAI Open (AO), 2021 Xu Han Zhengyan Zhang Ning Ding Yuxian Gu Xiao Liu ...Jie Tang Ji-Rong Wen Jinhui Yuan Wayne Xin Zhao Jun Zhu |
A Primer in BERTology: What we know about how BERT worksTransactions of the Association for Computational Linguistics (TACL), 2020 |