Is Japanese CCGBank empirically correct? A case study of passive and causative constructions

International Workshop on Treebanks and Linguistic Theories (TLT), 2023

28 February 2023

Abstract

The Japanese CCGBank serves as training and evaluation data for developing Japanese CCG parsers. However, since it is automatically generated from the Kyoto Corpus, a dependency treebank, its linguistic validity still needs to be sufficiently verified. In this paper, we focus on the analysis of passive/causative constructions in the Japanese CCGBank and show that, together with the compositional semantics of ccg2lambda, a semantic parsing system, it yields empirically wrong predictions for the nested construction of passives and causatives.

View on arXiv

Comments on this paper