Is Japanese CCGBank empirically correct? A case study of passive and
causative constructions
International Workshop on Treebanks and Linguistic Theories (TLT), 2023
Abstract
The Japanese CCGBank serves as training and evaluation data for developing Japanese CCG parsers. However, since it is automatically generated from the Kyoto Corpus, a dependency treebank, its linguistic validity still needs to be sufficiently verified. In this paper, we focus on the analysis of passive/causative constructions in the Japanese CCGBank and show that, together with the compositional semantics of ccg2lambda, a semantic parsing system, it yields empirically wrong predictions for the nested construction of passives and causatives.
View on arXivComments on this paper
