Kyoto University Text Corpus Version 4.0

This is a text corpus that is manually annotated with various linguistic information. It consists of approximately 40,000 sentences from Mainichi newspaper in 1995 with morphological and syntactic annotations. Out of these sentences, 5,000 sentences are annotated with predicate-argument structures including zero anaphora and coreferences.