Kyoto University Text Corpus Version 4.0

This text corpus is manually annotated with several kinds of linguistic information. It consists of approximately 40,000 sentences from the Mainichi newspaper published in 1995, annotated with morphological and syntactic information. Of these, 5,000 sentences are additionally annotated with predicate-argument structures, including zero anaphora and coreference.

Download

Reference

