

Contents

The spoken content of a speech corpus is the second major feature that determines how the resource can be used. It is not fully independent of other specifications, such as the speaking style. There are four main approaches to defining the spoken content of a corpus: by vocabulary, by domain, by task, or by phonological distribution. In some cases these approaches are combined.




BITS Projekt-Account 2004-06-01