next up previous contents
Next: Meta Data File Formats Up: File Formats Previous: Signal File Formats   Contents


Annotation File Formats

Annotations are symbolic data associated with the recorded signals of the speech corpus. See chapter [*] for a discussion of the different principles, label categories, methods and tools.

The format in which these data are stored has to be included into the corpus specification. Since no widely accepted standard exists and since very often speech corpora contain a new form of annotation that has never been applied before, many speech corpora contain proprietary formats that were defined only for that special occasion. Some of these formats have become commonly accepted and have been re-used in other collections.

In this section we list some more or less standardized file formats for annotation data and outline their respective properties. These properties can be described by the following criteria:

Where appropriate, the annotation formats will be described in these terms in the following sections.


next up previous contents
Next: Meta Data File Formats Up: File Formats Previous: Signal File Formats   Contents
BITS Projekt-Account 2004-06-01