|
Sample data sets
Format: binary (due to data size), big endian; consists of concatened entries with 18 bytes length of the following format:
- model_id: uint8 ( 120 different model ids in total, ids can be between 1 and 150)
- spk_id: uint8 ( can be any of the spks with id 1 to 150)
- cdf of FA/FR: float32 (in percent, depending on type_of_test (see next point) gives the error (FR for genuine test, FA for impostor test))
- type_of_test: c_string ( 1 char, "I" (impostor test) or "G" (genuine test), terminated with binary 0)
- score: float32 (matching score of model to utterance, higher score -> better match)
- session_id: c_string (2 chars, the session number of the recording, terminated with binary 0)
- recording_id: c_string (2 chars, item number in recording session; same item -> same utterance, terminated with binary 0)
The datasets (download using right mouse button menu):
- GMM, 4 mixture, 30 fixed clients, 60 fixed impostors, all impostors used for each client (GMM_4_60_imps_all_imps.dat.gz ; 2.5M)
- GMM, 4 mixture, 30 fixed clients, 60 fixed impostors, same gender impostors, (GMM_4_60_imps_same_gender_imps.dat.gz ; 1.2M)
- GMM, 4 mixtures, 120 fixed clients, cross validation with remaining clients as impostors, (GMM_4_120_spks_cross_valid_imps_all_imps.dat.gz ; 20M)
- GMM, 4 mixtures, 120 fixed clients, cross validation with same gender impostors from remaining clients, (GMM_4_120_spks_cross_valid_imps_same_gender_imps.dat.gz ; 10M)
|