BAS
Bavarian Archive for Speech Signals
Corpora e Lessici dell'Italiano Parlato e Scritto, map task recordings - CLIPS_MT_MANUAL
Last update of this page: 2014-02-21  --  gleiche Seite in deutsch
Description
 CLIPS_MT_MANUAL is a sub-corpus of the original Italian CLIPS corpus (Corpora e
 Lessici dell'Italiano Parlato e Scritto) that covers only 15 maptask
 dialogues recorded in 15 locations by local speaker pairs.
 The reason why BAS decided to bring forward another edition of these data
 is that we found a large number of errors (formal and content) in the
 annotation and signal files of the original corpus that prevented our
 colleagues to perform proper phonetic investigations. To make published
 results on these (corrected) data replicable for the scientific community
 BAS decided - with the kind permission of the CLIPS copyright holders -
 to ingest this part of CLIPS in the BAS CLARIN repository under the name 'CLIPS_MT_MANUAL'
 ('MT' = 'map task', 'MANUAL' indicates manual annotation), and make it so
 available for all European academic researchers.
 In a nutshell this corpus contains 3228 inspected and partially repaired WAV signal files, each containing
 one dialogue turn (*.wav),
 3228 corrected original CLIPS annotation files (*.acs, *.phn, *.std, *.wrd),
 3228 BAS Partitur files containing the annotation tiers ORT, KAN and SAP (*.par),
 3228 EMU database annotation files (*.vot, *.hlb)
 covering 30 maptask dialogues performed by 30 speakers (each speaker pair
 performing two different map tasks) recorded in 15 different locations in Italy
 in 2000-2004.
Numbers (Version 1.1):
- Number of speakers: 30
- Number of recordings: 3228
- Number of phonetic segments: 104056
- File Formats: 
  
  - RIFF WAVE WAV 22,05kHz, 16 bit  
  
- Meta data: Speaker/Session Protocol (SpeechDat)
  
- CLIPS annotation (Transcriber)
  
- BPF (tiers ORT,KAN,SAP)
  
- Praat TextGrid
  
- Emu database: *.hlb, *.phonetic
  
 
- Segmentations: manual phonetic segmentation;
partial segmentation of closure + burst in plosives
- Distribution: DVD-R
Publications
 
Schiel F, Stevens M, Reichel U D, Cutugno F (2013): Machine Learning of Probabilistic Phonological Pronunciation Rules from the Italian CLIPS Corpus. In: Proc. of the Interspeech 2013, Lyon, France, 1414-1418.
Signal Examples
Recording DGmtB04R_p2G#142
passandogli<ii> <sp>% trovandot0i% %il gelato sulla destra praticamente gli devi<ii> <sp< gli devi andare intorno passandogli dalla sinistra
Recording DGmtB04R_p1F#143
<vocal> gli devo% %andare% %intorno
Availability and Costs
Without restrictions usable (except distribution to third parties).
CLIPS_MT_MANUAL
1 DVD-R Iso 9660 + Shipping
 
Scientific EUR 255,65 (ELRA Members EUR 127,82) + VAT
 
Commercial EUR 511,30 (ELRA Members EUR 255,64) + VAT
Questions and Orders: 
 
Florian Schiel