BAS
Bavarian Archive for Speech Signals
SmartWeb Handheld Corpus - SHC
Last update 2014-03-04
Description
This corpus contains recordings of 156 speakers in a human-machine query situation.
Users were asked to solve several tasks by querying the WWW through a spoken query system, using a
smartphone as a portable device in natural environments (office, hall, restaurant, street).
Three channels were recorded: the Bluetooth headset over UMTS (telephone quality), the Bluetooth
headset in high quality, and an additional collar microphone in high quality.
- Total number of recorded queries: 10966
- Total duration of segmented speech: 1835 min
- Vocabulary size (number of unique transcribed word tokens): 5624
- Formats: WAV (44.1 kHz, 16 bit), A-law (8 kHz, 8 bit), Verbmobil transliteration, BAS Partitur Format (BPF)
- Segmentation: automatic segmentation into queries by the recording server
- Distribution: 15 DVD-R
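As a rough orientation, the figures above can be turned into an average query length and raw data rates per channel. The following is a minimal sketch, not part of the corpus tools; the function names are hypothetical, and it assumes that every audio file holds a single (mono) channel, which the description does not state explicitly.

```python
# Figures taken from the corpus description above.
TOTAL_QUERIES = 10966
SEGMENTED_SPEECH_MIN = 1835


def mean_query_seconds(total_minutes: float, n_queries: int) -> float:
    """Average duration of one segmented query, in seconds."""
    return total_minutes * 60 / n_queries


def bytes_per_second(sample_rate_hz: int, bits_per_sample: int, channels: int = 1) -> int:
    """Raw (uncompressed, header-less) data rate of an audio stream.

    channels=1 is an assumption: the description does not say whether
    the files are mono.
    """
    return sample_rate_hz * bits_per_sample // 8 * channels


mean_s = mean_query_seconds(SEGMENTED_SPEECH_MIN, TOTAL_QUERIES)
hq_rate = bytes_per_second(44100, 16)   # high-quality WAV channels
alaw_rate = bytes_per_second(8000, 8)   # A-law UMTS channel

print(f"mean query duration: {mean_s:.2f} s")
print(f"high-quality channel: {hq_rate} bytes/s")
print(f"A-law channel: {alaw_rate} bytes/s")
```

Under these assumptions an average query lasts about 10 seconds, and a high-quality channel needs roughly eleven times the data rate of the telephone-quality channel.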
Publication: Mögele, H., Kaiser, M., Schiel, F. (2006): SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus, Proc. of the LREC 2006, Genova, Italy.
Audio examples
Recording u013/nsw-0130rec-030 Bluetooth Headset UMTS
ich bin gerade in ~Kaiserslautern am am Stadion und m"ochte wissen <"ah> , wo die n"achste Premiere-Sports-Bar ist .
Recording u013/nsw-0130rec-030 Bluetooth Headset High Quality (no UMTS transmission)
ich bin gerade in ~Kaiserslautern am am Stadion und m"ochte wissen <"ah> , wo die n"achste Premiere-Sports-Bar ist .
Recording u013/nsw-0130rec-030 Collar Microphone High Quality (no UMTS transmission)
ich bin gerade in ~Kaiserslautern am am Stadion und m"ochte wissen <"ah> , wo die n"achste Premiere-Sports-Bar ist .
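The example transcripts use an ASCII notation rather than plain German orthography. The sketch below shows one way to turn such a line into plain words. It rests on assumptions about the notation that this page does not spell out: that `"a`, `"o`, `"u` and `"s` stand for ä, ö, ü and ß, that `~` marks a proper name, and that `<...>` encloses non-lexical events such as the hesitation `<"ah>`. The function names are hypothetical and not part of any BAS tool.

```python
import re

# Assumed ASCII encoding of German special characters.
_UMLAUTS = {
    '"a': "ä", '"o': "ö", '"u': "ü",
    '"A': "Ä", '"O': "Ö", '"U': "Ü",
    '"s': "ß",
}
_UMLAUT_RE = re.compile(r'"[aouAOUs]')
_PUNCTUATION = {",", ".", "?"}


def decode_umlauts(text: str) -> str:
    """Replace sequences such as "o with the corresponding character."""
    return _UMLAUT_RE.sub(lambda m: _UMLAUTS[m.group(0)], text)


def to_plain_words(text: str) -> list[str]:
    """Return the lexical words of one transcript line.

    Drops <...> markers (assumed to be non-lexical events), strips the
    ~ prefix (assumed to mark proper names) and removes the
    space-separated punctuation tokens.
    """
    text = decode_umlauts(text)
    text = re.sub(r"<[^>]*>", " ", text)
    text = text.replace("~", "")
    return [tok for tok in text.split() if tok not in _PUNCTUATION]


SAMPLE = ('ich bin gerade in ~Kaiserslautern am am Stadion und m"ochte '
          'wissen <"ah> , wo die n"achste Premiere-Sports-Bar ist .')

words = to_plain_words(SAMPLE)
print(" ".join(words))
```

Repetitions such as "am am" are kept on purpose, since they are part of what the speaker actually said.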
Availability and fees
No restrictions, except that re-distribution to third parties is not permitted.
SmartWeb Handheld Corpus - SHC
15 DVD-R (ISO 9660) + shipping + handling
Scientific EUR 3.825,00 (ELRA Members EUR 1.912,50) + VAT
Commercial EUR 4.825,00 (ELRA Members EUR 2.912,50) + VAT
Questions and orders:
Florian Schiel