NCHLT Speech II Corpus
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Download
MD5: 78b9cd0bd557fc0d101b00d0e1053c86
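The published MD5 value can be used to confirm that the download completed without corruption. The following is a minimal sketch, assuming the archive has been saved locally; the function names are illustrative and not part of the corpus distribution.

```python
import hashlib

# Checksum published on the corpus download page.
EXPECTED_MD5 = "78b9cd0bd557fc0d101b00d0e1053c86"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected value (case-insensitive)."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_published_md5` with the path of the downloaded archive should return `True` for an intact download; the archive's actual filename is not given on this page, so supply whatever name it was saved under.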
Collections
- Resource Catalogue [335]
- Resource Index [386]
Author(s)
Jaco Badenhorst
Febe de Wet
Neil Kleynhans
Thipe Modipa
Metadata
Description
The speech corpus was generated by aligning audio samples from National Parliament with Hansard transcriptions, and is provided as audio files with transcriptions. The XML files provide the following metadata for each session:
- audio filename
- audio orthography
- GOP (goodness of pronunciation) score
- start time (seconds)
- end time (seconds)
The audio files are formatted as 16-bit signed integer PCM, single channel, with a 16 kHz sample rate.
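The stated audio format and the start and end times in the metadata can be combined to check a file and cut out an aligned segment. The following is a minimal sketch, assuming the audio is stored in WAV containers (the page does not name the container) and that start and end times are offsets in seconds from the beginning of the file; the function names are hypothetical.

```python
import wave

# Format stated in the corpus description.
EXPECTED_CHANNELS = 1             # single channel
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
EXPECTED_SAMPLE_RATE_HZ = 16000   # 16 kHz


def wav_matches_corpus_format(path):
    """Return True if the WAV header matches the format stated for the corpus."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getframerate() == EXPECTED_SAMPLE_RATE_HZ
        )


def read_segment(path, start_s, end_s):
    """Return the raw PCM bytes between start_s and end_s (both in seconds)."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        first_frame = int(start_s * rate)
        last_frame = int(end_s * rate)
        wav.setpos(first_frame)
        return wav.readframes(last_frame - first_frame)
```

The `wave` module only reports the sample width, so the check cannot distinguish signed from unsigned samples; 16-bit PCM in WAV files is signed by convention.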
Contact person
Karen Calteaux
Contact person's e-mail address
KCalteaux@csir.co.za
Publisher(s)
Meraka Institute, CSIR