Resource Catalogue: Recent submissions
NCHLT Tshivenda Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
NCHLT Siswati Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
NCHLT isiXhosa Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
NCHLT isiNdebele Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Afrikaans linking element dataset
(North-West University, 2019) (Afrikaans follows English) This data set was compiled for a study in which the possible semantic content of Afrikaans linking elements was investigated. ...
Sesotho vowel speech data set
(Centre for Text Technology, North-West University, 2019-05-28) The primary aim of this speech dataset was to collect a representative set of words in which all the Sesotho vowels are present. Some of them are ...
Sesotho function word speech data
(Centre for Text Technology, North-West University, 2019-05-28) The primary aim of this speech data set was to study the role of tone in the function word "ke" in the minimal pairs "ke motho" and in the function word ...
Read Afrikaans Normal/ Read Afrikaans Fast
(Centre for Text Technology, North-West University, 2019-05-28) The corpus contains speech of 127 mother tongue speakers of Afrikaans. Every speaker was asked to read a text fragment from a book or a newspaper (about ...
Sesotho tone data set
(Centre for Text Technology, North-West University, 2019-05-28) These recordings are of male and female speakers (11 for tasks 1 and 2; 10 for task 3) of the QwaQwa region (Eastern Free State). Ages of the speakers ...
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) This dataset was developed during a master's degree and used in the development of a text unit identifier capable of tagging sentences, named entities, ...