Resource Index: Recent submissions
Now showing items 11-20 of 386
-
SPCS Speech Corpus
(Council for Scientific and Industrial Research; North-West University, 2015-11-25) ~Resource Catalogue Broadband speech corpus of approximately 10 hours and the corresponding transcriptions. The development process of the corpus involved the recording ... -
Speech transcription platform user interface
(Multilingual Speech Technologies, North-West University, 2017) ~Resource Index This is the user interface component of the Speech Transcription Platform developed by the Multilingual Speech Technologies group at North-West University ... -
Speech transcription platform speech services
(Multilingual Speech Technologies, North-West University, 2017) ~Resource Index This is the Language Technology Services component implemented for the Speech Transcription Platform project by the Multilingual Speech Technologies ... -
High quality TTS data for four South African languages (af, st, tn, xh)
(Google; North-West University, 2017) ~Resource Catalogue This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ... -
Speech transcription server
(Multilingual Speech Technologies, North-West University, 2017) ~Resource Index This is the "Parliament-specific" application server component implemented as a proof-of-concept during the Speech Transcription Platform project by the ... -
Bilingual English-isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~Resource Catalogue Aligned parallel corpora for the following language pair: English-isiXhosa. The data is given as two separate UTF-8 text files, with each segment on a ... -
Monolingual isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~Resource Catalogue Monolingual corpus for isiXhosa. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data ... -
NCHLT English Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~Resource Catalogue The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ... -
NCHLT Afrikaans Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~Resource Catalogue The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ... -
NCHLT Xitsonga Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~Resource Catalogue The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...