Title | Unisa South African Spoken and Signed Language Corpus |
Description | This resource comprises annotated transcriptions of audio and video segments of the Xhosa section of the spoken corpus project SOUTHTALK (Southern African Spoken Language Corpus) under the auspices of the University of South Africa and the University of Gothenburg. |
Contact name | Gideon Kotzé |
Contact email | kotzegj1@unisa.ac.za |
Publisher(s) | University of South Africa |
Language(s) | isiXhosa |
Subject | corpus; Xhosa; transcribed audio; transcribed video |
URI | https://hdl.handle.net/20.500.12185/491 |
Media type | Text |
Type | Data |
Media category | Monolingual text |
Format extent | 524KB (audio); 800KB (video) (including all annotations and headers) |
Version | 0.0.1 |
Format size | 5246 untokenized words (audio); 34432 untokenized words (video). Annotations within the text itself were not removed. |
Stratum | It is not yet available, but each document contains information on the genders of the speakers, as well as their age and status of education. This information can be found in the header section of each file. |
Primary collection | Resource Index |
ISO639 code | xho |
Submit date | 2019-02-01T05:49:36Z |
Date available | 2019-02-01T05:49:36Z |
Date created | 2018-02-28 |