Title | South African Broadcast News (SABN) Corpus |
Description | The corpus consists of approximately 20 hours of audio recordings from SAFM, one of South Africa's main radio news channels. The bulletins were broadcast between 1996 and 2006 and contain a mix of newsreader speech, interviews, and crossings to reporters. |
Contact name | Febe de Wet |
Contact email | fdw@sun.ac.za |
Publisher(s) | Stellenbosch University; CSIR |
Language(s) | English |
Subject | broadcast news transcription; South African English; accents of English; under-resourced languages |
URI | https://hdl.handle.net/20.500.12185/484 |
Media type | Speech |
Type | Data |
Media category | Speech corpora |
Version | 0 |
Format size | Approximately 20 hours |
Stratum | The data comprises a collection of audio files, each corresponding to one news bulletin. Transcriptions of the audio are included in the data set in TextGrid format. All 27 speakers are adults (8 male, 19 female). |
Database | Monolingual : Annotated : Unaligned |
Primary collection | Resource Index |
ISO639 code | eng |
Submit date | 2019-02-01T05:49:35Z |
Date available | 2019-02-01T05:49:35Z |
Date created | 2018-02-27 |