Show simple item record

Lagos-NWU Yoruba Speech Corpus
This speech corpus consisting of 16 female speakers and 17 male speakers was recorded in Lagos, Nigeria for the purpose of speech recognition research. Each speaker recorded about 130 utterances read from short texts selected for phonetic coverage. Recordings were done using a microphone connected to a laptop computer in a quiet office environment.
Daniel van Niekerk
daniel.vanniekerk@nwu.ac.za
North-West University; Centre for Text Technology (CTexT); University of Lagos (Nigeria)
Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcode
Yoruba
Daniel van Niekerk; Etienne Barnard; Oluwapelumi Giwa; Azeez Sosimi
https://hdl.handle.net/20.500.12185/431
573-526-122-515-8
Speech
Data
Monolingual speech corpora: Annotated
268 Mb (zipped)
1
Number of speakers: 33, Number of utterances: 4316, Audio length: 165 mins. (including non-speech segments) Per speaker: approx. 130 utterances amounting to approx. 5 minutes of audio
UTF8; UTF-8 encoded Unicode text; RIFF-WAVE 16-bit PCM samples at 16kHz sampling rate
Web; Magazines; Literature and student reports; Audio recordings (normal office environment)
16 female speakers and 17 male speakers recorded in Lagos, Nigeria
Resource Catalogue
Resource Index
yor
2018-02-05T20:20:56Z; 2018-03-05T17:51:10Z
2018-02-05T20:20:56Z; 2018-03-05T17:51:10Z
2015-02-06


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Resource Catalogue [335]
    A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.
  • Resource Index [386]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record