NCHLT-inlang Pronunciation Dictionaries
Title | NCHLT-inlang Pronunciation Dictionaries |
Description | Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations for unseen words. |
Contact name | Karen Calteaux |
Contact email | KCalteaux@csir.co.za |
Publisher(s) | Meraka Institute, CSIR; North-West University |
License | Creative Commons Attribution 3.0 Unported License (CC BY 3.0): http://creativecommons.org/licenses/by/3.0/legalcode |
Language(s) | Afrikaans; English; isiNdebele; isiXhosa; isiZulu; Sesotho sa Leboa (Sepedi); Setswana; Sesotho; Siswati; Tshivenda; Xitsonga |
Author(s) | Marelie Davel |
Contributor | Charl van Heerden; Willem Basson; Simon Kemisho; Thipe Modipa; Mpho Kgampe; Etienne Barnard; Martin Puttkammer; various language practitioners from C-Trans (NWU); Translation World. |
Citation | E. Barnard, M. H. Davel, C. van Heerden, F. de Wet and J. Badenhorst, "The NCHLT corpus of the South African languages", in Proc. SLTU, May 2014. |
URI | https://hdl.handle.net/20.500.12185/365 |
ISLRN | 744-144-734-416-8 |
Media type | Speech |
Type | Data |
Media category | Pronunciation dictionaries |
Format extent | 1.1 Mb |
Version | 1.2 |
Format size | 15,000 words per language |
Format medium | Text: UTF8, tab-delimited text Pronunciations: X-SAMPA Audio: 44,100 bps, 16-bit mono wav encoding |
Project | NCHLT Speech |
Software requirements | Perl |
Source | Wordlist |
Stratum | 15,000 generic words |
Primary collection | Resource Catalogue |
Secondary collection | Resource Index |
ISO639 code | afr; eng; nbl; xho; zul; sot; nso; tsn; ssw; ven; tso |
Submit date | 2018-02-05T20:18:41Z; 2018-03-05T17:48:03Z |
Date available | 2018-02-05T20:18:41Z; 2018-03-05T17:48:03Z |
Date created | 2014-07-04 |
Files in this item
This item appears in the following Collection(s)
-
Resource Catalogue [335]
A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture. -
Resource Index [386]
A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.