Title | UNISA English/Zulu Parallel Corpus |
Description | The resource comprises sentence-aligned, tokenized parallel text in English and Zulu. The text was extracted from the following sources: an adapted version of the English/Zulu Autshumato corpus, paragraph-translated Wikipedia texts, the Bible, the Book of Mormon, the Constitution of South Africa, the Universal Declaration of Human Rights, and a selection of translated sentences from the book "Beyond the He/Man" (1996). |
Contact name | Gideon Kotzé |
Contact email | kotzegj1@unisa.ac.za |
Publisher(s) | University of South Africa |
License | All rights reserved |
Language(s) | English; isiZulu |
Subject | parallel corpus; English; Zulu |
URI | https://hdl.handle.net/20.500.12185/489 |
Media type | Text |
Type | Data |
Media category | Multilingual text |
Format extent | 15 MB |
Version | 0.0.1 |
Format size | Token count: English = 1,490,368; Zulu = 1,009,820 |
Format medium | UTF-8 |
Stratum | yet to be determined |
Primary collection | Resource Index |
ISO639 code | eng; zul |
Submit date | 2019-02-01T05:49:36Z |
Date available | 2019-02-01T05:49:36Z |
Date created | 2018-02-28 |