Resource Catalogue: Recent submissions
Now showing items 91-100 of 335
-
CTexT Afrikaans FLAIR Part of Speech tagger model
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans FLAIR Part of Speech tagger model is a neural part of speech tagger model based on the FLAIR framework (Akbik et al. 2019), and ... -
Core technologies for conjunctively written South African languages
(North-West University, Centre for Language Technology (CTexT), 2021-03-31)During this SADiLaR funded project, enriched corpora for the four official South African languages with a conjunctive orthography, i.e. isiNdebele ... -
Corpus of multilingual code-switched soap opera speech
(Stellenbosch University, 2020-02-28)The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ... -
COVID-19 Multilingual Terminology
(City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ... -
CGE's Afrikaans Gender Terminology List
(Commission for Gender Equality (CGE), 2021-04)CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ... -
Human Language Technology Audit 2017/18
(CSIR, 2018-08-31)This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ... -
Generic Bilingual Academic Wordlist with Definitions
(ICELDA; SADiLaR, 2021)The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ... -
Denominal adjectives in Afrikaans dataset
(South African Centre for Digital Language Resources, 2020-05-15) ~Resource Catalogue This dataset contain a collection of Afrikaans denominal adjectives that were extracted from the Virtual Institute for Afrikaans' corpus portal. The ... -
Representations of epistemological certainty and ontological ambiguity in selected earlier works by Joseph Conrad
(North-West University, 2019-02-18) ~Resource Catalogue Representations of epistemological certainty and ontological ambiguity in selected earlier works by Joseph Conrad -
SPCS Speech Corpus
(Council for Scientific and Industrial Research; North-West University, 2015-11-25) ~Resource Catalogue Broadband speech corpus of approximately 10 hours and the corresponding transcriptions. The development process of the corpus involved the recording ...