CTexT Afrikaans FLAIR String Embeddings
Title | CTexT Afrikaans FLAIR String Embeddings |
Description | The CTexT Afrikaans FLAIR String Embeddings are two Afrikaans embedding models based on the FLAIR architecture (Akbik et al. 2018, 2019) that provides real-valued vector representations for Afrikaans text. The embeddings were trained on a corpus of 230 million words. |
Contact name | Roald Eiselen |
Contact email | Roald.Eiselen@nwu.ac.za |
Publisher(s) | Centre for Text Technology (CTexT) |
License | Creative Commons Attribution-Noncommercial 4.0 International (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/ |
Language(s) | Afrikaans |
Author(s) | Eiselen, Roald |
Contributor | Eiselen, Roald |
Subject | FLAIR; String embeddings; Word embedding |
URI | https://hdl.handle.net/20.500.12185/552 |
Media category | String embeddings |
Format extent | 230 million words |
Version | 0.1 |
Format size | 69.48 Mb |
Format medium | N/A |
Submit date | 2022-02-03T08:49:29Z |
Date available | 2022-02-03T08:49:29Z |
Date created | 2022-01-10 |
Files in this item
This item appears in the following Collection(s)
-
Resource Catalogue [335]
A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.