Title | Tagger Parameter file for RF-Tagger (Schmid and Laws 2005) |
Description | The tagger parameter file is trained on an excerpt of the Pretoria Sepedi Corpus (D. Prinsloo, University of Pretoria): Here, about 5000 tokens were manually tagged and used for training the RF-Tagger (Helmut Schmid and Florian Laws: Estimation of Conditional Probabilities with Decision Trees and an Application to Fine-Grained POS Tagging, COLING 2008, Manchester, Great Britain). The tagger is freely available for academic purposes (see http://www.cis.uni-muenchen.de/~schmid/tools/RFTagger/). Methods and validation results can be found in: G. Faaß, U. Heid, E. Taljard, and D.J. Prinsloo. Part-of-Speech tagging in Northern Sotho: disambiguating polysemous function words. In Proceedings of the 1st Workshop on Language Technologies for African Languages - AfLaT 2009 at EACL, pages 38-45, Athens, Greece, 2009. |
Contact name | Gertrud Faass |
Contact email | gertrud.faass@uni-hildesheim.de |
Publisher(s) | Institute for Information Science and Natural Language Processing, University of Hildesheim, Germany |
License | by-nc-sa |
Language(s) | Sesotho sa Leboa (Sepedi) |
Subject | tagger parameter file; statistical tagging; RF-Tagger |
URI | https://hdl.handle.net/20.500.12185/483 |
Media type | Text |
Type | Data |
Media category | Statistical language model |
Version | 1 |
Format medium | UTF8 |
Stratum | unknown |
Primary collection | Resource Index |
ISO639 code | nso |
Submit date | 2019-02-01T05:49:35Z |
Date available | 2019-02-01T05:49:35Z |
Date created | 2018-02-21 |