A Case Study of Algorithms for Morphosyntactic Tagging of Polish Language
Marcin Kuta ; Paweł Chrzaszcz ; Jacek Kitowski
Computing and Informatics, Tome 28 (2012) no. 1, / Harvested from Computing and Informatics
The paper presents an evaluation of several part-of-speech taggers, representing main tagging algorithms, applied to corpus of frequency dictionary of the contemporary Polish language. We report our results considering two tagging schemes: IPI PAN positional tagset and its simplified version. Tagging accuracy is calculated for different training sets and takes into account many subcategories (accuracy on known and unknown tokens, word segments, sentences etc.) The comparison of results with other inflecting and analytic languages is done. Performance aspects (time demands) of used tagging tools are also discussed.
Publié le : 2012-01-26
Classification:  Machine learning; part-of-speech tagging; natural language processing
@article{cai327,
     author = {Marcin Kuta and Pawe\l\ Chrzaszcz and Jacek Kitowski},
     title = {A Case Study of Algorithms for Morphosyntactic Tagging of Polish Language},
     journal = {Computing and Informatics},
     volume = {28},
     number = {1},
     year = {2012},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai327}
}
Marcin Kuta; Paweł Chrzaszcz; Jacek Kitowski. A Case Study of Algorithms for Morphosyntactic Tagging of Polish Language. Computing and Informatics, Tome 28 (2012) no. 1, . http://gdmltest.u-ga.fr/item/cai327/