Lemmatization of Polish Person Names
The paper presents two techniques for lemmatization of Polish person names. First, we apply a rule-based approach which relies on linguistic information and heuristics. Then, we investigate an alternative knowledge-poor method which employs string distance measures. We provide an evaluation of the adopted techniques using a set of newspaper texts.
SYDOW Marcin;
KUPSC Anna;
PISKORSKI Jakub;
2008-05-15
The Association of Computational Linguistics (ACL)
JRC38212
http://langtech.jrc.it/BSNLP2007/m/BSNLP-2007-proceedings.pdf,
https://publications.jrc.ec.europa.eu/repository/handle/JRC38212,
Additional supporting files
| File name | Description | File type | |