Title: Lemmatization of Polish Person Names
Authors: SYDOW MARCINKUPSC Anna
Other Contributors: PISKORSKI JAKUB
Citation: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing 2007 p. 27-34
Publisher: The Association of Computational Linguistics (ACL)
Publication Year: 2007
JRC Publication N°: JRC38212
URI: http://langtech.jrc.it/BSNLP2007/m/BSNLP-2007-proceedings.pdf
http://publications.jrc.ec.europa.eu/repository/handle/JRC38212
Type: Contributions to Conferences
Abstract: The paper presents two techniques for lemmatization of Polish person names. First, we apply a rule-based approach which relies on linguistic information and heuristics. Then, we investigate an alternative knowledge-poor method which employs string distance measures. We provide an evaluation of the adopted techniques using a set of newspaper texts.
JRC Institute:Institute for the Protection and Security of the Citizen

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.