Comparison of String Distance Metrics for Lemmatisation of Named Entities in Polish
This paper presents the results of recent experiments on
application of string distance metrics to the problem of named entity
lemmatisation in Polish. It extends of our work in [1] by introducing
new results for organisation names. Furthermore, the results presented
here and in [2, 3] centering around the same topic were used to make a
comparative study of the average usefulness of the numerous examined
string distance metrics to lemmatisation of Polish named-entities of various
types. In particular, we focus on lemmatisation of country names,
organisation names and person names.
PISKORSKI Jakub;
SYDOW Marcin;
WIELOCH Karol;
2009-10-22
Springer Verlag
JRC49628
978-3-642-04234-8,
0302-9743 (print),
1611-3349,
http://www.springerlink.com/content/n735r4594k1830m2/?p=1bf78658ef3a4b0e8568e793efd76cc9&pi=35,
https://publications.jrc.ec.europa.eu/repository/handle/JRC49628,
10.1007/978-3-642-04235-5_36,
Additional supporting files
| File name | Description | File type | |