String Distance Metrics for Reference Matching and Search Query Correction
String distance metrics have been widely used in various applications concerning processing of textual data. This paper reports on the exploration of their usability for tackling the reference matching task and for the automatic correction of misspelled search engine queries, in the context of highly inflective languages, in particular focusing on Polish. The results of numerous experiments in different scenarios are
presented and they revealed some preferred metrics. Surprisingly good results were observed for correcting misspelled search engine queries.
Nevertheless, a more in-depth analysis is necessary to achieve improvements.
The work reported here constitutes a good point of departure for further research on this topic.
PISKORSKI Jakub;
SYDOW Marcin;
2008-05-15
Springer Verlag
JRC36544
http://www.springerlink.com/content/h48610025603/?sortorder=asc&p_o=20,
http://www.springerlink.com/content/q52403m131j21424/?p=dcf2928c74174b969579744a73c83406&pi=26,
https://publications.jrc.ec.europa.eu/repository/handle/JRC36544,
10.1007/978-3-540-72035-5,
Additional supporting files
File name | Description | File type | |