Using Language-independent Rules to Achieve High Multilinguality in Text Mining
A lot of information is hidden in foreign language text, making multilingual information extraction tools ¿ and applications that allow cross-lingual information access ¿ particularly useful. Only a few system developers offer their products for more than two or three languages. Typically, they develop the tools for one language and then adapt them to the others. Information on guidelines how to produce highly multilingual applications with the least possible effort is scarce. Due to the multinational character of the European institutions, the Text Mining applications developed at the Joint Research Centre (JRC) had to be planned with the multilinguality requirements in mind. The purpose of this chapter is to present the design principles behind the in-house developments and to show a number of different applications that are built according to these guidelines. The results of the text analysis applications presented here are visible on the publicly accessible mul-tilingual news gathering, analysis and exploration tool NewsExplorer.
STEINBERGER Ralf;
POULIQUEN Bruno;
IGNAT Camelia;
2008-11-19
IOS Press
JRC44025
978-1-58603-898-4-217,
http://www.iospress.nl/,
https://publications.jrc.ec.europa.eu/repository/handle/JRC44025,
10.3233/978-1-58603-898-4-217,
Additional supporting files
| File name | Description | File type | |