JRC Publications Repository

Acquisition and Use of Multilingual Name Dictionaries

We present a method and a working system that automatically builds a large multilingual dictionary of person and organisation names through daily news analysis. The system uses this name dictionary, together with a gazetteer of location names and other means, to link related news articles across 19 languages. Prominent features of the system are the simplicity of the approach (required to extend the functionality to so many languages), the automatic merging of monolingual and cross-lingual name variants with the name's base form, and the aggregation of information about persons independently of the spelling of their name. The system, accessible online at http://press.jrc.it/NewsExplorer/, has so far collected over 630,000 different names from real-life news, with up to 140 variants for the same name, plus their inflections. We place this work in the wider context of other text-related activities carried out at the European Commission's Joint Research Centre (JRC).
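To make the idea of merging name variants under a base form concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method used by the system described above: the helper names `normalise` and `merge_variants`, the normalisation rules (strip diacritics, lowercase, collapse whitespace), and the choice of the first-seen spelling as base form are all hypothetical.

```python
import unicodedata
from collections import defaultdict


def normalise(name: str) -> str:
    """Hypothetical normalisation key: strip diacritics, lowercase, collapse spaces."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_marks.lower().split())


def merge_variants(names):
    """Group spellings that share a normalised key.

    Assumption: the first spelling seen for a key serves as the base form.
    Returns a dict mapping base form -> list of distinct variants (base form included).
    """
    groups = defaultdict(list)
    for name in names:
        variants = groups[normalise(name)]
        if name not in variants:  # keep each distinct spelling once
            variants.append(name)
    return {variants[0]: variants for variants in groups.values()}


if __name__ == "__main__":
    observed = [
        "Gerhard Schröder",
        "Gerhard Schroder",
        "GERHARD  SCHRODER",
        "Kofi Annan",
    ]
    for base, variants in merge_variants(observed).items():
        print(base, "->", variants)
```

Exact matching on a normalised key is deliberately simple. It would not merge transliteration variants such as "Schroeder", which would need language-specific rules or a string-similarity measure on top of this step.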
2009-01-09
Bulgarian Academy of Sciences
JRC45494
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.75.1772&rep=rep1&type=pdf
https://publications.jrc.ec.europa.eu/repository/handle/JRC45494
Items published in the JRC Publications Repository are protected by copyright, with all rights reserved, unless otherwise indicated. Additional information: https://ec.europa.eu/info/legal-notice_en#copyright-notice