Title: Combining various text analysis tools for multilingual media monitoring
Authors: STEINBERGER Ralf
Citation: Multilingual Resources and Multilingual Applications - Proceedings of the Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011 vol. B-Series p. 25-32
Publisher: Hamburger Zentrum für Sprachkorpora
Publication Year: 2011
JRC N°: JRC66331
ISSN: 0176-559X
URI: http://www.corpora.uni-hamburg.de/gscl2011/downloads/AZM96.pdf
http://publications.jrc.ec.europa.eu/repository/handle/JRC66331
Type: Articles in periodicals and books
Abstract: There is ample evidence that information contained in media reports is complementary across countries and languages. This holds both for facts and for opinions. Monitoring multilingual and multinational media therefore gives a more complete picture of the world than monitoring the media of only one language, even if it is a world language like English. Wide coverage and highly multilingual text processing is thus important. The JRC-developed Europe Media Monitor (EMM) family of applications gathers about 100,000 media reports per day in 50 languages from the internet, groups related articles, classifies them, detects and follows trends, produces statistics and issues automatic alerts. For a subset of 20 languages, it also extracts and disambiguates entities (persons, organisations and locations) and reported speech, links related news over time and across languages, gathers historical information about entities and produces various types of social networks. More recent R&D efforts focus on event scenario template filling, opinion mining, multi-document summarisation, and machine translation. This extended abstract gives an overview of EMM from a functionality point of view rather than providing technical detail.
JRC Directorate:Space, Security and Migration

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.