An official website of the European Union How do you know?      
European Commission logo
JRC Publications Repository Menu

Combining various text analysis tools for multilingual media monitoring

2011Scientific articles and academic literatureHealth and consumer protection Safety and security
There is ample evidence that information contained in media reports is complementary across countries and languages. This holds both for facts and for opinions. Monitoring multilingual and multinational media therefore gives a more complete picture of the world than monitoring the media of only one language, even if it is a world language like English. Wide coverage and highly multilingual text processing is thus important. The JRC-developed Europe Media Monitor (EMM) family of applications gathers about 100,000 media reports per day in 50 languages from the internet, groups related articles, classifies them, detects and follows trends, produces statistics and issues automatic alerts. For a subset of 20 languages, it also extracts and disambiguates entities (persons, organisations and locations) and reported speech, links related news over time and across languages, gathers historical information about entities and produces various types of social networks. More recent R&D efforts focus on event scenario template filling, opinion mining, multi-document summarisation, and machine translation. This extended abstract gives an overview of EMM from a functionality point of view rather than providing technical detail.
2012-01-30
Hamburger Zentrum für Sprachkorpora
JRC66331
0176-559X
Show metadata record  Copy citation url to clipboard  Download BibTeX
Items published in the JRC Publications Repository are protected by copyright, with all rights reserved, unless otherwise indicated. Additional information: https://ec.europa.eu/info/legal-notice_en#copyright-notice