Title: Multilingual Multi-document Continuously-updated Social Networks
Citation: The International Workshop on Multi-Source, Multilingual Information Extraction and Summarization - Proceedings p. 25-32
Publisher: Incoma
Publication Year: 2007
JRC N°: JRC45481
URI: http://publications.jrc.ec.europa.eu/repository/handle/JRC45481
Type: Articles in periodicals and books
Abstract: We present a fully automatic, live online system (accessible at http://langtech.jrc.it/SocNet) that produces monolingual or mixed-language social network graphs showing which groups of persons are mentioned together in the world news of the last few hours. The system is based on name mentions extracted automatically from an average of 35,000 news articles per day in 32 languages. For any given person on the graph, hyperlinks lead to the list of text snippets and to the original texts where the person was mentioned, as well as to a dedicated webpage containing additional information about this person gathered over several years. For any link between persons, hyperlinks lead to the list of text snippets and to the full texts where both persons are mentioned. Building multilingual social networks that even cross writing systems (Arabic, Greek, Chinese, etc.) is made possible by exploiting the name database built up by the multilingual online NewsExplorer system (Steinberger et al. 2005), which automatically associates name variants with the same person identifier. We also discuss differences between live social networks generated from the news in different languages for the same time period.
JRC Directorate: Space, Security and Migration
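
The abstract describes links between persons who are mentioned together, with name variants across writing systems resolved to a single person identifier. The following is a minimal sketch of that idea, not the system's actual implementation: the variant table, the person identifiers, the sample texts, and the function name `build_comention_graph` are all hypothetical, and naive substring matching stands in for real multilingual name recognition.

```python
from collections import Counter
from itertools import combinations

# Hypothetical variant table: surface form -> person identifier.
# The names and identifiers are invented for illustration only.
VARIANT_TO_ID = {
    "Anna Petrova": "P1",
    "Анна Петрова": "P1",   # Cyrillic variant of the same (fictional) person
    "John Smith": "P2",
    "Джон Смит": "P2",
    "Maria Rossi": "P3",
}

def build_comention_graph(articles, variant_to_id):
    """Count, for each pair of person identifiers, the articles mentioning both."""
    edges = Counter()
    for text in articles:
        # Assumption: plain substring lookup replaces real name extraction.
        ids = {pid for name, pid in variant_to_id.items() if name in text}
        # Every unordered pair of persons in one article adds weight to their link.
        for a, b in combinations(sorted(ids), 2):
            edges[(a, b)] += 1
    return edges

articles = [
    "Anna Petrova met John Smith on Monday.",
    "Анна Петрова и Джон Смит провели переговоры.",
    "Maria Rossi spoke with Anna Petrova.",
]
graph = build_comention_graph(articles, VARIANT_TO_ID)
for (a, b), weight in sorted(graph.items()):
    print(a, b, weight)
```

Because both the Latin and Cyrillic spellings map to the same identifiers, the first two sample texts contribute to a single link, which is how a mixed-language graph can arise from monolingual articles.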