Title: Exploring Curvature-based Topic Development Analysis for Detecting Event Reporting Boundaries
Authors: PISKORSKI Jakub
Publisher: Springer
Publication Year: 2009
JRC N°: JRC48943
ISBN: 978-3-642-04734-3
ISSN: 0302-9743 (Print) 1611-3349 (Online)
URI: http://www.springer.com/computer/artificial/book/978-3-642-04734-3?detailsPage=toc
http://publications.jrc.ec.europa.eu/repository/handle/JRC48943
DOI: 10.1007/978-3-642-04735-0_13
Type: Articles in books
Abstract: In an era of proliferating electronic news media and ever-growing demand for prompt and concise information, natural language text processing technologies that map free text into structured data formats are becoming paramount. Recently, we have witnessed the emergence of publicly accessible news aggregation systems that facilitate navigation through news. This paper reports on explorations in refining a real-time news event extraction system, which runs on top of the Europe Media Monitoring news aggregation system developed at the Joint Research Centre of the European Commission. Our experiments focus on the task of detecting new events in a given news story, i.e., tagging events extracted by the core event extraction system as new. We explore several methods, ranging from simple similarity computation between the descriptions of adjacent events to more elaborate ones that are based on curvature-based topic development analysis and utilize global knowledge. The paper first describes the particularities of the real-time news event extraction processing chain. Next, to give better insight into how news stories evolve over time, some statistics on event dynamics are presented. Finally, the new event detection techniques are introduced and the evaluation results are given.
JRC Institute: Institute for the Protection and Security of the Citizen
