Title: Pattern Learning for Event Extraction using Monolingual Statistical Machine Translation
Authors: TURCHI MARCOZAVARELLA VanniTANEV Hristo
Citation: Proceedings of Recent Advances in Natural Language Processing p. 371-377
Publisher: Incoma Ltd.
Publication Year: 2011
JRC Publication N°: JRC65777
ISSN: 1313-8502
URI: http://lml.bas.bg/~iva/ranlp2011/RANLR2011_Proceedings.PDF
http://publications.jrc.ec.europa.eu/repository/handle/JRC65777
Type: Contributions to Conferences
Abstract: Event extraction systems typically take advantage of language and domain-specific knowledge bases, including patterns that are used to identify specific facts in text; techniques to acquire these patterns can be considered one of the most challenging issues. In this work, we propose a languageindependent and weakly-supervised algorithm to automatically discover linear patterns from texts. Our approach is based on a phrase-based statistical machine translation system trained on monolingual data. A bootstrapping version of the algorithm is proposed. Our method was tested on patterns with different domain-specific semantic roles in three languages: English, Spanish and Russian. Performance evaluated on the extracted patterns and via the output of an event extraction system shows the feasibility of our approach and its capability of working with texts in various languages.
JRC Institute:Institute for the Protection and Security of the Citizen

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.