Please use this identifier to cite or link to this item:
|Title:||Aspects of Multilingual News Summarisation|
|Authors:||STEINBERGER Josef; TANEV Hristo; STEINBERGER Ralf; ZAVARELLA Vanni; TURCHI Marco|
|Type:||Articles in periodicals and books|
|Abstract:||In this book chapter, we discuss several pertinent aspects of an automatic system that generates summaries in multiple languages for sets of topic-related news articles (multilingual multi-document summarisation), gathered by news aggregation systems. The discussion follows a framework based on Latent Semantic Analysis (LSA) because LSA was shown to be a high-performing method across many different languages. Starting from a sentence-extractive approach we show how domain-specific aspects can be used and how a compression and paraphrasing method can be plugged in. We also discuss the challenging problem of summarisation evaluation in different languages. In particular, we describe two approaches: the first uses a parallel corpus and the second statistical machine translation.|
|JRC Directorate:||Space, Security and Migration|
Files in This Item:
There are no files associated with this item.
Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.