Title: Aspects of Multilingual News Summarisation
Authors: STEINBERGER JosefTANEV HristoSTEINBERGER RalfZAVARELLA VanniTURCHI Marco
Publisher: IGI Global
Publication Year: 2014
JRC N°: JRC82759
ISBN: 978-1-4666-5019-0 (print)
978-1-4666-5020-6 (online)
ISSN: 2327-1981 (print)
2327-199X (online)
URI: http://www.igi-global.com/book/innovative-document-summarization-techniques/84169
http://publications.jrc.ec.europa.eu/repository/handle/JRC82759
DOI: 10.4018/978-1-4666-5019-0.ch012
Type: Articles in periodicals and books
Abstract: In this book chapter, we discuss several pertinent aspects of an automatic system that generates summaries in multiple languages for sets of topic-related news articles (multilingual multi-document summarisation), gathered by news aggregation systems. The discussion follows a framework based on Latent Semantic Analysis (LSA) because LSA was shown to be a high-performing method across many different languages. Starting from a sentence-extractive approach we show how domain-specific aspects can be used and how a compression and paraphrasing method can be plugged in. We also discuss the challenging problem of summarisation evaluation in different languages. In particular, we describe two approaches: the first uses a parallel corpus and the second statistical machine translation.
JRC Directorate:Space, Security and Migration

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.