Multilingual Statistical News Summarisation: Preliminary Experiments with English
In this paper we present a generic approach
for summarising multilingual news clusters
such as the ones produced by the Europe Media Monitor (EMM) system.
It is generic because it uses robust statistical techniques to
perform the summarisation step and its multilinguality is
inherited from the multilingual entity disambiguation system
used to build the source representation.
We ran preliminary experiments with the TAC 2008
data, an English corpus for summarisation research,
and we obtained promising improvements over
a summarisation system ranked in the top 20% at the
TAC 2008 competition.
KABADJOV Mijail;
STEINBERGER Josef;
POULIQUEN Bruno;
STEINBERGER Ralf;
POESIO Massimo;
2009-10-19
IEEE Computer Society
JRC52748
http://www.wi-iat09.disco.unimib.it/,
https://publications.jrc.ec.europa.eu/repository/handle/JRC52748,
10.1109/WI-IAT.2009.340,
Additional supporting files
| File name | Description | File type | |