Title: Exploiting Higher-level Semantic Information for the Opinion-oriented Summarization of Blogs
Citation: International Journal of Computational Linguistics and Applications vol. 1 no. 1-2 p. 45-59
Publication Year: 2010
JRC N°: JRC57556
ISSN: 0976-0962
URI: http://publications.jrc.ec.europa.eu/repository/handle/JRC57556
Type: Articles in periodicals and books
Abstract: With the growth of Web 2.0, people increasingly communicate, share ideas and comment in blogs, social networks, forums and review sites. In this context, new and suitable techniques must be developed to automatically process the large volume of subjective data and appropriately summarize the arguments presented therein (e.g. as "in favor" and "against"). This article assesses the impact of exploiting higher-level semantic information, such as named entities and IS-A relationships, for the automatic summarization of positive and negative opinions in blog threads. We first run a sentiment analyzer (with and without topic detection) and subsequently a summarizer based on a framework drawing on Latent Semantic Analysis. We then use an annotated corpus and the standard ROUGE scorer to automatically evaluate our approach. We compare the results obtained using different system configurations and discuss the issues involved, proposing a suitable method for tackling this scenario.
JRC Directorate: Space, Security and Migration

Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.