Title: Improving Web Sites with Web Usage Mining - Web Content Mining and Semantic Analysis
Citation: SOFSEM 2006: Theory and Practice of Computer Science vol. 3831 p. 430-439
Publisher: Springer
Publication Year: 2006
JRC N°: JRC31127
URI: http://publications.jrc.ec.europa.eu/repository/handle/JRC31127
Type: Articles in periodicals and books
Abstract: With the emergence of the World Wide Web, Web sites have become a key communication channel for organizations. In this context, analyzing and improvingWeb communication is essential to better satisfy the objectives of the target audience. Web communication analysis is traditionnally performed by Web analytics software, which produce long lists of audience metrics. These metrics contain little semantics and are too detailed to be exploited by organization managers and chief editors, who need summarized and conceptual information to take decisions. Our solution to obtain such conceptual metrics is to analyze the content of the Web pages output by the Web server. In this paper, we ¯rst present a list of methods that we conceived to mine the output Web pages. Then, we explain how term weights in these pages can be used as audience metrics, and how they can be aggregated using OLAP tools to obtain concept- based metrics. Finally, we present the concept-based metrics that we obtained with our prototype WASA and SQL Server OLAP tools.
JRC Directorate:Space, Security and Migration

Files in This Item:
There are no files associated with this item.

Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.