Affiliation:
1. KXEN, Inc., San Francisco, CA
Abstract
The discipline of Web Usage Mining has grown rapidly in the past few years, despite the crash of the e-commerce boom of the late 1990s. Web Usage Mining is the application of data mining techniques to Web clickstream data in order to extract usage patterns. Yet, with all of the resources put into the problem, claims of success have been limited and are often tied to specific Web site properties that are not found in general. One reason for the limited success has been a component of Web Usage Mining that is often overlooked---the need to understand the content and structure of a Web site. The processing and quantification of a Web sites content and structure for all but completely static and single frame Web sites is arguably one of the most difficult tasks to automate in the Web Usage Mining process. This article shows that, not only is the Web Usage Mining process enhanced by content and structure, it cannot be completed without it. The results of experiments run on data from a large e-commerce site are presented to show that proper preprocessing cannot be completed without the use of Web site content and structure, and that the effectiveness of pattern analysis is greatly enhanced.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications
Reference28 articles.
1. Balabanovic M. and Shoham Y. 1995. Learning information retrieval agents: Experiments with automated web browsing. In On-line Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed Heterogeneous Environments.]] Balabanovic M. and Shoham Y. 1995. Learning information retrieval agents: Experiments with automated web browsing. In On-line Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed Heterogeneous Environments.]]
2. Bonissone P. P. and Decker K. S. 1986. Selecting uncertainty calculi and granularity: An experiment in trading-off precision and complexity. Uncert. Artif. Intell. 2217--2247.]] Bonissone P. P. and Decker K. S. 1986. Selecting uncertainty calculi and granularity: An experiment in trading-off precision and complexity. Uncert. Artif. Intell. 2217--2247.]]
Cited by
28 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献