Affiliation:
1. University of Khartoum, Sudan
2. King Faisal University, Saudi Arabia
Abstract
The web is a rich, dynamic, and fast-growing source of data for mining, offering opportunities that often go unexploited. Web data pose a real challenge to traditional data mining techniques because of their huge volume and unstructured nature. Web logs record the interactions between visitors and a website, and analyzing them provides insight into visitors' behavior, usage patterns, and trends. Web administrators primarily use these logs to measure traffic and to detect broken links and other errors. Web usage mining, also known as web log mining, is the process of applying data mining techniques to discover useful information hidden in web server logs. The extracted information can benefit a number of application areas, such as web personalization, website restructuring, system performance improvement, and business intelligence. The web usage mining process involves three main phases: preprocessing, pattern discovery, and pattern analysis. Various preprocessing techniques have been proposed to extract information from log files and group primitive data items into meaningful, higher-level abstractions suitable for mining, usually in the form of visitor sessions. The major data mining techniques used for pattern discovery in web usage mining are clustering, association analysis, classification, and sequential pattern discovery. This chapter discusses the web usage mining process, its procedures and methods, and its pattern discovery techniques. It also presents a practical example using real web log data.
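The preprocessing phase summarized above (cleaning log entries and grouping them into visitor sessions) can be illustrated with a minimal sketch. It is not the chapter's own procedure. It assumes log lines in the Common Log Format, identifies a visitor by IP address alone, and splits sessions with a 30-minute inactivity timeout, a common heuristic. The names `LOG_PATTERN`, `SESSION_TIMEOUT`, `parse_line`, `build_sessions`, and the sample log lines are all hypothetical.

```python
import re
from collections import defaultdict
from datetime import datetime, timedelta

# Assumption: entries follow the Common Log Format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) \S+'
)

# Assumption: a gap of more than 30 minutes starts a new session.
SESSION_TIMEOUT = timedelta(minutes=30)

# Assumption: requests for these resources are not page views.
IGNORED_SUFFIXES = (".png", ".jpg", ".gif", ".css", ".js", ".ico")


def parse_line(line):
    """Return (ip, timestamp, url, status), or None if the line is malformed."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    timestamp = datetime.strptime(match.group("time"), "%d/%b/%Y:%H:%M:%S %z")
    return match.group("ip"), timestamp, match.group("url"), int(match.group("status"))


def build_sessions(lines):
    """Clean the log and group page views into per-visitor sessions."""
    hits_by_visitor = defaultdict(list)
    for line in lines:
        record = parse_line(line)
        if record is None:
            continue
        ip, timestamp, url, status = record
        # Data cleaning: keep successful page requests only.
        if status != 200 or url.lower().endswith(IGNORED_SUFFIXES):
            continue
        hits_by_visitor[ip].append((timestamp, url))

    sessions = []
    for ip, hits in hits_by_visitor.items():
        hits.sort()  # order each visitor's requests by time
        current = [hits[0][1]]
        for (previous_time, _), (timestamp, url) in zip(hits, hits[1:]):
            if timestamp - previous_time > SESSION_TIMEOUT:
                sessions.append((ip, current))
                current = []
            current.append(url)
        sessions.append((ip, current))
    return sessions


# Hypothetical sample data, for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - - [01/Jan/2024:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
    '10.0.0.1 - - [01/Jan/2024:10:00:01 +0000] "GET /logo.png HTTP/1.1" 200 512',
    '10.0.0.1 - - [01/Jan/2024:10:05:00 +0000] "GET /courses.html HTTP/1.1" 200 2048',
    '10.0.0.1 - - [01/Jan/2024:11:00:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
    '10.0.0.2 - - [01/Jan/2024:10:01:00 +0000] "GET /missing.html HTTP/1.1" 404 0',
    '10.0.0.2 - - [01/Jan/2024:10:02:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
]
```

Calling `build_sessions(SAMPLE_LOG)` yields a list of `(ip, [urls])` pairs, one per session. Such lists are the kind of input that the pattern discovery techniques named above (clustering, association analysis, sequential pattern discovery) typically operate on. A real pipeline would likely also need user identification beyond the IP address and path completion, which this sketch omits.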