Affiliation:
1. Queensland University of Technology, Australia
Abstract
With the increasing number of XML documents in varied domains, it has become essential to identify ways of finding interesting information from these documents. Data mining techniques can be used to derive this interesting information. However, mining of XML documents is impacted by the data model used in data representation due to the semi-structured nature of these documents. In this chapter, we present an overview of the various models of XML documents representations, how these models are used for mining, and some of the issues and challenges inherent in these models. In addition, this chapter also provides some insights into the future data models of XML documents for effectively capturing its two important features, structure and content, for mining.
Reference76 articles.
1. Aggarwal, C. C., Ta, N., Wang, J., Feng, J., & Zaki, M. J. (2007). XProj: A framework for projected structural clustering of XML documents. In P. Berkhin, R. Caruana, & X. Wu (Eds.), Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), (pp. 46-55). ACM.
2. Graph Data Management and Mining: A Survey of Algorithms and Applications
3. Fast discovery of association rules;R.Agrawal;Advances in knowledge discovery and data mining,1996