Affiliation:
1. Politecnico di Torino, Italy
2. Politecnico di Milano, Italy
Abstract
This paper presents a novel semi-automatic approach to construct conceptual ontologies over structured data by exploiting both the schema and content of the input dataset. It effectively combines two well-founded database and data mining techniques, i.e., functional dependency discovery and association rule mining, to support domain experts in the construction of meaningful ontologies, tailored to the analyzed data, by using Description Logic (DL). To this aim, functional dependencies are first discovered to highlight valuable conceptual relationships among attributes of the data schema (i.e., among concepts). The set of discovered correlations effectively support analysts in the assertion of the Tbox ontological statements (i.e., the statements involving shared data conceptualizations and their relationships). Then, the analyst-validated dependencies are exploited to drive the association rule mining process. Association rules represent relevant and hidden correlations among data content and they are used to provide valuable knowledge at the instance level. The pushing of functional dependency constraints into the rule mining process allows analysts to look into and exploit only the most significant data item recurrences in the assertion of the Abox ontological statements (i.e., the statements involving concept instances and their relationships).
Reference29 articles.
1. Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules in large data-bases. In J. B. Bocca, M. Jarke, & C. Zaniolo (Eds.), Proceedings of the International Conference on Very Large Data Bases (pp. 487-499). San Francisco, CA: Morgan Kaufmann.
2. Augurusa, E., Braga, D., Campi, A., & Ceri, S. (2003). Design and implementation of a graphical interface to XQuery. In G. B. Lamont, H. Haddad, G. A. Papadopoulos, & B. Panda (Eds.), Proceedings of the ACM Symposium on Applied Computing (pp. 1163-1167). New York, NY: ACM Press.
3. Baldi, M., Baralis, E., & Risso, F. (2005). Data mining techniques for effective and scalable traffic analysis. In A. Clemm, O. Festor, & A. Pras (Eds.), Proceedings of the IEEE International Symposium on Integrated Network Management (pp. 105-118). Washington, DC: IEEE Computer Society.
4. Baralis, E., Cagliero, L., Cerquitelli, T., D’Elia, V., & Garza, P. (2010). Support driven opportunistic aggregation for generalized itemset extraction. In Proceedings of the IEEE Conference on Intelligent Systems (pp. 102-107). Washington, DC: IEEE Computer Society.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Social Presence and User-Generated Content of Social Media in China;International Journal on Semantic Web and Information Systems;2019-07