Author:
Pancerz Krzysztof,Mich Olga
Abstract
In the paper, we propose a method for mining real-estate listings using clustering algorithms intended for numerical data. The presented approach is based on information systems over ontological graphs. Such information systems have been proposed to deal with data in the form of concepts linked by different semantic relations. A special attention is focused on preprocessing steps transforming advertisements in the textual form into information systems defined over ontological graphs, as well as on encoding attribute values for clustering algorithms.
Subject
General Earth and Planetary Sciences,General Environmental Science
Reference15 articles.
1. Brachman, R.J. 1983. “What Is-a Is and Isnt — an Analysis of Taxonomic Links in Semantic Networks.” Computer no. 16 (10):30–36.
2. Bramer, M.A. 2007. Principles of Data Mining, Undergraduate Topics in Computer Science. London: Springer.
3. Chaffin, R., D.J. Herrmann, and M. Winston. 1988. “An Empirical Taxonomy of Part-Whole Relations. Effects of Part-Whole Relation Type on Relation Identification.” Language, Cognition and Neuroscience no. 1 (3):17–48.
4. Cios, K.J., W. Pedrycz, R.W. Swiniarski, and L. Kurgan. 2007. Data Mining. A Knowledge Discovery Approach. New York: Springer.
5. Gan, G., C. Ma, and J. Wu. 2007. Data Clustering. Theory, Algorithms, and Aplications, ASA-SIAM series on statistics and applied probability. Philadelphia, Pa.; Alexandria, Va.: SIAM; American Statistical Association.