Identifying duplicated ads on property selling and renting websites
-
Published:2019-10-01
Issue:7
Volume:1333
Page:072025
-
ISSN:1742-6588
-
Container-title:Journal of Physics: Conference Series
-
language:
-
Short-container-title:J. Phys.: Conf. Ser.
Author:
Tynchenko V S,Kukartsev V V,Tynchenko V V,Bukhtoyarov V V,Chzhan E A,Kukartsev V A,Boyko A A
Abstract
Abstract
The article presents a solution for the problem of identifying duplicated ads on property selling websites. This task is formulated in the form of a classification problem: the input parameters are identified then divided into basic and non-basic, as well as a class-forming feature. It is also necessary to consider the preliminary data of processed property objects, which is necessary for proper application of the classification methods. The following is a brief review of chosen modern algorithms for solving classification problems, namely: decision trees, artificial neural networks, logistic regression. As a result of experiments, it was revealed that Artificial neural network gives the most accurate result therefore, this algorithm is suitable for the solution of the stated problem.
Subject
General Physics and Astronomy
Reference11 articles.
1. The elements of statistical learning;Friedman;Springer series in statistics,2001