Affiliation:
1. YILDIZ TEKNİK ÜNİVERSİTESİ, FEN BİLİMLERİ ENSTİTÜSÜ, VERİ BİLİMİ VE BÜYÜK VERİ ANABİLİM DALI (DİSİPLİNLERARASI), VERİ BİLİMİ VE BÜYÜK VERİ (YL) (TEZLİ)
2. YILDIZ TEKNİK ÜNİVERSİTESİ, FEN BİLİMLERİ ENSTİTÜSÜ, UZAKTAN ALGILAMA VE CBS (YL) (TEZLİ)
3. YILDIZ TEKNİK ÜNİVERSİTESİ, İNŞAAT FAKÜLTESİ, HARİTA MÜHENDİSLİĞİ BÖLÜMÜ
Abstract
The images are important components of real estate applications on the internet to inform users. There are multiple rental and sale properties and many images of these properties on the internet, and it is challenging to control the images of these real estate in terms of time, workload, and cost. Considering the requirements of the problem, Deep Learning (DL), one of the Artificial Intelligence (AI) methods, offers ideal solutions. This study aims to distinguish images that contain humans using deep learning techniques. This will also aid in not violating the privacy of people according to the Law on the Protection of Personal Data in the image content used in real estate applications. For this purpose, firstly, a dataset of real estate images with and without humans called the Real Estate Privacy (REP) dataset was created. The REP dataset was split into 70%, 20%, and 10% for training, validation, and testing, respectively. Secondly, the REP dataset was trained with Inceptionv3, ResNet-50, and DenseNet-169 architectures using transfer learning. Lastly, the performances of the architectures were evaluated by accuracy, precision, recall, and F1-score accuracy metrics. Experimental results indicate that the 52 epoch ResNet-50 architecture is the best for our datasets with 98.45% overall accuracy and 98.00% precision, 98.90% recall, and 98.44% F1-score. The Inceptionv3 model provided the best results on the 55th epoch with 98.27% accuracy, 97.81% precision, 98.71% recall, and 98.26% F1-score. Finally, the DenseNet-169 model produced the best results on the 47th epoch, with 97.81% accuracy, 97.09% precision, 98.52% recall, and 97.80% F1-score. Accuracy assessment shows that the highest accuracy among the three architectures was obtained with the ResNet-50 architecture This study shows that deep learning methods offer a perspective to image content control and can be used efficiently in real estate applications.