Abstract
The features of the micro-location and in particular the micro-neighborhood that residents perceive on a daily basis have a considerable influence on the quality of living and also on housing prices. For automated valuation models (AVMs), the use of micro-neighborhood information would be beneficial, as incorporating additional spatial effects into the price estimate could potentially reduce the empirical error. However, measuring related features is difficult, as they must first be defined and then collected, which is extremely challenging at such a small spatial level. In this study, we investigate the extent to which the quality of micro-neighborhoods can be assessed holistically using multiple data modalities. We design a scalable approach using alternative data (images and text), with the potential to expand coverage to other urban regions. To achieve this, we propose a multimodal deep learning architecture that integrates both textual and visual inputs and fuses this information. In addition, we introduce a training strategy that enables a targeted fusion of orthogonal visual representations of the residential area within the model architecture. In our experiments, we test and compare different unimodal models with our multimodal architectures. The results demonstrate that the multimodal model with targeted fusion of the orthogonal visual inputs achieves the best performance and also improves the prediction accuracy for underrepresented location quality classes.
Funder
FH Kufstein Tirol - University of Applied Sciences
Publisher
Springer Science and Business Media LLC