Imbalanced Multimodal Attention-Based System for Multiclass House Price Prediction-Reference-Cited by-同舟云学术

Imbalanced Multimodal Attention-Based System for Multiclass House Price Prediction

Published:2022-12-27 Issue:1 Volume:11 Page:113
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Li Yansong,Branco Paula^ORCID,Zhang Hanxiang

Abstract

House price prediction is an important problem for individuals, companies, organizations, and governments. With a vast amount of diversified and multimodal data available about houses, the predictive models built should seek to make the best use of these data. This leads to the complex problem of how to effectively use multimodal data for house price prediction. Moreover, this is also a context suffering from class imbalance, an issue that cannot be disregarded. In this paper, we propose a new algorithm for addressing these problems: the imbalanced multimodal attention-based system (IMAS). The IMAS makes use of an oversampling strategy that operates on multimodal data, namely using text, numeric, categorical, and boolean data types. A self-attention mechanism is embedded to leverage the usage of neighboring information that can benefit the model’s performance. Moreover, the self-attention mechanism allows for the determination of the features that are the most relevant and adapts the weights used according to that information when performing inference. Our experimental results show the clear advantage of the IMAS, which outperforms all the competitors tested. The analysis of the weights obtained through the self-attention mechanism provides insights into the features’ relevance and also supports the importance of using this mechanism in the predictive model.

Funder

NSERC

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/1/113/pdf

Reference44 articles.

1. Deep learning model for house price prediction using heterogeneous data analysis along with joint self-attention mechanism;Wang;IEEE Access,2021

2. Sun, C., Myers, A., Vondrick, C., Murphy, K., and Schmid, C. (November, January 27). Videobert: A joint model for video and language representation learning. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea.

3. Li, L.H., Yatskar, M., Yin, D., Hsieh, C.J., and Chang, K.W. (2019). Visualbert: A simple and performant baseline for vision and language. arXiv.

4. Ma, P., Mira, R., Petridis, S., Schuller, B.W., and Pantic, M. (2021). LiRA: Learning visual speech representations from audio through self-supervision. arXiv.

5. Shi, B., Hsu, W.N., Lakhotia, K., and Mohamed, A. (2022). Learning audio-visual speech representation by masked multimodal cluster prediction. arXiv.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Real Estate Price Prediction on GenerativeLanguage Models;2023 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE);2023-12-04

2. GERPM: A Geographically Weighted Stacking Ensemble Learning-Based Urban Residential Rents Prediction Model;Mathematics;2023-07-18

3. Real Estate Price Prediction Using Machine Learning;Lecture Notes in Electrical Engineering;2023