Author:
Wijaya Dedy Rahman,Ibadurrohman Raden Ilham Fadhilah,Hernawati Elis,Wikusna Wawa
Abstract
AbstractPoverty is a problem that occurs in many countries, notably in Indonesia. The common methods used to obtain poverty information are surveys and censuses. However, this process takes a long time and uses a lot of human resources. On the other hand, governments and policymakers need a faster approach to know social-economic conditions for area development plans. Hence, in this paper, we develop e-commerce data and machine learning algorithms as a proxy for poverty levels that can provide faster information than surveys or censuses. The e-commerce dataset is used and this high-dimensional data becomes a challenge. Hence, feature selection algorithms are employed to determine the best features before building a machine learning model. Furthermore, three machine learning algorithms such as support vector regression, linear regression, and k-nearest neighbor are compared to predict the poverty rate. Hence, the contribution of this paper is to propose the combination of statistical-based feature selection and machine learning algorithms to predict the poverty rate based on e-commerce data. According to the experimental results, the combination of f-score feature selection and support vector regression surpasses other methods. It shows that e-commerce data and machine learning algorithms can be potentially used as a proxy for predicting poverty.
Publisher
Springer Science and Business Media LLC
Reference34 articles.
1. BPS. Profil Kemiskinan di Indonesia Maret 2018. Badan Pusat statistik 1–8 (2018).
2. Admin Web kependudukankalbar.com. Kependudukan Kalimantan Barat. Kependudukan Kalbar (2019).
3. Praditya, D. Tech In Asia. Tech In Asia (2019).
4. Soto, V. & Virseda, J. Prediction of socio-economic levels using cellphone records. In International Conference on User Modeling, Adaptation, and Personalization (eds Konstan, J. A. et al.) 377–388 (Springer, 2011). https://doi.org/10.1007/978-3-642-22362-4.
5. Blumenstock, J., Cadamuro, G. & On, R. Predicting poverty and wealth from mobile phone metadata. Science 1979(350), 1073–1076 (2015).