Affiliation:
1. YILDIZ TEKNİK ÜNİVERSİTESİ
Abstract
With the spread of social media platforms and the rapid increase in the number of users, the amount of data produced in social media is growing rapidly. One of the goals of scientific studies to extract information from this data is occupation prediction. Social media users' occupation information can be used in many different areas, especially in smart suggestion systems. In this study, it is aimed to make occupation prediction using Turkish tweets. Within the scope of the study, an occupation dataset consisting of 25,000 Turkish tweets was created and shared publicly. Various preprocessing steps were applied on this dataset, and feature sets were extracted using both the words themselves and the word roots. In the tests, tweets were used both singularly and combined in groups of 5 and 10. In the experiments in which Support Vector Machine and Logistic Regression methods were applied, tests were repeated by feature selection. While the best result was obtained as 74.90% accuracy in the experiments with singular tweets, the best performances were reported as 96.20% accuracy in experiments with tweets combined in groups of 5, and 99.00% accuracy in experiments with tweets combined in groups of 10. It has been seen that the using of word roots in the tests has higher success than using the words themselves, and the feature selection generally increases the success. At the end of the study, these results were discussed and suggestions for future studies were presented.
Publisher
European Journal of Science and Technology
Subject
General Earth and Planetary Sciences,General Environmental Science
Reference16 articles.
1. Akın, M. D., & Akın, A. A. (2007). Türk Dilleri İçin Açık Kaynaklı Doğal Dil İşleme Kütüphanesi: Zemberek. Elektrik Mühendisliği, 431, 38-44.
2. Chu, W., & Chiu, C. (2014, Aralık). Predicting Occupation from Single Facial Images. IEEE International Symposium on Multimedia, Taichung, Tayvan. https://doi.org/10.1109/ISM.2014.13
3. Chu, W., & Chiu, C. (2016). Predicting Occupation from Images by Combining Face and Body Context Information. ACM Transactions on Multimedia Computing, Communications, and Applications, 13(1), 1-21. https://doi.org/10.1145/3009911
4. Hu, T., Xiao, H., Luo, J., & Nguyen, T. T. (2016, Mayıs). What the Language You Tweet Says About Your Occupation. The Tenth International AAAI Conference on Web and Social Media (ICWSM), Köln, Almanya. https://www.aaai.org/ocs/index.php/ICWSM/ICWSM16/paper/view/13020
5. Huang, Y., Yu, L., Wang, X., & Cui, B. (2015). A multi-source integration framework for user occupation inference in social media systems. World Wide Web, 18, 1247-1267. https://doi.org/10.1007/s11280-014-0300-6