Affiliation:
1. The Division of Computer Science and Engineering Louisiana State University Baton Rouge, LA
Abstract
Modern application stores enable developers to classify their apps by choosing from a set of generic categories, or genres, such as health, games, and music. These categories are typically static—new categories do not necessarily emerge over time to reflect innovations in the mobile software landscape. With thousands of apps classified under each category, locating apps that match a specific consumer interest can be a challenging task. To overcome this challenge, in this article, we propose an automated approach for classifying mobile apps into more focused categories of functionally related application domains. Our aim is to enhance apps visibility and discoverability. Specifically, we employ word embeddings to generate numeric semantic representations of app descriptions. These representations are then classified to generate more cohesive categories of apps. Our empirical investigation is conducted using a dataset of 600 apps, sampled from the Education, Health&Fitness, and Medical categories of the Apple App Store. The results show that our classification algorithms achieve their best performance when app descriptions are vectorized using GloVe, a count-based model of word embeddings. Our findings are further validated using a dataset of Sharing Economy apps and the results are evaluated by 12 human subjects. The results show that GloVe combined with Support Vector Machines can produce app classifications that are aligned to a large extent with human-generated classifications.
Funder
U.S. National Science Foundation
LSU Economic Development Assistantships awards
Publisher
Association for Computing Machinery (ACM)
Reference97 articles.
1. Statista. 2019. Mobile app usage. Retrieved from https://www.statista.com/topics/1002/mobile-app-usage/.
2. Docbert: Bert for document classification;Adhikari Ashutosh;Retrieved from https://arXiv:1904.08398,2019
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Enhancing Agile Story Point Estimation: Integrating Deep Learning, Machine Learning, and Natural Language Processing with SBERT and Gradient Boosted Trees;Applied Sciences;2024-08-19
2. Generating Rate Features for Mobile Applications;Proceedings of the IEEE/ACM 11th International Conference on Mobile Software Engineering and Systems;2024-04-14
3. Revisiting Android App Categorization;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-04-12
4. Exploring AndroidManifest.xml for Automated Android Apps Classification;2023 IEEE International Conference on Big Data (BigData);2023-12-15
5. Strategies, Benefits and Challenges of App Store-inspired Requirements Elicitation;2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE);2023-05