Multilabel Genre Prediction Using Deep-Learning Frameworks
-
Published:2023-07-27
Issue:15
Volume:13
Page:8665
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Unal Fatima Zehra1, Guzel Mehmet Serdar1ORCID, Bostanci Erkan1ORCID, Acici Koray2ORCID, Asuroglu Tunc3ORCID
Affiliation:
1. Department of Computer Engineering, Ankara University, 06830 Ankara, Turkey 2. Department of Artificial Intelligence and Data Engineering, Ankara University, 06830 Ankara, Turkey 3. Faculty of Medicine and Health Technology, Tampere University, 33014 Tampere, Finland
Abstract
In this study, transfer learning has been used to overcome multilabel classification tasks. As a case study, movie genre classification by using posters has been chosen. Six state-of-the-art pretrained models, VGG16, ResNet, DenseNet, Inception, MobileNet, and ConvNeXt, have been employed for this experiment. The movie posters have been obtained from Internet Movie Database (IMDB). The dataset has been divided using an iterative stratification technique. A sequence of dense layers has been added on top of each model and these models have been trained and fine-tuned. All the results of the models compared considered accuracy, loss, Hamming loss, F1-score, precision, and AUC metrics. When the metrics used were evaluated, the most successful result regarding accuracy has been obtained from the modified DenseNet architecture at 90%. Also, the ConvNeXt, which is the newest model among all, performed quite satisfactorily, reaching over 90% accuracy. This study uses an iterative stratification method to split an unbalanced dataset which provides more reliable results than the classical splitting method which is the common method in the literature. Also, the feature extraction capabilities of the six pretrained models have been compared. The outcome of this study shows promising results regarding multilabel classification. As for future work, it is planned to enhance this study by using natural language processing and ensemble methods.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference37 articles.
1. Senirkentli, G.B., Ekinci, F., Bostanci, E., Güzel, M.S., Dagli, Ö., Karim, A.M., and Mishra, A. (2021). Proton Therapy for Mandibula Plate Phantom. Healthcare, 9. 2. Albreiki, B., Zaki, N., and Alashwal, H. (2021). A Systematic Literature Review of Student’ Performance Prediction Using Machine Learning Techniques. Educ. Sci., 11. 3. A Behaviour-Based Architecture for Mapless Navigation Using Vision;Guzel;Int. J. Adv. Robot. Syst.,2018 4. Unal, M., Bostanci, E., Sertalp, E., Guzel, M.S., and Kanwal, N. (2018, January 19–21). Geo-location based augmented reality application for cultural heritage using drones. Proceedings of the 2018 2nd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), Ankara, Turkey. 5. Bhatt, D., Patel, C., Talsania, H., Patel, J., Vaghela, R., Pandya, S., Modi, K., and Ghayvat, H. (2021). CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope. Electronics, 10.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|