Affiliation:
1. Department of Information Technology, Satya Wacana Christian University, 52-60 Diponegoro Rd., Salatiga 50711, Indonesia
2. Department of Marketing and Logistics Management, Chaoyang University of Technology, Taichung City 413310, Taiwan
3. Department of Information System, Atma Jaya Catholic University of Indonesia, Jakarta 12930, Indonesia
Abstract
Researchers in the fields of machine learning and artificial intelligence have recently begun to focus their attention on object recognition. One of the biggest obstacles in image recognition through computer vision is the detection and identification of similar items. Identifying similar musical instruments can be approached as a classification problem, where the goal is to train a machine learning model to classify instruments based on their features and shape. Cellos, clarinets, erhus, guitars, saxophones, trumpets, French horns, harps, recorders, bassoons, and violins were all classified in this investigation. There are many different musical instruments that have the same size, shape, and sound. In addition, we were amazed by the simplicity with which humans can identify items that are very similar to one another, but this is a challenging task for computers. For this study, we used YOLOv7 to identify pairs of musical instruments that are most like one another. Next, we compared and evaluated the results from YOLOv7 with those from YOLOv5. Furthermore, the results of our tests allowed us to enhance the performance in terms of detecting similar musical instruments. Moreover, with an average accuracy of 86.7%, YOLOv7 outperformed previous approaches and other research results.
Funder
National Science and Technology Council
Subject
Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems
Reference41 articles.
1. Joint Probabilistic People Detection in Overlapping Depth Images;Wetzel;IEEE Access,2020
2. Implementing a Real-Time, AI-Based, People Detection and Social Distancing Measuring System for COVID-19;Saponara;J. Real Time Image Process.,2021
3. Assessment of Temporal Aspects in Popular Singers;Ribeiro;CODAS,2015
4. New Colour Fusion Deep Learning Model for Large-Scale Action Recognition;Lavinia;Int. J. Comput. Vis. Robot.,2020
5. Bai, T., Pang, Y., Wang, J., Han, K., Luo, J., Wang, H., Lin, J., Wu, J., and Zhang, H. (2020). An Optimized Faster R-CNN Method Based on DRNet and RoI Align for Building Detection in Remote Sensing Images. Remote Sens., 12.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Deep Learning-Based Face Mask Recognition System with YOLOv8;2024 16th International Conference on Computer and Automation Engineering (ICCAE);2024-03-14
2. Analysis of Internet Movie Database with Global Vectors for a Word Representation;Vietnam Journal of Computer Science;2023-12-22
3. Automated Fruit Classification Based on Deep Learning Utilizing Yolov8;2023 10th IEEE Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON);2023-12-01
4. Contrast Stretching For Automatic Road Marking Detection at Night With YOLOv5;2023 12th International Conference on Awareness Science and Technology (iCAST);2023-11-09
5. Automatic Hand Recognition Using Deep Learning and YOLOv8;2023 12th International Conference on Awareness Science and Technology (iCAST);2023-11-09