Acoustic Gender and Age Classification as an Aid to Human–Computer Interaction in a Smart Home Environment
-
Published:2022-12-29
Issue:1
Volume:11
Page:169
-
ISSN:2227-7390
-
Container-title:Mathematics
-
language:en
-
Short-container-title:Mathematics
Author:
Vlaj DamjanORCID, Zgank AndrejORCID
Abstract
The advanced smart home environment presents an important trend for the future of human wellbeing. One of the prerequisites for applying its rich functionality is the ability to differentiate between various user categories, such as gender, age, speakers, etc. We propose a model for an efficient acoustic gender and age classification system for human–computer interaction in a smart home. The objective was to improve acoustic classification without using high-complexity feature extraction. This was realized with pitch as an additional feature, combined with additional acoustic modeling approaches. In the first step, the classification is based on Gaussian mixture models. In the second step, two new procedures are introduced for gender and age classification. The first is based on the count of the frames with the speaker’s pitch values, and the second is based on the sum of the frames with pitch values belonging to a certain speaker. Since both procedures are based on pitch values, we have proposed a new, effective algorithm for pitch value calculation. In order to improve gender and age classification, we also incorporated speech segmentation with the proposed voice activity detection algorithm. We also propose a procedure that enables the quick adaptation of the classification algorithm to frequent smart home users. The proposed classification model with pitch values has improved the results in comparison with the baseline system.
Funder
Slovenian Research Agency
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference43 articles.
1. United Nations (2004). World Population to 2300, Department of Economic and Social Affairs, Population Division. 2. Mukhamediev, R.I., Popova, Y., Kuchin, Y., Zaitseva, E., Kalimoldayev, A., Symagulov, A., Levashenko, V., Abdoldina, F., Gopejenko, V., and Yakunin, K. (2022). Review of Artificial Intelligence and Machine Learning Technologies: Classification, Restrictions, Opportunities and Challenges. Mathematics, 10. 3. Astapov, S., Gusev, A., Volkova, M., Logunov, A., Zaluskaia, V., Kapranova, V., Timofeeva, E., Evseeva, E., Kabarov, V., and Matveev, Y. (2021). Application of Fusion of Various Spontaneous Speech Analytics Methods for Improving Far-Field Neural-Based Diarization. Mathematics, 9. 4. Giannoulis, P., Tsiami, A., Rodomagoulakis, I., Katsamanis, A., Potamianos, G., and Maragos, P. (2014, January 12). The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home. Proceedings of the 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), Nancy, France. 5. What we do–and don’t–know about the Smart Home: An analysis of the Smart Home literature;Solaimani;Indoor Built Environ.,2015
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|