DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network-Reference-Cited by-同舟云学术

DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network

Published:2019-09-09 Issue: Volume:2019 Page:1-12
ISSN:1058-9244
Container-title:Scientific Programming
language:en
Short-container-title:Scientific Programming

Author:

Alkhawaldeh Rami S.¹^ORCID

Affiliation:

1. Department of Computer Information Systems, The University of Jordan, Aqaba 77110, Jordan

Abstract

The speech entailed in human voice comprises essentially paralinguistic information used in many voice-recognition applications. Gender voice is considered one of the pivotal parts to be detected from a given voice, a task that involves certain complications. In order to distinguish gender from a voice signal, a set of techniques have been employed to determine relevant features to be utilized for building a model from a training set. This model is useful for determining the gender (i.e., male or female) from a voice signal. The contributions are three-fold including (i) providing analysis information about well-known voice signal features using a prominent dataset, (ii) studying various machine learning models of different theoretical families to classify the voice gender, and (iii) using three prominent feature selection algorithms to find promisingly optimal features for improving classification models. The experimental results show the importance of subfeatures over others, which are vital for enhancing the efficiency of classification models’ performance. Experimentation reveals that the best recall value is equal to 99.97%; the best recall value is 99.7% for two models of deep learning (DL) and support vector machine (SVM), and with feature selection, the best recall value is 100% for SVM techniques.

Publisher

Hindawi Limited

Subject

Computer Science Applications,Software

Link

http://downloads.hindawi.com/journals/sp/2019/7213717.pdf

Reference17 articles.

1. Pitch-based gender identification with two-stage classification

2. Automatic speaker age and gender recognition using acoustic and prosodic level information fusion

3. A new approach with score-level fusion for the classification of a speaker age and gender

4. Deep neural network framework and transformed MFCCs for speaker's age and gender classification

Cited by 51 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Gender Recognition Based on the Stacking of Different Acoustic Features;Applied Sciences;2024-07-27

2. Voice-based age, gender, and language recognition based on ResNet deep model and transfer learning in spectro-temporal domain;Neurocomputing;2024-05

3. Identity, Gender, Age, and Emotion Recognition from Speaker Voice with Multi-task Deep Networks for Cognitive Robotics;Cognitive Computation;2024-02-05

4. Automatic Gender Authentication from Arabic Speech Using Hybrid Learning;Journal of Advances in Information Technology;2024

5. Gender Recognition from Speech Signal Using CNN, KNN, SVM and RF;Procedia Computer Science;2024