Affiliation:
1. School of Computer Science, Civil Aviation Flight University of China, Guanghan 618307, China
Abstract
Air traffic control (ATC) speech currently lacks effective objective methods for evaluating speech quality. This paper proposes a new network framework based on ResNet–BiLSTM to address this gap. First, the mel-spectrogram of the speech signal is segmented with a sliding window. Next, a front-end feature extractor composed of convolutional and pooling layers extracts shallow features from each mel-spectrogram segment. ResNet then extracts spatial features from these shallow features, BiLSTM extracts temporal features, and the two sets of features are horizontally concatenated. Finally, fully connected layers compute the speech quality score from the concatenated spatiotemporal features. We conduct experiments on an air traffic control speech database and compare the objective scores with subjective scores. The results show that the scores produced by the proposed method correlate highly with the mean opinion score (MOS) of air traffic control speech.
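The segmentation and evaluation steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `segment_mel` and `pearson`, the window length, the hop size, and the number of mel bands are all assumptions chosen for illustration, since the abstract does not state them. The sketch also assumes that trailing frames that do not fill a whole window are dropped and that inputs shorter than one window are zero-padded.

```python
import numpy as np

def segment_mel(mel, win, hop):
    """Split a mel-spectrogram of shape (n_mels, n_frames) into
    overlapping segments of `win` frames, advancing by `hop` frames.
    Returns an array of shape (n_segments, n_mels, win)."""
    n_mels, n_frames = mel.shape
    if n_frames < win:
        # Assumption: zero-pad short inputs on the right so that
        # exactly one full segment exists.
        mel = np.pad(mel, ((0, 0), (0, win - n_frames)))
        n_frames = win
    # Assumption: trailing frames that do not fill a window are dropped.
    starts = range(0, n_frames - win + 1, hop)
    return np.stack([mel[:, s:s + win] for s in starts])

def pearson(x, y):
    """Pearson correlation between objective scores and subjective MOS."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc = x - x.mean()
    yc = y - y.mean()
    return float((xc * yc).sum() / np.sqrt((xc ** 2).sum() * (yc ** 2).sum()))

# Hypothetical input: 64 mel bands, 100 frames, 40-frame window, 20-frame hop.
rng = np.random.default_rng(0)
mel = rng.standard_normal((64, 100))
segments = segment_mel(mel, win=40, hop=20)
print(segments.shape)  # (4, 64, 40)
```

Each segment would then be passed through the convolutional front end and the ResNet and BiLSTM branches. `pearson` shows one common way to quantify agreement between predicted scores and MOS; the abstract does not specify which correlation measure the authors use.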
Funder
National Key R&D Program of China
Fundamental Research Funds for the Central Universities
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
3 articles.