A Time Domain Estimation Algorithm for Speech Signal Pitch Period

Author:

Wu Shuxing,Li Tiansong,Zhang Xiuqin

Abstract

Abstract In speech recognition and speech synthesis, accurate estimation of the pitch period is an important part of speech processing. The traditional direct peak estimation method and the autocorrelation function method are both effective time domain estimation algorithms. The autocorrelation method is a pitch period estimation algorithm suitable for low SNR. Both algorithms need to get accurate peak position estimation. In this paper, a multi-line cut method which is a method for judging the position of the peak point is proposed. The multi-line cut method is used to intercept the sampled data of the waveform by using multiple cut lines. The median value is calculated by the starting and ending points of the cut line position, and the peak position is indirectly evaluated. By minimizing the impact of interference on the peak estimate, the likelihood of falling into local extreme points is reduced, therefore a more accurate peak point estimate than the direct search for peak points can be obtained. The simulation results show that compared with the traditional direct peak estimation method, the performance of peak estimation by the multi-line cut method can be greatly improved, and the multi-line cut method can be used to estimate the peak value in the autocorrelation method, and also achieve a certain performance improvement. In addition, the number of cut lines is directly related to performance, and the more the number is, the better the performance is. The complexity of this method is not high and easy to implement.

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Reference10 articles.

1. Speech pitch detection using short-time energy;Swee,2010

2. A comparative performance study of several pitch detection algorithms;Rabiner;IEEE Transactions on Acoustics, Speech and Signal Processing,1976

3. Pitch period estimation base on voiced degree weighted sub-frame octave region dynamic programming;Zeng,2010

4. An autocorrelation pitch detector and voicing decision with confidence measures developed for noise-corrupted speech;Krubsack;IEEE Transactions on Signal Processing,1991

5. Average magnitude difference function pitch extractor;Ross;IEEE Transactions on Acoustics, Speech and Signal Processing,1974

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3