Time-domain processing (two) speech signals

By short time domain processing techniques, correlation characteristics of the speech signal can be obtained. Today, how to take advantage of short-term autocorrelation function of extracting a voice signal pitch.

So, what is pitch it? Each vocal cords open and close a period of time called the pitch or pitch period, which is called the inverse of the fundamental frequency, referred to as the pitch. The length of the pitch and individual vocal cords, thickness, toughness, stiffness and pronunciation habits related, in large part reflects the individual characteristics. In addition, the pitch also with the person's sex, age may be, older men is low (about 50Hz), children and young women is high (about 450Hz). The pitch is mainly used in low bit rate speech coding, speech analysis and synthesis, speech recognition and speaker recognition, occupies a very important position in the field of voice signal.

Short-term autocorrelation function formula:
Here Insert Picture Description
short autocorrelation function has a number of features:
1) when k takes 0, the function maximum value at this time is the short-time autocorrelation function of the signal energy (see previous article) ;
2) If the original signal sequence is a period for the period T, then the autocorrelation function is a periodic function of period T. With this characteristic, the speech signal can be calculated in the pitch.

For chestnut:

Here Insert Picture Description
Here Insert Picture Description
The figure is based on the length of the sampling rate, 44100Hz 0.9 second speech signal frame length is set 1200, a frame shift of 600, taking the red frame (vocal part) of the one, as shown in FIG.
Here Insert Picture Description

FIG certain frame (a) of the vocal portion

Here Insert Picture Description

(B) of the autocorrelation function for the frame

From the chart (b), after removing the first maximum value (0), a maximum value at k = 236, then the frame rate corresponding to the fundamental frequency:

Here Insert Picture Description
In addition, short-time autocorrelation function may also be used for endpoint detection, a judgment is voiced or unvoiced speech and so on. Well, today's content talk so much, see the next issue!

Published 24 original articles · won praise 2 · Views 4138

Guess you like

Origin blog.csdn.net/Leisure_ksj/article/details/104130362