Patent Number: 6,311,153

Title: Speech recognition method and apparatus using frequency warping of linear prediction coefficients

Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is auditively important, can be analyzed with a higher frequency resolution as compared with a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.
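Illustrative sketch (not taken from the patent text): the last element of the abstract, a multi-stage quantization device whose vector quantizers are connected in series and use weighting coefficients, can be pictured roughly as follows. Every name here (weighted_sq_error, multistage_vq, reconstruct) and all codebook and weight values are hypothetical, chosen only for illustration. The sketch assumes each stage quantizes the residual left by the previous stage and selects the code vector with the smallest weighted squared error; the patent may define the stages, the distance measure, and the weights differently.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Codebook = List[Vector]


def weighted_sq_error(x: Sequence[float], c: Sequence[float],
                      w: Sequence[float]) -> float:
    """Squared error between x and code vector c, weighted per coefficient."""
    return sum(wi * (xi - ci) ** 2 for xi, ci, wi in zip(x, c, w))


def multistage_vq(x: Sequence[float], codebooks: Sequence[Codebook],
                  weights: Sequence[float]) -> Tuple[List[int], Vector]:
    """Quantize x with vector quantizers connected in series.

    Each stage picks the code vector closest (in the weighted sense) to the
    residual left by the previous stage, then passes the new residual on.
    Returns the chosen index per stage and the final residual.
    """
    residual = list(x)
    indices: List[int] = []
    for codebook in codebooks:
        best = min(
            range(len(codebook)),
            key=lambda i: weighted_sq_error(residual, codebook[i], weights),
        )
        indices.append(best)
        residual = [r - c for r, c in zip(residual, codebook[best])]
    return indices, residual


def reconstruct(indices: Sequence[int],
                codebooks: Sequence[Codebook]) -> Vector:
    """Decoder side: sum the selected code vector from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


if __name__ == "__main__":
    # Toy two-stage codebooks (hypothetical values): coarse, then fine.
    stage1 = [[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]]
    stage2 = [[0.0, 0.0], [0.25, -0.25], [-0.25, 0.25]]
    # A larger weight on the first coefficient stands in for a
    # low-frequency component treated as more important.
    idx, res = multistage_vq([1.2, 0.8], [stage1, stage2], [4.0, 1.0])
    print(idx, reconstruct(idx, [stage1, stage2]), res)
```

In this toy setup the heavier first weight makes errors in the first coefficient cost more during the codebook search, which is one simple way a weighting on frequency could steer quantization toward the bands the abstract describes as more important.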

Inventors: Nakatoh; Yoshihisa (Katano, JP), Norimatsu; Takeshi (Kobe, JP), Tsushima; Mineo (Katano, JP), Ishikawa; Tomokazu (Toyonakashi, JP), Serikawa; Mitsuhiko (Nishinomiya, JP), Katayama; Taro (Toyonaka, JP), Nakahashi; Junichi (Nara, JP), Yagi; Yoriko (Nagaokakyo, JP)

Assignee: Matsushita Electric Industrial Co., Ltd.

International Classification: H04B 1/66 (20060101); G10L 19/00 (20060101); G01L 021/00 ()

Expiration Date: 10/30/2018