Archives of Acoustics, 31, 4(S), pp. 183-188, 2006

High quality speech codec employing sines + noise + transients model

Maciej Kulesza
Gdańsk University of Technology, Multimedia Systems Department, Narutowicza 11/12, 80-952 Gdańsk
Poland

Ł. Litwic
Gdańsk University of Technology, Multimedia Systems Department, Narutowicza 11/12, 80-952 Gdańsk
Poland

G. Szwoch
Gdańsk University of Technology, Multimedia Systems Department, Narutowicza 11/12, 80-952 Gdańsk
Poland

A. Czyżewski
Gdańsk University of Technology, Multimedia Systems Department, Narutowicza 11/12, 80-952 Gdańsk
Poland

A method of high quality wideband speech signal representation employing sines+transients+
noise model is presented. The need for a wideband speech coding approach as well
as various methods for analysis and synthesis of sines, residual and transient states of speech
signal is discussed. The perceptual criterion is applied in the proposed approach during encoding
of sines amplitudes in order to reduce bandwidth requirements and to preserve high
quality of speech. Therefore, the psychoacoustic model devised for perceptual speech coding
is presented. The experimental results reveal that method for tonality estimation employed
in the psychoacoustic model has a significant impact on perceptual coding accuracy. Various
methods for tonality estimation are presented and compared.
Keywords: speech coding, sines+noise+transients model, VoIP telephony.
Full Text: PDF
Copyright © Polish Academy of Sciences & Institute of Fundamental Technological Research (IPPT PAN).