Natural quality variable-rate spectral speech coding below 3.0 kbps. Erzin, E., Kumar, A., & Gersho, A. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 1579–1582, April 1997.
We propose new techniques for natural quality variable rate spectral speech coding at an average rate of 2.2 kbps for dialog speech and 2.8 kbps for monolog speech. The coder models the Fourier spectrum of each frame and it builds on recent enhancements to the classical multiband excitation (MBE) approach. New techniques for robust pitch estimation and tracking, for efficient quantization of voiced and unvoiced spectra and encoding of partial phase information are the key features that result in improved quality over earlier spectral vocoders. Subjective performance results are reported which show that the coder is very close in quality to the ITU-T G.723.1 algorithm at 5.3 kbps.
@inproceedings{Erzin1997,
abstract = {We propose new techniques for natural quality variable rate spectral speech coding at an average rate of 2.2 kbps for dialog speech and 2.8 kbps for monolog speech. The coder models the Fourier spectrum of each frame and it builds on recent enhancements to the classical multiband excitation (MBE) approach. New techniques for robust pitch estimation and tracking, for efficient quantization of voiced and unvoiced spectra and encoding of partial phase information are the key features that result in improved quality over earlier spectral vocoders. Subjective performance results are reported which show that the coder is very close in quality to the ITU-T G.723.1 algorithm at 5.3 kbps.},
author = {Erzin, Engin and Kumar, A. and Gersho, A.},
booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing},
isbn = {0-8186-7920-4},
month = apr,
pages = {1579--1582},
title = {{Natural quality variable-rate spectral speech coding below 3.0 kbps}},
year = {1997}
}
