BIT ALLOCATION IN TIME AND FREQUENCY DOMAINS FOR PREDICTIVE CODING OF SPEECH.

Masaaki Honda, Fumitada Itakura

Research output: Contribution to journalArticle

25 Citations (Scopus)

Abstract

Adaptive predictive coding with dynamic bit allocation is presented for speech encoding at low to medium bit rates (6. 4 kbits/s to 16 kbits/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed to remove the redundancies due to a periodic concentration of the prediction residual energy as well as the nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated, both over the subbands (in the frequency domain) and over the subintervals (in the time domain), in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform. The SNR gain is presented as the sum of the spectral SNR gain G//f, equivalent to the prediction gain, and the temporal SNR gain G//t.

Original languageEnglish
Pages (from-to)465-473
Number of pages9
JournalIEEE Transactions on Acoustics, Speech, and Signal Processing
VolumeASSP-32
Issue number3
Publication statusPublished - 1984 Jun
Externally publishedYes

Fingerprint

Mean square error
Redundancy

ASJC Scopus subject areas

  • Signal Processing

Cite this

BIT ALLOCATION IN TIME AND FREQUENCY DOMAINS FOR PREDICTIVE CODING OF SPEECH. / Honda, Masaaki; Itakura, Fumitada.

In: IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. ASSP-32, No. 3, 06.1984, p. 465-473.

Research output: Contribution to journalArticle

@article{725424cb3a814151891894e83edee125,
title = "BIT ALLOCATION IN TIME AND FREQUENCY DOMAINS FOR PREDICTIVE CODING OF SPEECH.",
abstract = "Adaptive predictive coding with dynamic bit allocation is presented for speech encoding at low to medium bit rates (6. 4 kbits/s to 16 kbits/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed to remove the redundancies due to a periodic concentration of the prediction residual energy as well as the nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated, both over the subbands (in the frequency domain) and over the subintervals (in the time domain), in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform. The SNR gain is presented as the sum of the spectral SNR gain G//f, equivalent to the prediction gain, and the temporal SNR gain G//t.",
author = "Masaaki Honda and Fumitada Itakura",
year = "1984",
month = "6",
language = "English",
volume = "ASSP-32",
pages = "465--473",
journal = "IEEE Transactions on Signal Processing",
issn = "1053-587X",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "3",

}

TY - JOUR

T1 - BIT ALLOCATION IN TIME AND FREQUENCY DOMAINS FOR PREDICTIVE CODING OF SPEECH.

AU - Honda, Masaaki

AU - Itakura, Fumitada

PY - 1984/6

Y1 - 1984/6

N2 - Adaptive predictive coding with dynamic bit allocation is presented for speech encoding at low to medium bit rates (6. 4 kbits/s to 16 kbits/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed to remove the redundancies due to a periodic concentration of the prediction residual energy as well as the nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated, both over the subbands (in the frequency domain) and over the subintervals (in the time domain), in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform. The SNR gain is presented as the sum of the spectral SNR gain G//f, equivalent to the prediction gain, and the temporal SNR gain G//t.

AB - Adaptive predictive coding with dynamic bit allocation is presented for speech encoding at low to medium bit rates (6. 4 kbits/s to 16 kbits/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed to remove the redundancies due to a periodic concentration of the prediction residual energy as well as the nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated, both over the subbands (in the frequency domain) and over the subintervals (in the time domain), in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform. The SNR gain is presented as the sum of the spectral SNR gain G//f, equivalent to the prediction gain, and the temporal SNR gain G//t.

UR - http://www.scopus.com/inward/record.url?scp=0021445198&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0021445198&partnerID=8YFLogxK

M3 - Article

VL - ASSP-32

SP - 465

EP - 473

JO - IEEE Transactions on Signal Processing

JF - IEEE Transactions on Signal Processing

SN - 1053-587X

IS - 3

ER -