Adaptive bit allocation scheme in predictive coding of speech

Masaaki Honda, Nobuhiko Kitawaki, Fumitada Itakura

Research output: Contribution to journalConference article

10 Citations (Scopus)

Abstract

An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

Original languageEnglish
Article number1171495
Pages (from-to)1672-1675
Number of pages4
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1982-May
DOIs
Publication statusPublished - 1982 Jan 1
Externally publishedYes
Event1982 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1982 - Paris, France
Duration: 1982 May 31982 May 5

Fingerprint

Pulse code modulation
Mean square error
Frequency bands
Redundancy

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Adaptive bit allocation scheme in predictive coding of speech. / Honda, Masaaki; Kitawaki, Nobuhiko; Itakura, Fumitada.

In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1982-May, 1171495, 01.01.1982, p. 1672-1675.

Research output: Contribution to journalConference article

@article{b05f7485617b4ffd9431cd68ae9e871f,
title = "Adaptive bit allocation scheme in predictive coding of speech",
abstract = "An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.",
author = "Masaaki Honda and Nobuhiko Kitawaki and Fumitada Itakura",
year = "1982",
month = "1",
day = "1",
doi = "10.1109/ICASSP.1982.1171495",
language = "English",
volume = "1982-May",
pages = "1672--1675",
journal = "Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
issn = "0736-7791",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Adaptive bit allocation scheme in predictive coding of speech

AU - Honda, Masaaki

AU - Kitawaki, Nobuhiko

AU - Itakura, Fumitada

PY - 1982/1/1

Y1 - 1982/1/1

N2 - An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

AB - An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

UR - http://www.scopus.com/inward/record.url?scp=84989413545&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84989413545&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.1982.1171495

DO - 10.1109/ICASSP.1982.1171495

M3 - Conference article

VL - 1982-May

SP - 1672

EP - 1675

JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

SN - 0736-7791

M1 - 1171495

ER -