Adaptive bit allocation scheme in predictive coding of speech

Masaaki Honda, Nobuhiko Kitawaki, Fumitada Itakura

研究成果: Conference article

10 引用 (Scopus)

抄録

An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

元の言語English
記事番号1171495
ページ(範囲)1672-1675
ページ数4
ジャーナルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
1982-May
DOI
出版物ステータスPublished - 1982 1 1
外部発表Yes
イベント1982 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1982 - Paris, France
継続期間: 1982 5 31982 5 5

Fingerprint

Pulse code modulation
Mean square error
Frequency bands
Redundancy

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

これを引用

Adaptive bit allocation scheme in predictive coding of speech. / Honda, Masaaki; Kitawaki, Nobuhiko; Itakura, Fumitada.

:: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 巻 1982-May, 1171495, 01.01.1982, p. 1672-1675.

研究成果: Conference article

@article{b05f7485617b4ffd9431cd68ae9e871f,
title = "Adaptive bit allocation scheme in predictive coding of speech",
abstract = "An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.",
author = "Masaaki Honda and Nobuhiko Kitawaki and Fumitada Itakura",
year = "1982",
month = "1",
day = "1",
doi = "10.1109/ICASSP.1982.1171495",
language = "English",
volume = "1982-May",
pages = "1672--1675",
journal = "Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing",
issn = "0736-7791",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Adaptive bit allocation scheme in predictive coding of speech

AU - Honda, Masaaki

AU - Kitawaki, Nobuhiko

AU - Itakura, Fumitada

PY - 1982/1/1

Y1 - 1982/1/1

N2 - An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

AB - An adaptive predictive coding with adaptive bit allocation (APC-AB) is presented for speech encoding at low to medium bit rates (6.4kb/s-24kb/s). In this system, a split-band predictive coding scheme and a bit allocation scheme are employed in order to remove the redundancies due to a periodic concentration of the prediction residual energy as well as nonuniform nature of the speech spectrum. Quantization bits are dynamically allocated both over the sub-bands(frequency domain) and over the subintervals(time domain) in accordance with the distribution of the residual energies in the time-frequency domain. Optimum bit allocation is derived based on the mean square error criterion on the speech waveform, and the SNR gain is presented in relation to the prediction gain of the full-band signal. This system is evaluated in terms of the segmental SNR and speech quality. The result shows that the APC-AB system has advantage over the conventional full-band APC system in the segmental SNR and the stability of the prediction loop. It was also shown that this system can provide speech quality subjectively equivalent to 7 bit Log-PCM at 16 kb/s, and to 6 bit Log-PCM at 9.6 kb/s.

UR - http://www.scopus.com/inward/record.url?scp=84989413545&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84989413545&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.1982.1171495

DO - 10.1109/ICASSP.1982.1171495

M3 - Conference article

AN - SCOPUS:84989413545

VL - 1982-May

SP - 1672

EP - 1675

JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing

SN - 0736-7791

M1 - 1171495

ER -