Noise post-processing for low bit-rate CELP coders

Hiroyuki Ehara, Kazutoshi Yasunaga, Koji Yoshida, Yusuke Hiwasaki, Kazunori Mano, Takao Kaneko

Research output: Contribution to journalArticle

Abstract

This paper presents a newly developed noise post-processing (NPP) algorithm and the results of several tests demonstrating its subjective performance. This NPP algorithm is designed to improve the subjective performance of low bit-rate code excited linear prediction (CELP) decoding under background noise conditions. The NPP algorithm is based on a stationary noise generator and improves the subjective quality of noisy signal input. A backward adaptive detector defines noisy input signal frames from decoded LSF, energy, and pitch parameters. The noise generator estimates and produces stationary noise signals using past line spectral frequency (LSF) and energy parameters. The stationary noise generator has a frame erasure concealment (FEC) scheme designed for stationary noise signals and therefore improves the speech decoder's robustness for frame erasure under background noise conditions. The algorithm has been applied to the following CELP decoders: 1) a candidate algorithm of the ITU-T 4-kbit/s speech coding standard and 2) existing ITU-T standards, the G.729 and G.723.1 series. In both cases, NPP improved the subjective performance of the baseline decoders. Improvements of approximately 0.25 CMOS (CCR MOS: comparison category rating mean opinion score) and around 0.2-0.8 DMOS (DCR MOS: degradation category rating mean opinion score) were demonstrated in the results of our subjective tests when applied to the 4-kbit/s decoder and G.729/G.723.1 decoders respectively. Other test results show that NPP improves the subjective performance of a G.729 decoder by around 0.45 in DMOS under both error-free and frame-erasure conditions, and a further improvement of around 0.2 DMOS is achieved by the FEC scheme in the noise generator.

Original languageEnglish
Pages (from-to)1507-1516
Number of pages10
JournalIEICE Transactions on Information and Systems
VolumeE87-D
Issue number6
Publication statusPublished - 2004 Jun
Externally publishedYes

Fingerprint

Noise generators
Processing
Speech coding
Decoding
Detectors
Degradation

Keywords

  • Background noise
  • CELP
  • Frame erasure
  • ITU-T
  • Post-processing

ASJC Scopus subject areas

  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

Ehara, H., Yasunaga, K., Yoshida, K., Hiwasaki, Y., Mano, K., & Kaneko, T. (2004). Noise post-processing for low bit-rate CELP coders. IEICE Transactions on Information and Systems, E87-D(6), 1507-1516.

Noise post-processing for low bit-rate CELP coders. / Ehara, Hiroyuki; Yasunaga, Kazutoshi; Yoshida, Koji; Hiwasaki, Yusuke; Mano, Kazunori; Kaneko, Takao.

In: IEICE Transactions on Information and Systems, Vol. E87-D, No. 6, 06.2004, p. 1507-1516.

Research output: Contribution to journalArticle

Ehara, H, Yasunaga, K, Yoshida, K, Hiwasaki, Y, Mano, K & Kaneko, T 2004, 'Noise post-processing for low bit-rate CELP coders', IEICE Transactions on Information and Systems, vol. E87-D, no. 6, pp. 1507-1516.
Ehara H, Yasunaga K, Yoshida K, Hiwasaki Y, Mano K, Kaneko T. Noise post-processing for low bit-rate CELP coders. IEICE Transactions on Information and Systems. 2004 Jun;E87-D(6):1507-1516.
Ehara, Hiroyuki ; Yasunaga, Kazutoshi ; Yoshida, Koji ; Hiwasaki, Yusuke ; Mano, Kazunori ; Kaneko, Takao. / Noise post-processing for low bit-rate CELP coders. In: IEICE Transactions on Information and Systems. 2004 ; Vol. E87-D, No. 6. pp. 1507-1516.
@article{c75168829f9c44d7883a878b746da613,
title = "Noise post-processing for low bit-rate CELP coders",
abstract = "This paper presents a newly developed noise post-processing (NPP) algorithm and the results of several tests demonstrating its subjective performance. This NPP algorithm is designed to improve the subjective performance of low bit-rate code excited linear prediction (CELP) decoding under background noise conditions. The NPP algorithm is based on a stationary noise generator and improves the subjective quality of noisy signal input. A backward adaptive detector defines noisy input signal frames from decoded LSF, energy, and pitch parameters. The noise generator estimates and produces stationary noise signals using past line spectral frequency (LSF) and energy parameters. The stationary noise generator has a frame erasure concealment (FEC) scheme designed for stationary noise signals and therefore improves the speech decoder's robustness for frame erasure under background noise conditions. The algorithm has been applied to the following CELP decoders: 1) a candidate algorithm of the ITU-T 4-kbit/s speech coding standard and 2) existing ITU-T standards, the G.729 and G.723.1 series. In both cases, NPP improved the subjective performance of the baseline decoders. Improvements of approximately 0.25 CMOS (CCR MOS: comparison category rating mean opinion score) and around 0.2-0.8 DMOS (DCR MOS: degradation category rating mean opinion score) were demonstrated in the results of our subjective tests when applied to the 4-kbit/s decoder and G.729/G.723.1 decoders respectively. Other test results show that NPP improves the subjective performance of a G.729 decoder by around 0.45 in DMOS under both error-free and frame-erasure conditions, and a further improvement of around 0.2 DMOS is achieved by the FEC scheme in the noise generator.",
keywords = "Background noise, CELP, Frame erasure, ITU-T, Post-processing",
author = "Hiroyuki Ehara and Kazutoshi Yasunaga and Koji Yoshida and Yusuke Hiwasaki and Kazunori Mano and Takao Kaneko",
year = "2004",
month = "6",
language = "English",
volume = "E87-D",
pages = "1507--1516",
journal = "IEICE Transactions on Information and Systems",
issn = "0916-8532",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "6",

}

TY - JOUR

T1 - Noise post-processing for low bit-rate CELP coders

AU - Ehara, Hiroyuki

AU - Yasunaga, Kazutoshi

AU - Yoshida, Koji

AU - Hiwasaki, Yusuke

AU - Mano, Kazunori

AU - Kaneko, Takao

PY - 2004/6

Y1 - 2004/6

N2 - This paper presents a newly developed noise post-processing (NPP) algorithm and the results of several tests demonstrating its subjective performance. This NPP algorithm is designed to improve the subjective performance of low bit-rate code excited linear prediction (CELP) decoding under background noise conditions. The NPP algorithm is based on a stationary noise generator and improves the subjective quality of noisy signal input. A backward adaptive detector defines noisy input signal frames from decoded LSF, energy, and pitch parameters. The noise generator estimates and produces stationary noise signals using past line spectral frequency (LSF) and energy parameters. The stationary noise generator has a frame erasure concealment (FEC) scheme designed for stationary noise signals and therefore improves the speech decoder's robustness for frame erasure under background noise conditions. The algorithm has been applied to the following CELP decoders: 1) a candidate algorithm of the ITU-T 4-kbit/s speech coding standard and 2) existing ITU-T standards, the G.729 and G.723.1 series. In both cases, NPP improved the subjective performance of the baseline decoders. Improvements of approximately 0.25 CMOS (CCR MOS: comparison category rating mean opinion score) and around 0.2-0.8 DMOS (DCR MOS: degradation category rating mean opinion score) were demonstrated in the results of our subjective tests when applied to the 4-kbit/s decoder and G.729/G.723.1 decoders respectively. Other test results show that NPP improves the subjective performance of a G.729 decoder by around 0.45 in DMOS under both error-free and frame-erasure conditions, and a further improvement of around 0.2 DMOS is achieved by the FEC scheme in the noise generator.

AB - This paper presents a newly developed noise post-processing (NPP) algorithm and the results of several tests demonstrating its subjective performance. This NPP algorithm is designed to improve the subjective performance of low bit-rate code excited linear prediction (CELP) decoding under background noise conditions. The NPP algorithm is based on a stationary noise generator and improves the subjective quality of noisy signal input. A backward adaptive detector defines noisy input signal frames from decoded LSF, energy, and pitch parameters. The noise generator estimates and produces stationary noise signals using past line spectral frequency (LSF) and energy parameters. The stationary noise generator has a frame erasure concealment (FEC) scheme designed for stationary noise signals and therefore improves the speech decoder's robustness for frame erasure under background noise conditions. The algorithm has been applied to the following CELP decoders: 1) a candidate algorithm of the ITU-T 4-kbit/s speech coding standard and 2) existing ITU-T standards, the G.729 and G.723.1 series. In both cases, NPP improved the subjective performance of the baseline decoders. Improvements of approximately 0.25 CMOS (CCR MOS: comparison category rating mean opinion score) and around 0.2-0.8 DMOS (DCR MOS: degradation category rating mean opinion score) were demonstrated in the results of our subjective tests when applied to the 4-kbit/s decoder and G.729/G.723.1 decoders respectively. Other test results show that NPP improves the subjective performance of a G.729 decoder by around 0.45 in DMOS under both error-free and frame-erasure conditions, and a further improvement of around 0.2 DMOS is achieved by the FEC scheme in the noise generator.

KW - Background noise

KW - CELP

KW - Frame erasure

KW - ITU-T

KW - Post-processing

UR - http://www.scopus.com/inward/record.url?scp=3042644656&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3042644656&partnerID=8YFLogxK

M3 - Article

VL - E87-D

SP - 1507

EP - 1516

JO - IEICE Transactions on Information and Systems

JF - IEICE Transactions on Information and Systems

SN - 0916-8532

IS - 6

ER -