Speaker adaptation algorithms based on piecewise-moving adaptive segment quantization method

Yoshianao Shiraki, Masaaki Honda

Research output: Contribution to journalArticle

Abstract

A speaker adaptation algorithm based on piecewise movement in the speech spectrum space is proposed for speaker-independent, very-low-bit-rate coding. The universal codebook, which is constructed beforehand and is based on the features of the spectral distribution of an unknown speaker, is adapted to the speaker. The adaptation is based on a process which moves the center of gravity of the universal codebook for several clusters. Two algorithms are proposed for the adaptation, simplified method and a minimum distortion method. The minimum distortion method theoretically ensures that quantization distortion does not increase for the speech data for adaptive training. A coding experiment using spectral segments of actual speech showed that the method has a higher adaptation speed compared than the traditional LBG method. It was also shown that the minimum distortion method converges to the same distortion as the LBG method.

Original languageEnglish
Pages (from-to)58-66
Number of pages9
JournalSystems and Computers in Japan
Volume21
Issue number11
Publication statusPublished - 1990
Externally publishedYes

Fingerprint

Speaker Adaptation
Quantization
Codebook
Coding
Centre of gravity
Spectral Distribution
Gravitation
Converge
Unknown
Experiments

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Hardware and Architecture
  • Information Systems
  • Theoretical Computer Science

Cite this

Speaker adaptation algorithms based on piecewise-moving adaptive segment quantization method. / Shiraki, Yoshianao; Honda, Masaaki.

In: Systems and Computers in Japan, Vol. 21, No. 11, 1990, p. 58-66.

Research output: Contribution to journalArticle

@article{89fced251a94486b97ef41daebdc8aad,
title = "Speaker adaptation algorithms based on piecewise-moving adaptive segment quantization method",
abstract = "A speaker adaptation algorithm based on piecewise movement in the speech spectrum space is proposed for speaker-independent, very-low-bit-rate coding. The universal codebook, which is constructed beforehand and is based on the features of the spectral distribution of an unknown speaker, is adapted to the speaker. The adaptation is based on a process which moves the center of gravity of the universal codebook for several clusters. Two algorithms are proposed for the adaptation, simplified method and a minimum distortion method. The minimum distortion method theoretically ensures that quantization distortion does not increase for the speech data for adaptive training. A coding experiment using spectral segments of actual speech showed that the method has a higher adaptation speed compared than the traditional LBG method. It was also shown that the minimum distortion method converges to the same distortion as the LBG method.",
author = "Yoshianao Shiraki and Masaaki Honda",
year = "1990",
language = "English",
volume = "21",
pages = "58--66",
journal = "Systems and Computers in Japan",
issn = "0882-1666",
publisher = "John Wiley and Sons Inc.",
number = "11",

}

TY - JOUR

T1 - Speaker adaptation algorithms based on piecewise-moving adaptive segment quantization method

AU - Shiraki, Yoshianao

AU - Honda, Masaaki

PY - 1990

Y1 - 1990

N2 - A speaker adaptation algorithm based on piecewise movement in the speech spectrum space is proposed for speaker-independent, very-low-bit-rate coding. The universal codebook, which is constructed beforehand and is based on the features of the spectral distribution of an unknown speaker, is adapted to the speaker. The adaptation is based on a process which moves the center of gravity of the universal codebook for several clusters. Two algorithms are proposed for the adaptation, simplified method and a minimum distortion method. The minimum distortion method theoretically ensures that quantization distortion does not increase for the speech data for adaptive training. A coding experiment using spectral segments of actual speech showed that the method has a higher adaptation speed compared than the traditional LBG method. It was also shown that the minimum distortion method converges to the same distortion as the LBG method.

AB - A speaker adaptation algorithm based on piecewise movement in the speech spectrum space is proposed for speaker-independent, very-low-bit-rate coding. The universal codebook, which is constructed beforehand and is based on the features of the spectral distribution of an unknown speaker, is adapted to the speaker. The adaptation is based on a process which moves the center of gravity of the universal codebook for several clusters. Two algorithms are proposed for the adaptation, simplified method and a minimum distortion method. The minimum distortion method theoretically ensures that quantization distortion does not increase for the speech data for adaptive training. A coding experiment using spectral segments of actual speech showed that the method has a higher adaptation speed compared than the traditional LBG method. It was also shown that the minimum distortion method converges to the same distortion as the LBG method.

UR - http://www.scopus.com/inward/record.url?scp=0025599102&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025599102&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0025599102

VL - 21

SP - 58

EP - 66

JO - Systems and Computers in Japan

JF - Systems and Computers in Japan

SN - 0882-1666

IS - 11

ER -