TY - JOUR
T1 - DISTORTION MEASURES FOR SPEECH PROCESSING.
AU - Gray, Robert M.
AU - Buzo, Andres
AU - Gray, Augustine H.
AU - Matsuyama, Yasuo
PY - 1980/8
Y1 - 1980/8
N2 - Several properties, interrelations, and interpretations were developed for various speech spectral distortion measures. The principal results are 1) the development of notions of relative strength and equivalence of the various distortion measures both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; (2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in classical cluster analysis and in algorithms for optimal quantizer design. It is argued that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.
AB - Several properties, interrelations, and interpretations were developed for various speech spectral distortion measures. The principal results are 1) the development of notions of relative strength and equivalence of the various distortion measures both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; (2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in classical cluster analysis and in algorithms for optimal quantizer design. It is argued that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.
UR - http://www.scopus.com/inward/record.url?scp=0019050955&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0019050955&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:0019050955
VL - ASSP-28
SP - 367
EP - 376
JO - IEEE Transactions on Acoustics, Speech, and Signal Processing
JF - IEEE Transactions on Acoustics, Speech, and Signal Processing
SN - 1053-587X
IS - 4
ER -