Resonant bandwidth estimation of vowels using clustered-line spectrum modeling for pressure speech waveforms

O. Yasojima, Y. Takahashi, M. Tohyama

    Research output: Conference contribution

    2 Citations (Scopus)

    Abstract

    The estimation of resonant frequency bandwidths is a fundamental issue related to the quality of spoken vowels and vocal-tract acoustics. In this article, we analyze bandwidths using clustered line-spectrum modeling (CLSM) of the pressure waveforms of vowels on a cycle-by-cycle basis, with reference to Lx waveforms from an electrolaryngograph recorded simultaneously with the speech signal. We used CLSM to decompose the waveforms into three dominant resonant (modal) oscillations with almost exponentially decaying envelopes. The modal (so-called formant) frequencies were observed over a wide range, from 100 Hz to over 4 kHz. The modal bandwidths were estimated from the decay constants of the modal oscillations and were wider than those reported in the literature for the closed-glottis condition. The bandwidths increased with formant frequency for both male and female speakers. The bandwidths for female speakers, however, were wider and had greater variance than those for male speakers. CLSM, based on the least-squares error criterion in the frequency domain, could effectively represent a single cycle of a vowel record shorter than 10 ms. We thus confirmed that cycle-by-cycle analysis using CLSM is a practical approach to characterizing vowel sounds in terms of dominant frequencies and their modal bandwidths.
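
The abstract states that modal bandwidths were estimated from the decay constants of the modal oscillations. As a rough illustration of that relationship only (this is not the authors' CLSM procedure), the sketch below assumes a single complex mode with envelope exp(-alpha * t), for which the half-power bandwidth is alpha / pi in Hz. The function names, the 700 Hz / 80 Hz mode, and the 16 kHz sampling rate are hypothetical values chosen for the example.

```python
import cmath
import math


def modal_oscillation(freq_hz, bandwidth_hz, fs_hz, n_samples):
    """Complex damped oscillation exp((-alpha + j*2*pi*f) * t), with alpha = pi * B."""
    alpha = math.pi * bandwidth_hz
    return [cmath.exp(complex(-alpha, 2.0 * math.pi * freq_hz) * (n / fs_hz))
            for n in range(n_samples)]


def estimate_decay_constant(samples, fs_hz):
    """Least-squares slope of ln|z[n]| against time; returns alpha in 1/s."""
    t = [n / fs_hz for n in range(len(samples))]
    y = [math.log(abs(z)) for z in samples]
    t_mean = sum(t) / len(t)
    y_mean = sum(y) / len(y)
    num = sum((ti - t_mean) * (yi - y_mean) for ti, yi in zip(t, y))
    den = sum((ti - t_mean) ** 2 for ti in t)
    return -num / den  # the envelope decays, so the fitted slope is -alpha


def bandwidth_from_decay(alpha):
    """Half-power (-3 dB) bandwidth in Hz of a mode with envelope exp(-alpha * t)."""
    return alpha / math.pi


# Hypothetical example: one 700 Hz mode with an 80 Hz bandwidth,
# observed for 128 samples at 16 kHz (8 ms, shorter than one 10 ms cycle).
fs = 16000.0
mode = modal_oscillation(700.0, 80.0, fs, 128)
alpha_hat = estimate_decay_constant(mode, fs)
bandwidth_hat = bandwidth_from_decay(alpha_hat)
print(round(bandwidth_hat, 3))
```

With a noiseless single mode the log-envelope is exactly linear, so the fit recovers the bandwidth used to synthesize it. Real vowel cycles contain several overlapping modes, which is why a decomposition step such as CLSM is needed before any per-mode decay constant can be read off.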

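    Original language: English
    Title of host publication: Sixth IEEE International Symposium on Signal Processing and Information Technology, ISSPIT
    Pages: 589-593
    Number of pages: 5
    DOI: https://doi.org/10.1109/ISSPIT.2006.270869
    Publication status: Published - 2007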

    Fingerprint

    Bandwidth
    Natural frequencies
    Acoustics
    Acoustic waves

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Signal Processing
    • Software

    Cite this

    Yasojima, O., Takahashi, Y., & Tohyama, M. (2007). Resonant bandwidth estimation of vowels using clustered-line spectrum modeling for pressure speech waveforms. In Sixth IEEE International Symposium on Signal Processing and Information Technology, ISSPIT (pp. 589-593). [4042311] https://doi.org/10.1109/ISSPIT.2006.270869

    @inproceedings{7e47ee2f826f487d94f0f49f32e336a8,
    title = "Resonant bandwidth estimation of vowels using clustered-line spectrum modeling for pressure speech waveforms",
    abstract = "The estimation of resonant frequency bandwidths is a fundamental issue related to the quality of spoken vowels and vocal-tract acoustics. In this article, we discuss our analysis of bandwidths using clustered line-spectrum modeling (CLSM) of the pressure waveforms of vowels on a cycle-by-cycle basis with reference to Lx waveforms from an electrolaryngograph recorded at the same time as the speech signal. We used CLSM to decompose the waveforms into three dominant resonant (modal) oscillations with almost exponentially decaying envelopes. The modal (so-called formant) frequencies were observed in a wide frequency range from 100 (Hz) to over 4 (kHz). The modal bandwidths were estimated from the decaying constants of the modal oscillations and were wider than those reported in the literature under the closed glottis condition. The bandwidths increased for both male and female speakers as the formant frequencies became higher. The bandwidths for females, however, were wider with greater variances than those for males. We could effectively represent a cycle of a vowel record shorter than 10 (ms) by CLSM based on the least squares error criterion in the frequency domain. We thus confirmed that cycle-by-cycle analysis using CLSM is a practical approach to characterizing vowel sounds in terms of dominant frequencies using their modal bandwidths.",
    author = "O. Yasojima and Y. Takahashi and M. Tohyama",
    year = "2007",
    doi = "10.1109/ISSPIT.2006.270869",
    language = "English",
    isbn = "0780397541",
    pages = "589--593",
    booktitle = "Sixth IEEE International Symposium on Signal Processing and Information Technology, ISSPIT",

    }

    TY - GEN

    T1 - Resonant bandwidth estimation of vowels using clustered-line spectrum modeling for pressure speech waveforms

    AU - Yasojima, O.

    AU - Takahashi, Y.

    AU - Tohyama, M.

    PY - 2007

    Y1 - 2007

    AB - The estimation of resonant frequency bandwidths is a fundamental issue related to the quality of spoken vowels and vocal-tract acoustics. In this article, we discuss our analysis of bandwidths using clustered line-spectrum modeling (CLSM) of the pressure waveforms of vowels on a cycle-by-cycle basis with reference to Lx waveforms from an electrolaryngograph recorded at the same time as the speech signal. We used CLSM to decompose the waveforms into three dominant resonant (modal) oscillations with almost exponentially decaying envelopes. The modal (so-called formant) frequencies were observed in a wide frequency range from 100 (Hz) to over 4 (kHz). The modal bandwidths were estimated from the decaying constants of the modal oscillations and were wider than those reported in the literature under the closed glottis condition. The bandwidths increased for both male and female speakers as the formant frequencies became higher. The bandwidths for females, however, were wider with greater variances than those for males. We could effectively represent a cycle of a vowel record shorter than 10 (ms) by CLSM based on the least squares error criterion in the frequency domain. We thus confirmed that cycle-by-cycle analysis using CLSM is a practical approach to characterizing vowel sounds in terms of dominant frequencies using their modal bandwidths.

    UR - http://www.scopus.com/inward/record.url?scp=44449134240&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=44449134240&partnerID=8YFLogxK

    U2 - 10.1109/ISSPIT.2006.270869

    DO - 10.1109/ISSPIT.2006.270869

    M3 - Conference contribution

    AN - SCOPUS:44449134240

    SN - 0780397541

    SN - 9780780397545

    SP - 589

    EP - 593

    BT - Sixth IEEE International Symposium on Signal Processing and Information Technology, ISSPIT

    ER -