Acceptability for temporal modification of single vowel segments in isolated words

Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka

Research output: Contribution to journalArticle

20 Citations (Scopus)

Abstract

Few perceptual studies of the temporal aspects of speech have investigated the influence of changes in segmental durations in terms of acceptability. Aiming to contribute tO the assessment of rules for assigning segmental durations in speech synthesis, the current study measured the perceptual acceptability of changes in the segmental duration of vowels as a function of the segment attributes or context, such as base duration, temporal position in a word, vowel quality, and voicing of the following segment. Seven listeners estimated the acceptability of word stimuli in which one of the vowels was subjected to a temporal modification from -50 ms (for shortening) to + 50 ms (for lengthening) in 5-ms steps. The temporal modification was applied to vowel segments in 70 word contexts; their durations ranged from 35-145 ms, the mora position in the word was first or third, the vowel quality was /a/ or /i/, and the following segment was a voiced or an unvoiced consonant. The experimental results showed that the listeners' acceptable range of durational modification was narrower for vowels in the first moraic position in the word than for those in the third moraic position. The acceptable range was also narrower for the vowel /a/ than for the vowel /i/, and similarly narrower for vowels followed by unvoiced consonants than for those followed by voiced consonants. The vowel that fell into the least vulnerable class (the third /i/, followed by a voiced consonant) required 140% of the modification of that which fell into the most vulnerable class (the first /a/, followed by an unvoiced consonant) to yield the same acceptability decrement. In contrast, the effect of the original vowel duration on the acceptability of temporal modifications was not significant despite its wide variation (35-145 ms).

Original languageEnglish
Pages (from-to)540-549
Number of pages10
JournalJournal of the Acoustical Society of America
Volume104
Issue number1
DOIs
Publication statusPublished - 1998
Externally publishedYes

Fingerprint

vowels
acceptability
Consonant
Acceptability
Unvoiced
stimuli
Listeners
Vowel Quality

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Cite this

Acceptability for temporal modification of single vowel segments in isolated words. / Kato, Hiroaki; Tsuzaki, Minoru; Sagisaka, Yoshinori.

In: Journal of the Acoustical Society of America, Vol. 104, No. 1, 1998, p. 540-549.

Research output: Contribution to journalArticle

@article{9bb61d35f6724752824ea3776801e0d6,
title = "Acceptability for temporal modification of single vowel segments in isolated words",
abstract = "Few perceptual studies of the temporal aspects of speech have investigated the influence of changes in segmental durations in terms of acceptability. Aiming to contribute tO the assessment of rules for assigning segmental durations in speech synthesis, the current study measured the perceptual acceptability of changes in the segmental duration of vowels as a function of the segment attributes or context, such as base duration, temporal position in a word, vowel quality, and voicing of the following segment. Seven listeners estimated the acceptability of word stimuli in which one of the vowels was subjected to a temporal modification from -50 ms (for shortening) to + 50 ms (for lengthening) in 5-ms steps. The temporal modification was applied to vowel segments in 70 word contexts; their durations ranged from 35-145 ms, the mora position in the word was first or third, the vowel quality was /a/ or /i/, and the following segment was a voiced or an unvoiced consonant. The experimental results showed that the listeners' acceptable range of durational modification was narrower for vowels in the first moraic position in the word than for those in the third moraic position. The acceptable range was also narrower for the vowel /a/ than for the vowel /i/, and similarly narrower for vowels followed by unvoiced consonants than for those followed by voiced consonants. The vowel that fell into the least vulnerable class (the third /i/, followed by a voiced consonant) required 140{\%} of the modification of that which fell into the most vulnerable class (the first /a/, followed by an unvoiced consonant) to yield the same acceptability decrement. In contrast, the effect of the original vowel duration on the acceptability of temporal modifications was not significant despite its wide variation (35-145 ms).",
author = "Hiroaki Kato and Minoru Tsuzaki and Yoshinori Sagisaka",
year = "1998",
doi = "10.1121/1.423301",
language = "English",
volume = "104",
pages = "540--549",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "1",

}

TY - JOUR

T1 - Acceptability for temporal modification of single vowel segments in isolated words

AU - Kato, Hiroaki

AU - Tsuzaki, Minoru

AU - Sagisaka, Yoshinori

PY - 1998

Y1 - 1998

N2 - Few perceptual studies of the temporal aspects of speech have investigated the influence of changes in segmental durations in terms of acceptability. Aiming to contribute tO the assessment of rules for assigning segmental durations in speech synthesis, the current study measured the perceptual acceptability of changes in the segmental duration of vowels as a function of the segment attributes or context, such as base duration, temporal position in a word, vowel quality, and voicing of the following segment. Seven listeners estimated the acceptability of word stimuli in which one of the vowels was subjected to a temporal modification from -50 ms (for shortening) to + 50 ms (for lengthening) in 5-ms steps. The temporal modification was applied to vowel segments in 70 word contexts; their durations ranged from 35-145 ms, the mora position in the word was first or third, the vowel quality was /a/ or /i/, and the following segment was a voiced or an unvoiced consonant. The experimental results showed that the listeners' acceptable range of durational modification was narrower for vowels in the first moraic position in the word than for those in the third moraic position. The acceptable range was also narrower for the vowel /a/ than for the vowel /i/, and similarly narrower for vowels followed by unvoiced consonants than for those followed by voiced consonants. The vowel that fell into the least vulnerable class (the third /i/, followed by a voiced consonant) required 140% of the modification of that which fell into the most vulnerable class (the first /a/, followed by an unvoiced consonant) to yield the same acceptability decrement. In contrast, the effect of the original vowel duration on the acceptability of temporal modifications was not significant despite its wide variation (35-145 ms).

AB - Few perceptual studies of the temporal aspects of speech have investigated the influence of changes in segmental durations in terms of acceptability. Aiming to contribute tO the assessment of rules for assigning segmental durations in speech synthesis, the current study measured the perceptual acceptability of changes in the segmental duration of vowels as a function of the segment attributes or context, such as base duration, temporal position in a word, vowel quality, and voicing of the following segment. Seven listeners estimated the acceptability of word stimuli in which one of the vowels was subjected to a temporal modification from -50 ms (for shortening) to + 50 ms (for lengthening) in 5-ms steps. The temporal modification was applied to vowel segments in 70 word contexts; their durations ranged from 35-145 ms, the mora position in the word was first or third, the vowel quality was /a/ or /i/, and the following segment was a voiced or an unvoiced consonant. The experimental results showed that the listeners' acceptable range of durational modification was narrower for vowels in the first moraic position in the word than for those in the third moraic position. The acceptable range was also narrower for the vowel /a/ than for the vowel /i/, and similarly narrower for vowels followed by unvoiced consonants than for those followed by voiced consonants. The vowel that fell into the least vulnerable class (the third /i/, followed by a voiced consonant) required 140% of the modification of that which fell into the most vulnerable class (the first /a/, followed by an unvoiced consonant) to yield the same acceptability decrement. In contrast, the effect of the original vowel duration on the acceptability of temporal modifications was not significant despite its wide variation (35-145 ms).

UR - http://www.scopus.com/inward/record.url?scp=0031829132&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031829132&partnerID=8YFLogxK

U2 - 10.1121/1.423301

DO - 10.1121/1.423301

M3 - Article

C2 - 9670545

AN - SCOPUS:0031829132

VL - 104

SP - 540

EP - 549

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 1

ER -