Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics

Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

研究成果: Conference contribution

1 引用 (Scopus)

抄録

This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbrai characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbrai features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbrai features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.

元の言語English
ホスト出版物のタイトルProceedings - 11th International Conference on Digital Audio Effects, DAFx 2008
ページ249-256
ページ数8
出版物ステータスPublished - 2008
外部発表Yes
イベント11th International Conference on Digital Audio Effects, DAFx 2008 - Espoo
継続期間: 2008 9 12008 9 4

Other

Other11th International Conference on Digital Audio Effects, DAFx 2008
Espoo
期間08/9/108/9/4

Fingerprint

Musical instruments
Acoustic waves
Seed
Polynomials

ASJC Scopus subject areas

  • Signal Processing

これを引用

Abe, T., Itoyama, K., Yoshii, K., Komatani, K., Ogata, T., & Okuno, H. G. (2008). Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. : Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008 (pp. 249-256)

Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. / Abe, Takehiro; Itoyama, Katsutoshi; Yoshii, Kazuyoshi; Komatani, Kazunori; Ogata, Tetsuya; Okuno, Hiroshi G.

Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008. 2008. p. 249-256.

研究成果: Conference contribution

Abe, T, Itoyama, K, Yoshii, K, Komatani, K, Ogata, T & Okuno, HG 2008, Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. : Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008. pp. 249-256, 11th International Conference on Digital Audio Effects, DAFx 2008, Espoo, 08/9/1.
Abe T, Itoyama K, Yoshii K, Komatani K, Ogata T, Okuno HG. Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. : Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008. 2008. p. 249-256
Abe, Takehiro ; Itoyama, Katsutoshi ; Yoshii, Kazuyoshi ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G. / Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008. 2008. pp. 249-256
@inproceedings{b2f5f62fb73946aeafe5fc2bbdfc620f,
title = "Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics",
abstract = "This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbrai characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbrai features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbrai features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70{\%} and 32.31{\%}, respectively.",
author = "Takehiro Abe and Katsutoshi Itoyama and Kazuyoshi Yoshii and Kazunori Komatani and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2008",
language = "English",
isbn = "9789512295173",
pages = "249--256",
booktitle = "Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008",

}

TY - GEN

T1 - Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics

AU - Abe, Takehiro

AU - Itoyama, Katsutoshi

AU - Yoshii, Kazuyoshi

AU - Komatani, Kazunori

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2008

Y1 - 2008

N2 - This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbrai characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbrai features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbrai features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.

AB - This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbrai characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbrai features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbrai features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.

UR - http://www.scopus.com/inward/record.url?scp=72449125577&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=72449125577&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:72449125577

SN - 9789512295173

SP - 249

EP - 256

BT - Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008

ER -