Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics

Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Citation (Scopus)

Abstract

This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbral characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbral features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbral features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.
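
The duration manipulation described in the abstract (keep the attack and decay, resize only the steady part) can be illustrated with a minimal sketch. It is not the authors' implementation: the frame-wise amplitude envelope, the hand-supplied segment boundaries `attack_end` and `decay_start`, the function names, and the use of plain linear interpolation are all assumptions made for illustration, since the abstract does not specify how the steady segment is resampled.

```python
from typing import List


def resample_linear(values: List[float], new_len: int) -> List[float]:
    """Linearly resample a sequence to new_len points, keeping its endpoints."""
    if new_len <= 0:
        return []
    if new_len == 1 or len(values) == 1:
        return [values[0]] * new_len
    step = (len(values) - 1) / (new_len - 1)
    out = []
    for i in range(new_len):
        pos = i * step
        lo = min(int(pos), len(values) - 2)  # left neighbour index
        frac = pos - lo                      # distance from left neighbour
        out.append(values[lo] * (1.0 - frac) + values[lo + 1] * frac)
    return out


def change_duration(envelope: List[float], attack_end: int,
                    decay_start: int, new_len: int) -> List[float]:
    """Resize only the steady segment envelope[attack_end:decay_start].

    The attack and decay frames are copied unchanged (hypothetical helper;
    the segment boundaries are assumed to be known in advance).
    """
    attack = envelope[:attack_end]
    steady = envelope[attack_end:decay_start]
    decay = envelope[decay_start:]
    steady_len = new_len - len(attack) - len(decay)
    if not steady or steady_len < 0:
        raise ValueError("target length is too short to keep attack and decay")
    return attack + resample_linear(steady, steady_len) + decay


# Toy envelope: 3 attack frames, 3 steady frames, 2 decay frames.
seed_envelope = [0.0, 0.5, 1.0, 0.8, 0.8, 0.8, 0.4, 0.0]
longer = change_duration(seed_envelope, attack_end=3, decay_start=6, new_len=12)
shorter = change_duration(seed_envelope, attack_end=3, decay_start=6, new_len=6)
print(longer)
print(shorter)
```

In this sketch, both the lengthened and the shortened envelope begin and end with exactly the same frames as the seed, and only the middle segment changes length.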

Original language: English
Title of host publication: Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008
Pages: 249-256
Number of pages: 8
Publication status: Published - 2008
Externally published: Yes
Event: 11th International Conference on Digital Audio Effects, DAFx 2008 - Espoo
Duration: 2008 Sep 1 to 2008 Sep 4

Fingerprint

Musical instruments
Acoustic waves
Seed
Polynomials

ASJC Scopus subject areas

  • Signal Processing

Cite this

Abe, T., Itoyama, K., Yoshii, K., Komatani, K., Ogata, T., & Okuno, H. G. (2008). Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics. In Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008 (pp. 249-256)

@inproceedings{b2f5f62fb73946aeafe5fc2bbdfc620f,
title = "Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics",
abstract = "This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbral characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbral features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbral features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70{\%} and 32.31{\%}, respectively.",
author = "Takehiro Abe and Katsutoshi Itoyama and Kazuyoshi Yoshii and Kazunori Komatani and Tetsuya Ogata and Okuno, {Hiroshi G.}",
year = "2008",
language = "English",
isbn = "9789512295173",
pages = "249--256",
booktitle = "Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008",

}

TY - GEN

T1 - Analysis-and-manipulation approach to pitch and duration of musical instrument sounds without distorting timbral characteristics

AU - Abe, Takehiro

AU - Itoyama, Katsutoshi

AU - Yoshii, Kazuyoshi

AU - Komatani, Kazunori

AU - Ogata, Tetsuya

AU - Okuno, Hiroshi G.

PY - 2008

Y1 - 2008

N2 - This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbral characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbral features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbral features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.

AB - This paper presents an analysis-manipulation method that can generate musical instrument sounds with arbitrary pitches and durations from the sound of a given musical instrument (called seed) without distorting its timbral characteristics. Based on psychoacoustical knowledge of the auditory effects of timbres, we defined timbral features based on the spectrogram of the sound of a musical instrument as (i) the relative amplitudes of the harmonic peaks, (ii) the distribution of the inharmonic component, and (iii) temporal envelopes. First, to analyze the timbral features of a seed, it was separated into harmonic and inharmonic components using Itoyama's integrated model. For pitch manipulation, we took into account the pitch-dependency of features (i) and (ii). We predicted the values of each feature by using a cubic polynomial that approximated the distribution of these features over pitches. To manipulate duration, we focused on preserving feature (iii) in the attack and decay duration of a seed. Therefore, only steady durations were expanded or shrunk. In addition, we propose a method for reproducing the properties of vibrato. Experimental results demonstrated the quality of the synthesized sounds produced using our method. The spectral and MFCC distances between the synthesized sounds and actual sounds of 32 instruments were reduced by 64.70% and 32.31%, respectively.

UR - http://www.scopus.com/inward/record.url?scp=72449125577&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=72449125577&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:72449125577

SN - 9789512295173

SP - 249

EP - 256

BT - Proceedings - 11th International Conference on Digital Audio Effects, DAFx 2008

ER -