An approach to blind source separation based on temporal structure of speech signals

Noboru Murata, Shiro Ikeda, Andreas Ziehe

    Research output: Contribution to journalArticle

    347 Citations (Scopus)

    Abstract

    In this paper, we introduce a new technique for blind source separation of speech signals. We focus on the temporal structure of the signals. The idea is to apply the decorrelation method proposed by Molgedey and Schuster in the time-frequency domain. Since we are applying separation algorithm on each frequency separately, we have to solve the amplitude and permutation ambiguity properly to reconstruct the separated signals. For solving the amplitude ambiguity, we use the matrix inversion and for the permutation ambiguity, we introduce a method based on the temporal structure of speech signals. We show some results of experiments with both artificially controlled data and speech data recorded in the real environment.

    Original languageEnglish
    Pages (from-to)1-24
    Number of pages24
    JournalNeurocomputing
    Volume41
    DOIs
    Publication statusPublished - 2001

    Fingerprint

    Blind source separation
    Experiments

    Keywords

    • Blind source separation
    • Convolutive mixtures
    • Time-frequency domain

    ASJC Scopus subject areas

    • Artificial Intelligence
    • Cellular and Molecular Neuroscience

    Cite this

    An approach to blind source separation based on temporal structure of speech signals. / Murata, Noboru; Ikeda, Shiro; Ziehe, Andreas.

    In: Neurocomputing, Vol. 41, 2001, p. 1-24.

    Research output: Contribution to journalArticle

    @article{f410b6aedee140bbb779c62a3d237789,
    title = "An approach to blind source separation based on temporal structure of speech signals",
    abstract = "In this paper, we introduce a new technique for blind source separation of speech signals. We focus on the temporal structure of the signals. The idea is to apply the decorrelation method proposed by Molgedey and Schuster in the time-frequency domain. Since we are applying separation algorithm on each frequency separately, we have to solve the amplitude and permutation ambiguity properly to reconstruct the separated signals. For solving the amplitude ambiguity, we use the matrix inversion and for the permutation ambiguity, we introduce a method based on the temporal structure of speech signals. We show some results of experiments with both artificially controlled data and speech data recorded in the real environment.",
    keywords = "Blind source separation, Convolutive mixtures, Time-frequency domain",
    author = "Noboru Murata and Shiro Ikeda and Andreas Ziehe",
    year = "2001",
    doi = "10.1016/S0925-2312(00)00345-3",
    language = "English",
    volume = "41",
    pages = "1--24",
    journal = "Neurocomputing",
    issn = "0925-2312",
    publisher = "Elsevier",

    }

    TY - JOUR

    T1 - An approach to blind source separation based on temporal structure of speech signals

    AU - Murata, Noboru

    AU - Ikeda, Shiro

    AU - Ziehe, Andreas

    PY - 2001

    Y1 - 2001

    N2 - In this paper, we introduce a new technique for blind source separation of speech signals. We focus on the temporal structure of the signals. The idea is to apply the decorrelation method proposed by Molgedey and Schuster in the time-frequency domain. Since we are applying separation algorithm on each frequency separately, we have to solve the amplitude and permutation ambiguity properly to reconstruct the separated signals. For solving the amplitude ambiguity, we use the matrix inversion and for the permutation ambiguity, we introduce a method based on the temporal structure of speech signals. We show some results of experiments with both artificially controlled data and speech data recorded in the real environment.

    AB - In this paper, we introduce a new technique for blind source separation of speech signals. We focus on the temporal structure of the signals. The idea is to apply the decorrelation method proposed by Molgedey and Schuster in the time-frequency domain. Since we are applying separation algorithm on each frequency separately, we have to solve the amplitude and permutation ambiguity properly to reconstruct the separated signals. For solving the amplitude ambiguity, we use the matrix inversion and for the permutation ambiguity, we introduce a method based on the temporal structure of speech signals. We show some results of experiments with both artificially controlled data and speech data recorded in the real environment.

    KW - Blind source separation

    KW - Convolutive mixtures

    KW - Time-frequency domain

    UR - http://www.scopus.com/inward/record.url?scp=0035659640&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0035659640&partnerID=8YFLogxK

    U2 - 10.1016/S0925-2312(00)00345-3

    DO - 10.1016/S0925-2312(00)00345-3

    M3 - Article

    VL - 41

    SP - 1

    EP - 24

    JO - Neurocomputing

    JF - Neurocomputing

    SN - 0925-2312

    ER -