Rectified linear unit can assist Griffin-Lim phase recovery

Kohei Yatabe, Yoshiki Masuyama, Yasuhiro Oikawa

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

    7 Citations (Scopus)

    Abstract

    Phase recovery is an essential process for reconstructing a time-domain signal from the corresponding spectrogram when its phase is contaminated or unavailable. Recently, a phase recovery method using a deep neural network (DNN) was proposed, which interested us because the inverse short-time Fourier transform (inverse STFT) was utilized within the network. This inverse STFT converts a spectrogram into its time-domain counterpart, and then the activation function, a leaky rectified linear unit (ReLU), is applied. Such a nonlinear operation in the time domain resembles the speech enhancement method called harmonic regeneration noise reduction (HRNR). In HRNR, a time-domain nonlinearity, typically ReLU, is applied to assist in enhancing the higher-order harmonics. From this point of view, one question arose: can a time-domain ReLU alone assist phase recovery? Inspired by this curious connection between the recent DNN-based phase recovery method and HRNR in speech enhancement, this paper proposes the ReLU-assisted Griffin-Lim algorithm to investigate the above question. In an experiment on speech denoising with the oracle Wiener filter, some positive effect of the time-domain nonlinearity is confirmed in terms of short-time objective intelligibility (STOI) scores.
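
The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' algorithm: the abstract does not say how the ReLU is combined with the Griffin-Lim projections, so the placement below (a leaky ReLU applied to the time-domain signal right after each inverse STFT) is an assumption. The helper names `stft`, `istft`, `griffin_lim`, and the parameter `negative_slope` are hypothetical, and the transforms are bare-bones NumPy versions written only for this example.

```python
import numpy as np

def stft(x, win, hop):
    """Windowed frames of x, transformed with a real FFT (frames x bins)."""
    n = len(win)
    n_frames = 1 + (len(x) - n) // hop
    frames = np.stack([x[i * hop:i * hop + n] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(X, win, hop, length):
    """Weighted overlap-add inverse, normalized by the summed squared window."""
    n = len(win)
    frames = np.fft.irfft(X, n=n, axis=1) * win
    y = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        y[i * hop:i * hop + n] += frame
        norm[i * hop:i * hop + n] += win ** 2
    # Samples with (near-)zero window coverage are left at zero.
    return y / np.maximum(norm, 1e-8)

def griffin_lim(mag, win, hop, length, n_iter=50, negative_slope=1.0, seed=0):
    """Griffin-Lim iteration with an optional time-domain leaky ReLU.

    negative_slope=1.0 is the identity, i.e. plain Griffin-Lim.
    negative_slope=0.0 is a plain ReLU. Where the nonlinearity is applied
    is an assumption of this sketch, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    # Start from the given magnitude with random phase.
    X = mag * np.exp(2j * np.pi * rng.random(mag.shape))
    for _ in range(n_iter):
        x = istft(X, win, hop, length)                 # back to time domain
        x = np.where(x >= 0, x, negative_slope * x)    # leaky ReLU (assumed step)
        Y = stft(x, win, hop)                          # consistent spectrogram
        X = mag * np.exp(1j * np.angle(Y))             # restore known magnitude
    return istft(X, win, hop, length)
```

Calling `griffin_lim(mag, win, hop, length)` with the default `negative_slope=1.0` gives the unmodified baseline, so the effect of the nonlinearity can be compared by changing that single argument.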

    Original language: English
    Title of host publication: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings
    Publisher: Institute of Electrical and Electronics Engineers Inc.
    Pages: 555-559
    Number of pages: 5
    ISBN (Electronic): 9781538681510
    DOI: https://doi.org/10.1109/IWAENC.2018.8521304
    Publication status: Published - 2018 Nov 2
    Event: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Tokyo, Japan
    Duration: 2018 Sep 17 to 2018 Sep 20

    Other

    Other: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
    Country: Japan
    City: Tokyo
    Period: 18/9/17 to 18/9/20

    Keywords

    • Consistency
    • Harmonic regeneration
    • Redundancy
    • Spectrogram
    • Time-domain nonlinearity

    ASJC Scopus subject areas

    • Signal Processing
    • Acoustics and Ultrasonics


  • Cite this

    Yatabe, K., Masuyama, Y., & Oikawa, Y. (2018). Rectified linear unit can assist Griffin-Lim phase recovery. In 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings (pp. 555-559). [8521304] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IWAENC.2018.8521304