α-EM algorithm and α-ICA learning based upon extended logarithmic information measures

Yasuo Matsuyama, Takeshi Niimoto, Naoto Katsumata, Yoshitaka Suzuki, Satoshi Furukawa

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    7 Citations (Scopus)

    Abstract

    The α-logarithm extends the logarithm as the special case of α = -1. Usage of α-related information measures based upon this extended logarithm is expected to be effective to speedup of convergence, i.e., on the improvement of learning aptitude. In this paper, two typical cases are investigated. One is the α-EM algorithm (α-Expectation-Maximization algorithm) which is derived from the α-log-likelihood ratio. The other is the α-ICA (α-Independent Component Analysis) which is formulated as minimizing the α-mutual information. In the derivation of both algorithms, the α-divergence plays the main role. For the α-EM algorithm, the reason for the speedup is explained using Hessian and Jacobian matrices for learning. For the α-ICA learning, methods of exploiting the past and future information are presented. Examples are shown on single-loop α-EM's and sample-based α-ICA's. In all cases, effective speedups are observed. Thus, this paper's examples together with formerly reported ones are evidences that the speed improvement by the α-logarithm is a general property beyond individual problems.

    Original languageEnglish
    Title of host publicationProceedings of the International Joint Conference on Neural Networks
    Place of PublicationPiscataway, NJ, United States
    PublisherIEEE
    Pages351-356
    Number of pages6
    Volume3
    Publication statusPublished - 2000
    EventInternational Joint Conference on Neural Networks (IJCNN'2000) - Como, Italy
    Duration: 2000 Jul 242000 Jul 27

    Other

    OtherInternational Joint Conference on Neural Networks (IJCNN'2000)
    CityComo, Italy
    Period00/7/2400/7/27

    ASJC Scopus subject areas

    • Software

    Fingerprint Dive into the research topics of 'α-EM algorithm and α-ICA learning based upon extended logarithmic information measures'. Together they form a unique fingerprint.

    Cite this