TY - JOUR

T1 - A study on distance metric learning by using different metric matrices in each category

AU - Mikawa, Kenta

AU - Goto, Masayuki

N1 - Copyright:
Copyright 2016 Elsevier B.V., All rights reserved.

PY - 2016

Y1 - 2016

N2 - Distance metric learning is the method of learning the relevant distance metric from a training dataset by considering statistical characteristics. In order to gain a desirable distance metric, the optimization problem under an arbitrary constraint is solved. However, the representative algorithms of distance metric learning need to perform eigendecomposition at each iteration. Therefore, if the dimensions of the input data become large, the computational cost will increase drastically and it is difficult to calculate the optimal solution in a realistic amount of time. In addition, those distance metric learning methods are formulated by assuming a global (unique) metric matrix for the total vector space. Therefore, the global metric matrix cannot take into account the difference in statistical characteristics between each category. To improve those problems, the authors introduce different metric matrices for each category and propose a way to estimate the plural matrices using category information that applies the method of Mochihashi et al. The estimated metric matrices reflect the statistical characteristics of each category. The formulation of classifications by template matching and the k-NN method making effective use of the metric matrices is proposed. To verify the effectiveness of the proposed method, a simulation experiment is conducted using the benchmark data of high-dimensional and low-dimensional input data.

AB - Distance metric learning is the method of learning the relevant distance metric from a training dataset by considering statistical characteristics. In order to gain a desirable distance metric, the optimization problem under an arbitrary constraint is solved. However, the representative algorithms of distance metric learning need to perform eigendecomposition at each iteration. Therefore, if the dimensions of the input data become large, the computational cost will increase drastically and it is difficult to calculate the optimal solution in a realistic amount of time. In addition, those distance metric learning methods are formulated by assuming a global (unique) metric matrix for the total vector space. Therefore, the global metric matrix cannot take into account the difference in statistical characteristics between each category. To improve those problems, the authors introduce different metric matrices for each category and propose a way to estimate the plural matrices using category information that applies the method of Mochihashi et al. The estimated metric matrices reflect the statistical characteristics of each category. The formulation of classifications by template matching and the k-NN method making effective use of the metric matrices is proposed. To verify the effectiveness of the proposed method, a simulation experiment is conducted using the benchmark data of high-dimensional and low-dimensional input data.

KW - Distance metric learning

KW - K-NN

KW - Supervised learning

KW - Vector space model

UR - http://www.scopus.com/inward/record.url?scp=84958819036&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84958819036&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84958819036

VL - 66

SP - 335

EP - 347

JO - Journal of Japan Industrial Management Association

JF - Journal of Japan Industrial Management Association

SN - 0386-4812

IS - 4

ER -