TY - GEN
T1 - Auto-weighted Sequential Wasserstein Distance and Application to Sequence Matching
AU - Horie, Mitsuhiko
AU - Kasai, Hiroyuki
N1 - Funding Information:
ACKNOWLEDGMENT H. Kasai was partially supported by JSPS KAKENHI Grant Numbers JP16K00031 and JP17H01732, and by Support Center for Advanced Telecomm. Technology Research (SCAT).
Publisher Copyright:
© 2022 European Signal Processing Conference, EUSIPCO. All rights reserved.
PY - 2022
Y1 - 2022
N2 - Sequence matching problems have been central to the field of data analysis for decades. Such problems arise in widely diverse areas including computer vision, speech processing, bioinformatics, and natural language processing. However, solving such problems efficiently is difficult because one must consider temporal consistency, neighborhood structure similarity, robustness to noise and outliers, and flexibility on start-end matching points. This paper presents a proposal of a shape-aware Wasserstein distance between sequences building upon optimal transport (OT) framework. The proposed distance considers similarity measures of the elements, their neighborhood structures, and temporal positions. We incorporate these similarity measures into three ground cost matrixes of the OT formulation. The noteworthy contribution is that we formulate these measures as independent OT distances with a single shared optimal transport matrix, and adjust those weights automatically according to their effects on the total OT distance. Numerical evaluations suggest that the sequence matching method using our proposed Wasserstein distance robustly outperforms state-of-the-art methods across different real-world datasets.
AB - Sequence matching problems have been central to the field of data analysis for decades. Such problems arise in widely diverse areas including computer vision, speech processing, bioinformatics, and natural language processing. However, solving such problems efficiently is difficult because one must consider temporal consistency, neighborhood structure similarity, robustness to noise and outliers, and flexibility on start-end matching points. This paper presents a proposal of a shape-aware Wasserstein distance between sequences building upon optimal transport (OT) framework. The proposed distance considers similarity measures of the elements, their neighborhood structures, and temporal positions. We incorporate these similarity measures into three ground cost matrixes of the OT formulation. The noteworthy contribution is that we formulate these measures as independent OT distances with a single shared optimal transport matrix, and adjust those weights automatically according to their effects on the total OT distance. Numerical evaluations suggest that the sequence matching method using our proposed Wasserstein distance robustly outperforms state-of-the-art methods across different real-world datasets.
KW - dynamic time warping
KW - optimal transport
KW - sequence matching
UR - http://www.scopus.com/inward/record.url?scp=85141011802&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141011802&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85141011802
T3 - European Signal Processing Conference
SP - 1472
EP - 1476
BT - 30th European Signal Processing Conference, EUSIPCO 2022 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 30th European Signal Processing Conference, EUSIPCO 2022
Y2 - 29 August 2022 through 2 September 2022
ER -