Discovering latent country words: A step towards cross-cultural emotional communication

Heeryon Cho, Toru Ishida

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Knowing which concepts are substantial to each country can help enhance emotional communication between two countries. As a concrete example of identifying substantial country concepts, we focus on the task of finding latent country words in cross-cultural texts from two countries. We do this by combining word embedding and tensor decomposition: common words that appear in both countries’ texts are selected; their country-specific word embeddings are learned; a three-way tensor consisting of a word factor, a word embedding factor, and a country factor is constructed; and CANDECOMP/PARAFAC decomposition is performed on the three-way tensor while fixing the country factor values of the decomposed result. We tested our method on a motivating example of finding latent country words in J-pop lyrics from Japan and K-pop lyrics from South Korea. We found that words in J-pop lyrics feature nature-related motifs such as ‘petal’, ‘cloud’, ‘universe’, ‘star’, and ‘sky’, whereas words in K-pop lyrics highlight human-body-related motifs such as ‘style’, ‘shirt’, ‘head’, ‘foot’, and ‘skin’.
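
The decomposition step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a tensor of shape (words, embedding dimensions, countries), a rank of 2, and a country factor fixed to the identity matrix so that each component is tied to one country; the abstract does not state the actual fixed values. The vocabulary, the synthetic data, and the helper names `khatri_rao`, `cp_als_fixed_country`, and `top_words` are all hypothetical, and ranking words by absolute loading is likewise an assumption.

```python
import numpy as np


def khatri_rao(U, V):
    """Column-wise Kronecker product: (I x R), (J x R) -> (I*J x R)."""
    R = U.shape[1]
    return np.einsum("ir,jr->ijr", U, V).reshape(-1, R)


def cp_als_fixed_country(X, C, n_iter=50, seed=0):
    """Alternating least squares for a CP model of X (words x dims x countries).

    The country factor C (countries x rank) is held fixed; only the word
    factor A and the embedding factor B are updated.
    """
    W, D, K = X.shape
    R = C.shape[1]
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((W, R))
    B = rng.standard_normal((D, R))
    X_words = X.reshape(W, D * K)                         # mode-1 unfolding
    X_dims = np.transpose(X, (1, 0, 2)).reshape(D, W * K)  # mode-2 unfolding
    for _ in range(n_iter):
        # Least-squares update of A with B and C fixed, then of B.
        A = X_words @ np.linalg.pinv(khatri_rao(B, C)).T
        B = X_dims @ np.linalg.pinv(khatri_rao(A, C)).T
    return A, B


def top_words(A, vocab, component, k=3):
    """Words with the largest absolute loading on one component.

    Absolute values are used because CP factors are only determined up to
    a joint sign flip of a word column and its embedding column.
    """
    order = np.argsort(-np.abs(A[:, component]))[:k]
    return [vocab[i] for i in order]


# Synthetic stand-in for country-specific embeddings of shared words.
vocab = ["star", "sky", "head", "foot", "love"]  # hypothetical shared words
rng = np.random.default_rng(42)
C = np.eye(2)                                    # assumed fixed country factor
A_true = rng.standard_normal((len(vocab), 2))
B_true = rng.standard_normal((4, 2))
X = np.einsum("wr,dr,kr->wdk", A_true, B_true, C)

A, B = cp_als_fixed_country(X, C)
X_hat = np.einsum("wr,dr,kr->wdk", A, B, C)
rel_err = np.linalg.norm(X - X_hat) / np.linalg.norm(X)
print("relative reconstruction error:", rel_err)
print("component 0 words:", top_words(A, vocab, 0))
print("component 1 words:", top_words(A, vocab, 1))
```

With an identity country factor, each component reduces to a rank-1 approximation of one country's word-by-dimension slice, which is what makes the word loadings attributable to a single country in this sketch. Real embeddings would replace the synthetic tensor `X`.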

Original language: English
Title of host publication: Collaboration Technologies and Social Computing - 25th International Conference, CRIWG+CollabTech 2019, Proceedings
Editors: Hideyuki Nakanishi, Hironori Egi, Irene-Angelica Chounta, Hideyuki Takada, Satoshi Ichimura, Ulrich Hoppe
Publisher: Springer-Verlag
Pages: 232-241
Number of pages: 10
ISBN (Print): 9783030280109
DOI: https://doi.org/10.1007/978-3-030-28011-6_17
Publication status: Published - 2019 Jan 1
Event: 25th International Conference on Collaboration Technologies and Social Computing, CRIWG+CollabTech 2019 - Kyoto, Japan
Duration: 2019 Sep 4 → 2019 Sep 6

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 11677 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 25th International Conference on Collaboration Technologies and Social Computing, CRIWG+CollabTech 2019
Country: Japan
City: Kyoto
Period: 19/9/4 → 19/9/6

Keywords

  • Cross-cultural text analysis
  • Tensor decomposition
  • Word embedding

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Cho, H., & Ishida, T. (2019). Discovering latent country words: A step towards cross-cultural emotional communication. In H. Nakanishi, H. Egi, I-A. Chounta, H. Takada, S. Ichimura, & U. Hoppe (Eds.), Collaboration Technologies and Social Computing - 25th International Conference, CRIWG+CollabTech 2019, Proceedings (pp. 232-241). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11677 LNCS). Springer-Verlag. https://doi.org/10.1007/978-3-030-28011-6_17

@inproceedings{e49159bc42c54825872bcbd7d23aabff,
title = "Discovering latent country words: A step towards cross-cultural emotional communication",
abstract = "Knowing which concepts are substantial to each country can help enhance emotional communication between two countries. As a concrete example of identifying substantial country concepts, we focus on the task of finding latent country words in cross-cultural texts from two countries. We do this by combining word embedding and tensor decomposition: common words that appear in both countries’ texts are selected; their country-specific word embeddings are learned; a three-way tensor consisting of a word factor, a word embedding factor, and a country factor is constructed; and CANDECOMP/PARAFAC decomposition is performed on the three-way tensor while fixing the country factor values of the decomposed result. We tested our method on a motivating example of finding latent country words in J-pop lyrics from Japan and K-pop lyrics from South Korea. We found that words in J-pop lyrics feature nature-related motifs such as ‘petal’, ‘cloud’, ‘universe’, ‘star’, and ‘sky’, whereas words in K-pop lyrics highlight human-body-related motifs such as ‘style’, ‘shirt’, ‘head’, ‘foot’, and ‘skin’.",
keywords = "Cross-cultural text analysis, Tensor decomposition, Word embedding",
author = "Heeryon Cho and Toru Ishida",
year = "2019",
month = "1",
day = "1",
doi = "10.1007/978-3-030-28011-6_17",
language = "English",
isbn = "9783030280109",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer-Verlag",
pages = "232--241",
editor = "Hideyuki Nakanishi and Hironori Egi and Irene-Angelica Chounta and Hideyuki Takada and Satoshi Ichimura and Ulrich Hoppe",
booktitle = "Collaboration Technologies and Social Computing - 25th International Conference, CRIWG+CollabTech 2019, Proceedings",

}

TY - GEN

T1 - Discovering latent country words

T2 - A step towards cross-cultural emotional communication

AU - Cho, Heeryon

AU - Ishida, Toru

PY - 2019/1/1

Y1 - 2019/1/1

AB - Knowing which concepts are substantial to each country can help enhance emotional communication between two countries. As a concrete example of identifying substantial country concepts, we focus on the task of finding latent country words in cross-cultural texts from two countries. We do this by combining word embedding and tensor decomposition: common words that appear in both countries’ texts are selected; their country-specific word embeddings are learned; a three-way tensor consisting of a word factor, a word embedding factor, and a country factor is constructed; and CANDECOMP/PARAFAC decomposition is performed on the three-way tensor while fixing the country factor values of the decomposed result. We tested our method on a motivating example of finding latent country words in J-pop lyrics from Japan and K-pop lyrics from South Korea. We found that words in J-pop lyrics feature nature-related motifs such as ‘petal’, ‘cloud’, ‘universe’, ‘star’, and ‘sky’, whereas words in K-pop lyrics highlight human-body-related motifs such as ‘style’, ‘shirt’, ‘head’, ‘foot’, and ‘skin’.

KW - Cross-cultural text analysis

KW - Tensor decomposition

KW - Word embedding

UR - http://www.scopus.com/inward/record.url?scp=85072863236&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85072863236&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-28011-6_17

DO - 10.1007/978-3-030-28011-6_17

M3 - Conference contribution

AN - SCOPUS:85072863236

SN - 9783030280109

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 232

EP - 241

BT - Collaboration Technologies and Social Computing - 25th International Conference, CRIWG+CollabTech 2019, Proceedings

A2 - Nakanishi, Hideyuki

A2 - Egi, Hironori

A2 - Chounta, Irene-Angelica

A2 - Takada, Hideyuki

A2 - Ichimura, Satoshi

A2 - Hoppe, Ulrich

PB - Springer-Verlag

ER -