Hashtag sense induction based on co-occurrence graphs

Mengmeng Wang, Mizuho Iwaihara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Twitter hashtags are used to categorize tweets for improving search categorizing topic. But the fact that people can create and use hashtags freely leads to a situation such that one hashtag may have multiple senses. In this paper, we propose a method to induce senses of a hashtag in a particular time frame. Our assumption is that for a sense of a hashtag the context words around it are similar. Then we design a method that uses a co-occurrence graph and community detection algorithm. Both words and hashtags are nodes of the cooccurrence graph, and an edge represents the relation of two nodes co-occurring in the same tweet. A list of words with a high node degree representing a sense is extracted as a community of the graph. We take Wikipedia disambiguation list page as word sense inventory to refine the results by removing non-sense topics.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages154-165
Number of pages12
Volume9313
ISBN (Print)9783319252544
DOIs
Publication statusPublished - 2015
Event17th Asia-PacificWeb Conference, APWeb 2015 - Guangzhou, China
Duration: 2015 Sep 182015 Sep 20

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9313
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other17th Asia-PacificWeb Conference, APWeb 2015
CountryChina
CityGuangzhou
Period15/9/1815/9/20

Fingerprint

Proof by induction
Graph in graph theory
Vertex of a graph
Community Detection
Wikipedia
Design
Context
Community

Keywords

  • Co-occurrence Graph
  • Hashtag
  • Sense induction
  • Twitter
  • Wikipedia

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Wang, M., & Iwaihara, M. (2015). Hashtag sense induction based on co-occurrence graphs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9313, pp. 154-165). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9313). Springer Verlag. https://doi.org/10.1007/978-3-319-25255-1_13

Hashtag sense induction based on co-occurrence graphs. / Wang, Mengmeng; Iwaihara, Mizuho.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9313 Springer Verlag, 2015. p. 154-165 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9313).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wang, M & Iwaihara, M 2015, Hashtag sense induction based on co-occurrence graphs. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 9313, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9313, Springer Verlag, pp. 154-165, 17th Asia-PacificWeb Conference, APWeb 2015, Guangzhou, China, 15/9/18. https://doi.org/10.1007/978-3-319-25255-1_13
Wang M, Iwaihara M. Hashtag sense induction based on co-occurrence graphs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9313. Springer Verlag. 2015. p. 154-165. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-25255-1_13
Wang, Mengmeng ; Iwaihara, Mizuho. / Hashtag sense induction based on co-occurrence graphs. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9313 Springer Verlag, 2015. pp. 154-165 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{bb1a62426abb4153aee6a3c833089ec2,
title = "Hashtag sense induction based on co-occurrence graphs",
abstract = "Twitter hashtags are used to categorize tweets for improving search categorizing topic. But the fact that people can create and use hashtags freely leads to a situation such that one hashtag may have multiple senses. In this paper, we propose a method to induce senses of a hashtag in a particular time frame. Our assumption is that for a sense of a hashtag the context words around it are similar. Then we design a method that uses a co-occurrence graph and community detection algorithm. Both words and hashtags are nodes of the cooccurrence graph, and an edge represents the relation of two nodes co-occurring in the same tweet. A list of words with a high node degree representing a sense is extracted as a community of the graph. We take Wikipedia disambiguation list page as word sense inventory to refine the results by removing non-sense topics.",
keywords = "Co-occurrence Graph, Hashtag, Sense induction, Twitter, Wikipedia",
author = "Mengmeng Wang and Mizuho Iwaihara",
year = "2015",
doi = "10.1007/978-3-319-25255-1_13",
language = "English",
isbn = "9783319252544",
volume = "9313",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "154--165",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Hashtag sense induction based on co-occurrence graphs

AU - Wang, Mengmeng

AU - Iwaihara, Mizuho

PY - 2015

Y1 - 2015

N2 - Twitter hashtags are used to categorize tweets for improving search categorizing topic. But the fact that people can create and use hashtags freely leads to a situation such that one hashtag may have multiple senses. In this paper, we propose a method to induce senses of a hashtag in a particular time frame. Our assumption is that for a sense of a hashtag the context words around it are similar. Then we design a method that uses a co-occurrence graph and community detection algorithm. Both words and hashtags are nodes of the cooccurrence graph, and an edge represents the relation of two nodes co-occurring in the same tweet. A list of words with a high node degree representing a sense is extracted as a community of the graph. We take Wikipedia disambiguation list page as word sense inventory to refine the results by removing non-sense topics.

AB - Twitter hashtags are used to categorize tweets for improving search categorizing topic. But the fact that people can create and use hashtags freely leads to a situation such that one hashtag may have multiple senses. In this paper, we propose a method to induce senses of a hashtag in a particular time frame. Our assumption is that for a sense of a hashtag the context words around it are similar. Then we design a method that uses a co-occurrence graph and community detection algorithm. Both words and hashtags are nodes of the cooccurrence graph, and an edge represents the relation of two nodes co-occurring in the same tweet. A list of words with a high node degree representing a sense is extracted as a community of the graph. We take Wikipedia disambiguation list page as word sense inventory to refine the results by removing non-sense topics.

KW - Co-occurrence Graph

KW - Hashtag

KW - Sense induction

KW - Twitter

KW - Wikipedia

UR - http://www.scopus.com/inward/record.url?scp=84950298237&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84950298237&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-25255-1_13

DO - 10.1007/978-3-319-25255-1_13

M3 - Conference contribution

AN - SCOPUS:84950298237

SN - 9783319252544

VL - 9313

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 154

EP - 165

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -