Revision graph extraction in wikipedia based on supergram decomposition

Jianmin Wu, Mizuho Iwaihara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

As one of the popular social media that many people turn to in recent years, collaborative encyclopedia Wikipedia provides information in a more "Neutral Point of View" way than others. Towards this core principle, plenty of efforts have been put into collaborative contribution and editing. The trajectories of how such collaboration appears by revisions are valuable for group dynamics and social media research, which suggest that we should extract the underlying derivation relationships among revisions from chronologically-sorted revision history in a precise way. In this paper, we propose a revision graph extraction method based on supergram decomposition in the document collection of near-duplicates. The plain text of revisions would be measured by its frequency distribution of supergram, which is the variable-length token sequence that keeps the same through revisions. We show that this method can effectively perform the task than existing methods. Categories and Subject Descriptors K.4.3 [Computers and Society]: Organizational Impacts - Computer-supported collaborative work. General Terms Algorithms, Experimentation.

Original languageEnglish
Title of host publicationProceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013
DOIs
Publication statusPublished - 2013
Event9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013 - Hong Kong
Duration: 2013 Aug 52013 Aug 7

Other

Other9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013
CityHong Kong
Period13/8/513/8/7

Fingerprint

Computer supported cooperative work
Trajectories
Decomposition

Keywords

  • Collaboration
  • Revision history
  • Wikipedia

ASJC Scopus subject areas

  • Software

Cite this

Wu, J., & Iwaihara, M. (2013). Revision graph extraction in wikipedia based on supergram decomposition. In Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013 [10] https://doi.org/10.1145/2491055.2491065

Revision graph extraction in wikipedia based on supergram decomposition. / Wu, Jianmin; Iwaihara, Mizuho.

Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013. 2013. 10.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wu, J & Iwaihara, M 2013, Revision graph extraction in wikipedia based on supergram decomposition. in Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013., 10, 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013, Hong Kong, 13/8/5. https://doi.org/10.1145/2491055.2491065
Wu J, Iwaihara M. Revision graph extraction in wikipedia based on supergram decomposition. In Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013. 2013. 10 https://doi.org/10.1145/2491055.2491065
Wu, Jianmin ; Iwaihara, Mizuho. / Revision graph extraction in wikipedia based on supergram decomposition. Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013. 2013.
@inproceedings{c458f01d05694bdca38fc1f1c93310f0,
title = "Revision graph extraction in wikipedia based on supergram decomposition",
abstract = "As one of the popular social media that many people turn to in recent years, collaborative encyclopedia Wikipedia provides information in a more {"}Neutral Point of View{"} way than others. Towards this core principle, plenty of efforts have been put into collaborative contribution and editing. The trajectories of how such collaboration appears by revisions are valuable for group dynamics and social media research, which suggest that we should extract the underlying derivation relationships among revisions from chronologically-sorted revision history in a precise way. In this paper, we propose a revision graph extraction method based on supergram decomposition in the document collection of near-duplicates. The plain text of revisions would be measured by its frequency distribution of supergram, which is the variable-length token sequence that keeps the same through revisions. We show that this method can effectively perform the task than existing methods. Categories and Subject Descriptors K.4.3 [Computers and Society]: Organizational Impacts - Computer-supported collaborative work. General Terms Algorithms, Experimentation.",
keywords = "Collaboration, Revision history, Wikipedia",
author = "Jianmin Wu and Mizuho Iwaihara",
year = "2013",
doi = "10.1145/2491055.2491065",
language = "English",
isbn = "9781450318525",
booktitle = "Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013",

}

TY - GEN

T1 - Revision graph extraction in wikipedia based on supergram decomposition

AU - Wu, Jianmin

AU - Iwaihara, Mizuho

PY - 2013

Y1 - 2013

N2 - As one of the popular social media that many people turn to in recent years, collaborative encyclopedia Wikipedia provides information in a more "Neutral Point of View" way than others. Towards this core principle, plenty of efforts have been put into collaborative contribution and editing. The trajectories of how such collaboration appears by revisions are valuable for group dynamics and social media research, which suggest that we should extract the underlying derivation relationships among revisions from chronologically-sorted revision history in a precise way. In this paper, we propose a revision graph extraction method based on supergram decomposition in the document collection of near-duplicates. The plain text of revisions would be measured by its frequency distribution of supergram, which is the variable-length token sequence that keeps the same through revisions. We show that this method can effectively perform the task than existing methods. Categories and Subject Descriptors K.4.3 [Computers and Society]: Organizational Impacts - Computer-supported collaborative work. General Terms Algorithms, Experimentation.

AB - As one of the popular social media that many people turn to in recent years, collaborative encyclopedia Wikipedia provides information in a more "Neutral Point of View" way than others. Towards this core principle, plenty of efforts have been put into collaborative contribution and editing. The trajectories of how such collaboration appears by revisions are valuable for group dynamics and social media research, which suggest that we should extract the underlying derivation relationships among revisions from chronologically-sorted revision history in a precise way. In this paper, we propose a revision graph extraction method based on supergram decomposition in the document collection of near-duplicates. The plain text of revisions would be measured by its frequency distribution of supergram, which is the variable-length token sequence that keeps the same through revisions. We show that this method can effectively perform the task than existing methods. Categories and Subject Descriptors K.4.3 [Computers and Society]: Organizational Impacts - Computer-supported collaborative work. General Terms Algorithms, Experimentation.

KW - Collaboration

KW - Revision history

KW - Wikipedia

UR - http://www.scopus.com/inward/record.url?scp=84888148254&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84888148254&partnerID=8YFLogxK

U2 - 10.1145/2491055.2491065

DO - 10.1145/2491055.2491065

M3 - Conference contribution

AN - SCOPUS:84888148254

SN - 9781450318525

BT - Proceedings of the 9th International Symposium on Open Collaboration, WikiSym + OpenSym 2013

ER -