A challenge of authorship identification for ten-thousand-scale microblog users

Syunya Okuno, Hiroki Asai, Hayato Yamana

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    9 Citations (Scopus)

    Abstract

    Internet security issues require authorship identification for all kinds of internet contents; however, authorship identification for microblog users is much harder than other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., big data, it will take long time to identify. Our proposed method solves these problems. The experimental results show that our method successfully identifies the authorship with 53.2% of precision out of 10,000 microblog users in the almost half execution time of previous method.

    Original languageEnglish
    Title of host publicationProceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages52-54
    Number of pages3
    ISBN (Print)9781479956654
    DOIs
    Publication statusPublished - 2015 Jan 7
    Event2nd IEEE International Conference on Big Data, IEEE Big Data 2014 - Washington
    Duration: 2014 Oct 272014 Oct 30

    Other

    Other2nd IEEE International Conference on Big Data, IEEE Big Data 2014
    CityWashington
    Period14/10/2714/10/30

    Fingerprint

    Internet
    Big data

    Keywords

    • Authorship attribution
    • Authorship detection
    • Authorship identification
    • Microblog
    • Twitter

    ASJC Scopus subject areas

    • Artificial Intelligence
    • Information Systems

    Cite this

    Okuno, S., Asai, H., & Yamana, H. (2015). A challenge of authorship identification for ten-thousand-scale microblog users. In Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014 (pp. 52-54). [7004491] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BigData.2014.7004491

    A challenge of authorship identification for ten-thousand-scale microblog users. / Okuno, Syunya; Asai, Hiroki; Yamana, Hayato.

    Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014. Institute of Electrical and Electronics Engineers Inc., 2015. p. 52-54 7004491.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Okuno, S, Asai, H & Yamana, H 2015, A challenge of authorship identification for ten-thousand-scale microblog users. in Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014., 7004491, Institute of Electrical and Electronics Engineers Inc., pp. 52-54, 2nd IEEE International Conference on Big Data, IEEE Big Data 2014, Washington, 14/10/27. https://doi.org/10.1109/BigData.2014.7004491
    Okuno S, Asai H, Yamana H. A challenge of authorship identification for ten-thousand-scale microblog users. In Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014. Institute of Electrical and Electronics Engineers Inc. 2015. p. 52-54. 7004491 https://doi.org/10.1109/BigData.2014.7004491
    Okuno, Syunya ; Asai, Hiroki ; Yamana, Hayato. / A challenge of authorship identification for ten-thousand-scale microblog users. Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 52-54
    @inproceedings{69fa5920a7904b0cb7d31136f33c08b1,
    title = "A challenge of authorship identification for ten-thousand-scale microblog users",
    abstract = "Internet security issues require authorship identification for all kinds of internet contents; however, authorship identification for microblog users is much harder than other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., big data, it will take long time to identify. Our proposed method solves these problems. The experimental results show that our method successfully identifies the authorship with 53.2{\%} of precision out of 10,000 microblog users in the almost half execution time of previous method.",
    keywords = "Authorship attribution, Authorship detection, Authorship identification, Microblog, Twitter",
    author = "Syunya Okuno and Hiroki Asai and Hayato Yamana",
    year = "2015",
    month = "1",
    day = "7",
    doi = "10.1109/BigData.2014.7004491",
    language = "English",
    isbn = "9781479956654",
    pages = "52--54",
    booktitle = "Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014",
    publisher = "Institute of Electrical and Electronics Engineers Inc.",

    }

    TY - GEN

    T1 - A challenge of authorship identification for ten-thousand-scale microblog users

    AU - Okuno, Syunya

    AU - Asai, Hiroki

    AU - Yamana, Hayato

    PY - 2015/1/7

    Y1 - 2015/1/7

    N2 - Internet security issues require authorship identification for all kinds of internet contents; however, authorship identification for microblog users is much harder than other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., big data, it will take long time to identify. Our proposed method solves these problems. The experimental results show that our method successfully identifies the authorship with 53.2% of precision out of 10,000 microblog users in the almost half execution time of previous method.

    AB - Internet security issues require authorship identification for all kinds of internet contents; however, authorship identification for microblog users is much harder than other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., big data, it will take long time to identify. Our proposed method solves these problems. The experimental results show that our method successfully identifies the authorship with 53.2% of precision out of 10,000 microblog users in the almost half execution time of previous method.

    KW - Authorship attribution

    KW - Authorship detection

    KW - Authorship identification

    KW - Microblog

    KW - Twitter

    UR - http://www.scopus.com/inward/record.url?scp=84921742594&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=84921742594&partnerID=8YFLogxK

    U2 - 10.1109/BigData.2014.7004491

    DO - 10.1109/BigData.2014.7004491

    M3 - Conference contribution

    AN - SCOPUS:84921742594

    SN - 9781479956654

    SP - 52

    EP - 54

    BT - Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014

    PB - Institute of Electrical and Electronics Engineers Inc.

    ER -