ProtAnt: A tool for analysing the prototypicality of texts

Laurence Anthony, Paul Baker

    研究成果: Article

    9 引用 (Scopus)

    抜粋

    Corpus-based researchers and traditional qualitative researchers, such as those interested in critical discourse analysis, are often required to select prototypical texts for close reading that include the language features of interest that are present in a much larger corpus. Traditional approaches to this selection procedure have been largely ad hoc. In this paper, we offer a more principled way of selecting texts for close reading based on a ranking of texts in terms of the number of keywords they contain. To facilitate this analysis, we have developed a multiplatform, freeware software tool called ProtAnt that analyses the texts, generates a ranked list of keywords based on statistical significance and effect size, and then orders the texts by the number of keywords in them. We describe various experiments that demonstrate the ProtAnt analysis is effective not only at identifying prototypical texts, but also identifying outlier texts that may need to be removed from a target corpus.

    元の言語English
    ページ(範囲)273-292
    ページ数20
    ジャーナルInternational Journal of Corpus Linguistics
    20
    発行部数3
    DOI
    出版物ステータスPublished - 2015

    ASJC Scopus subject areas

    • Language and Linguistics
    • Linguistics and Language

    フィンガープリント ProtAnt: A tool for analysing the prototypicality of texts' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用