Zipf's law in phonograms and Weibull distribution in ideograms: Comparison of English with Japanese

Terutaka Nabeshima, Yukio Pegio Gunji

Research output: Article

9 Citations (Scopus)

Abstract

The frequency distribution of word usage in a word sequence generated by capping is estimated from the number of "hits" returned when retrieving web pages, in order to evaluate the semantic structure proper not to a particular text but to a language. In particular, we compare the distributions of English sequences with Japanese ones and find that, for English and for Japanese phonograms, the frequency of word usage against rank follows a power-law function with exponent 1, whereas for Japanese ideograms it follows a stretched exponential (Weibull distribution) function. We also discuss how such a difference can result from the difference between a phonogram-based language (English) and an ideogram-based language (Japanese).
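
The two rank-frequency shapes named in the abstract can be contrasted with a minimal sketch. It assumes the common textbook parametrizations f(r) = C r^(-a) for the power law and f(r) = C exp(-(r / r0)^beta) for the stretched exponential; the abstract does not give the authors' exact parametrization, and the values of `r0` and `beta` below are purely illustrative, not fitted results from the paper. On a log-log plot the power law has a constant slope, while the stretched exponential keeps getting steeper as rank grows.

```python
import math


def zipf_frequency(rank, scale=1.0, exponent=1.0):
    """Power-law rank-frequency form: f(r) = scale * r**(-exponent)."""
    return scale * rank ** (-exponent)


def stretched_exp_frequency(rank, scale=1.0, r0=10.0, beta=0.5):
    """Stretched exponential form: f(r) = scale * exp(-(r / r0)**beta).

    r0 and beta are illustrative assumptions, not values from the paper.
    """
    return scale * math.exp(-((rank / r0) ** beta))


def loglog_slope(freq_fn, r1, r2):
    """Slope of log f(r) against log r between ranks r1 and r2."""
    return (math.log(freq_fn(r2)) - math.log(freq_fn(r1))) / (
        math.log(r2) - math.log(r1)
    )


# Slopes over three successive decades of rank: 1-10, 10-100, 100-1000.
zipf_slopes = [loglog_slope(zipf_frequency, r, 10 * r) for r in (1, 10, 100)]
weibull_slopes = [
    loglog_slope(stretched_exp_frequency, r, 10 * r) for r in (1, 10, 100)
]

print("power-law slopes:", zipf_slopes)
print("stretched-exponential slopes:", weibull_slopes)
```

With exponent 1, the power-law slope stays at -1 in every decade, which is the signature of Zipf's law. The stretched-exponential slope becomes more negative from one decade to the next, so the curve bends downward on log-log axes.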

Original language: English
Pages (from-to): 131-139
Number of pages: 9
Journal: BioSystems
Volume: 73
Issue number: 2
DOI
Publication status: Published - Feb 2004

ASJC Scopus subject areas

  • Statistics and Probability
  • Modelling and Simulation
  • Biochemistry, Genetics and Molecular Biology (all)
  • Applied Mathematics
