抄録
To improve the segmentation velocity and storage efficiency of the Chinese word segmentation algorithm, this paper proposes a characteristic matching algorithm based on pair coding. The characteristic value is extracted from the Chinese character position. This method can support fuzzy matching and don't need match multi-character Chinese words, so the characteristic value extraction is extracted from the adjacent Chinese character position. In addition, the data compression method can contribute to reduce storage space and improve the performance of Chinese word segmentation.
本文言語 | English |
---|---|
ページ(範囲) | 526-530 |
ページ数 | 5 |
ジャーナル | Nanjing Li Gong Daxue Xuebao/Journal of Nanjing University of Science and Technology |
巻 | 38 |
号 | 4 |
出版ステータス | Published - 2014 8月 30 |
外部発表 | はい |
ASJC Scopus subject areas
- 工学(全般)