A new method for E. coli DNA segment classification on promoters and non-promoters is presented. The algorithm is based on the Independent Component Analysis (ICA). Since the DNA segments are composed of discrete symbols, this paper contains two major steps: (1) Position-dependent transformation of DNA segments to real number sequences, and (2) Applications of the ICA to the E. coli promoter recognition. These steps are related to each other. Therefore, algorithmic explanations are given in detail while referring mutually. The automatic precision of 93.7% is obtained. Since the presented method allows threshold adjustments, twilight-zone data can be further cross-checked individually so that false negatives are reduced.
|ホスト出版物のタイトル||Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004|
|出版物ステータス||Published - 2004|
|イベント||Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004 - Stanford, CA|
継続期間: 2004 8 16 → 2004 8 19
|Other||Proceedings - 2004 IEEE Computational Systems Bioinformatics Conference, CSB 2004|
|期間||04/8/16 → 04/8/19|
ASJC Scopus subject areas