TY - JOUR
T1 - Poisson approach to clustering analysis of regulatory sequences
AU - Wang, Haiying
AU - Zheng, Huiru
AU - Hu, Jinglu
PY - 2008/1/1
Y1 - 2008/1/1
N2 - The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes.
AB - The presence of similar patterns in regulatory sequences may aid users in identifying co-regulated genes or inferring regulatory modules. By modelling pattern occurrences in regulatory regions with Poisson statistics, this paper presents a log likelihood ratio statistics-based distance measure to calculate pair-wise similarities between regulatory sequences. We employed it within three clustering algorithms: hierarchical clustering, Self-Organising Map, and a self-adaptive neural network. The results indicate that, in comparison to traditional clustering algorithms, the incorporation of the log likelihood ratio statistics-based distance into the learning process may offer considerable improvements in the process of regulatory sequence-based classification of genes.
KW - Poisson distribution
KW - hierarchical clustering
KW - log likelihood ratio statistics
KW - neural networks
KW - regulatory sequence
UR - http://www.scopus.com/inward/record.url?scp=75649133555&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=75649133555&partnerID=8YFLogxK
U2 - 10.1504/IJCBDD.2008.020206
DO - 10.1504/IJCBDD.2008.020206
M3 - Article
C2 - 20058486
AN - SCOPUS:75649133555
VL - 1
SP - 141
EP - 157
JO - International Journal of Computational Biology and Drug Design
JF - International Journal of Computational Biology and Drug Design
SN - 1756-0756
IS - 2
ER -