TY - JOUR
T1 - Enhancing Cancer Driver Gene Prediction by Protein-Protein Interaction Network
AU - Liu, Chuang
AU - Dai, Yao
AU - Yu, Keping
AU - Zhang, Zi Ke
N1 - Funding Information:
This work was supported in part by the Natural Science Foundation of China under Grant 61873080 and 61673151, Zhejiang Provincial Natural Science Foundation of China under Grant LY18A050004 and LR18A050001 and the Japan Society for the Promotion of Science (JSPS) Grants-in-Aid for Scientific Research (KAKENHI) under Grant JP18K18044.
Publisher Copyright:
© 2004-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - With the advances in gene sequencing technologies, millions of somatic mutations have been reported in the past decades, but mining cancer driver genes with oncogenic mutations from these data remains a critical and challenging area of research. In this study, we proposed a network-based classification method for identifying cancer driver genes with merging the multi-biological information. In this method, we construct a cancer specific genetic network from the human protein-protein interactome (PPI) to mine the network structure attributes, and combine biological information such as mutation frequency and differential expression of genes to achieve accurate prediction of cancer driver genes. Across seven different cancer types, the proposed algorithm always achieves high prediction accuracy, which is superior to the existing advanced methods. In the analysis of the predicted results, about 40 percent of the top 10 candidate genes overlap with the Cancer Gene Census database. Interestingly, the feature comparison indicates that the network based features are still more important than the biological features, including the mutation frequency and genetic differential expression. Further analyses also show that the integration of network structure attributes and biological information is valuable for predicting new cancer driver genes.
AB - With the advances in gene sequencing technologies, millions of somatic mutations have been reported in the past decades, but mining cancer driver genes with oncogenic mutations from these data remains a critical and challenging area of research. In this study, we proposed a network-based classification method for identifying cancer driver genes with merging the multi-biological information. In this method, we construct a cancer specific genetic network from the human protein-protein interactome (PPI) to mine the network structure attributes, and combine biological information such as mutation frequency and differential expression of genes to achieve accurate prediction of cancer driver genes. Across seven different cancer types, the proposed algorithm always achieves high prediction accuracy, which is superior to the existing advanced methods. In the analysis of the predicted results, about 40 percent of the top 10 candidate genes overlap with the Cancer Gene Census database. Interestingly, the feature comparison indicates that the network based features are still more important than the biological features, including the mutation frequency and genetic differential expression. Further analyses also show that the integration of network structure attributes and biological information is valuable for predicting new cancer driver genes.
KW - Cancer driver gene
KW - Human interactome
KW - Network structure
KW - Random forest
KW - Signed random walk with restart
UR - http://www.scopus.com/inward/record.url?scp=85102254118&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85102254118&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2021.3063532
DO - 10.1109/TCBB.2021.3063532
M3 - Article
AN - SCOPUS:85102254118
SN - 1545-5963
VL - 19
SP - 2231
EP - 2240
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 4
ER -