Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites

Chun Fang, Tamotsu Noguchi, Hayato Yamana

    Research output: Contribution to journal › Article

    2 Citations (Scopus)

    Abstract

    Evolutionary conservation information contained in position-specific scoring matrices (PSSMs) has been widely adopted by sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved to some extent. However, different functional sites have different conservation patterns: some are linearly contextual, some are mingled with highly variable residues, and others seem to be conserved independently. Every value in a PSSM is calculated independently of the others and carries no contextual information about the residues in the sequence. Therefore, using the direct output of a PSSM for prediction fails to consider the relationship between the conservation patterns of residues and the distribution of conservation scores in the PSSM. To demonstrate the importance of combining PSSMs with the specific conservation patterns of functional sites for prediction, three different PSSM-based methods for identifying three kinds of functional sites were analyzed. The results suggest that different PSSM-based methods differ in their capability to identify different patterns of functional sites, and that better combining PSSMs with the specific conservation patterns of residues would largely facilitate prediction.
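The abstract's point that each PSSM value is computed independently, with no sequence context, can be illustrated with a small sketch. This is not the method analyzed in the article: the toy alignment, the uniform background frequency, the pseudocount, and the function names (`column_scores`, `build_pssm`, `conservation_profile`, `windowed_profile`) are all assumptions made for illustration. PSSMs produced by tools such as PSI-BLAST use empirical background frequencies and sequence weighting instead. The sliding-window average is just one simple, hypothetical way to let neighbouring positions influence a per-residue score.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_scores(column, pseudocount=1.0):
    """Log-odds scores for one alignment column (uniform background assumed)."""
    counts = Counter(residue for residue in column if residue in AMINO_ACIDS)
    total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
    background = 1.0 / len(AMINO_ACIDS)
    return {
        aa: math.log2(((counts[aa] + pseudocount) / total) / background)
        for aa in AMINO_ACIDS
    }


def build_pssm(alignment, pseudocount=1.0):
    """One row of scores per column; each row depends on that column only."""
    return [column_scores(col, pseudocount) for col in zip(*alignment)]


def conservation_profile(pssm):
    """Crude per-position conservation proxy: the highest score in each row."""
    return [max(row.values()) for row in pssm]


def windowed_profile(profile, half_width=1):
    """Average each position with its neighbours to add sequence context."""
    smoothed = []
    for i in range(len(profile)):
        lo = max(0, i - half_width)
        hi = min(len(profile), i + half_width + 1)
        window = profile[lo:hi]
        smoothed.append(sum(window) / len(window))
    return smoothed


# Hypothetical toy alignment: columns 0, 2 and 3 are fully conserved,
# column 1 is partly conserved, column 4 is fully variable.
alignment = ["MKCHA", "MRCHG", "MKCHS", "MQCHT"]
pssm = build_pssm(alignment)
profile = conservation_profile(pssm)
context = windowed_profile(profile, half_width=1)
```

In this sketch, changing the residues in one alignment column leaves every other PSSM row untouched, which is the independence the abstract describes. The windowed profile, by contrast, lowers the score of a conserved position that sits next to a variable one, so a conserved residue "mingled with highly variable residues" looks different from one inside a conserved stretch.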

    Original language: English
    Article number: 1440003
    Journal: Journal of Bioinformatics and Computational Biology
    Volume: 12
    Issue number: 5
    DOI: 10.1142/S0219720014400034
    Publication status: Published - 2014 Oct 30

    Keywords

    • Conservation patterns
    • Functional sites
    • Position-specific scoring matrix

    ASJC Scopus subject areas

    • Biochemistry
    • Molecular Biology
    • Computer Science Applications

    Cite this

    Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites. / Fang, Chun; Noguchi, Tamotsu; Yamana, Hayato.

    In: Journal of Bioinformatics and Computational Biology, Vol. 12, No. 5, 1440003, 30.10.2014.

    @article{b93b7ace68f4435b8655076d79913553,
    title = "Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites",
    keywords = "Conservation patterns, Functional sites, Position-specific scoring matrix",
    author = "Chun Fang and Tamotsu Noguchi and Hayato Yamana",
    year = "2014",
    month = "10",
    day = "30",
    doi = "10.1142/S0219720014400034",
    language = "English",
    volume = "12",
    journal = "Journal of Bioinformatics and Computational Biology",
    issn = "0219-7200",
    publisher = "World Scientific Publishing Co. Pte Ltd",
    number = "5",

    }

    TY - JOUR

    T1 - Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites

    AU - Fang, Chun

    AU - Noguchi, Tamotsu

    AU - Yamana, Hayato

    PY - 2014/10/30

    Y1 - 2014/10/30

    KW - Conservation patterns

    KW - Functional sites

    KW - Position-specific scoring matrix

    UR - http://www.scopus.com/inward/record.url?scp=84908637319&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=84908637319&partnerID=8YFLogxK

    U2 - 10.1142/S0219720014400034

    DO - 10.1142/S0219720014400034

    M3 - Article

    VL - 12

    JO - Journal of Bioinformatics and Computational Biology

    JF - Journal of Bioinformatics and Computational Biology

    SN - 0219-7200

    IS - 5

    M1 - 1440003

    ER -