IPknot: Fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming

Kengo Sato, Yuki Kato, Michiaki Hamada, Tatsuya Akutsu, Kiyoshi Asai

Research output: Contribution to journalArticle

89 Citations (Scopus)

Abstract

Motivation: Pseudoknots found in secondary structures of a number of functional RNAs play various roles in biological processes. Recent methods for predicting RNA secondary structures cover certain classes of pseudoknotted structures, but only a few of them achieve satisfying predictions in terms of both speed and accuracy. Results: We propose IPknot, a novel computational method for predicting RNA secondary structures with pseudoknots based on maximizing expected accuracy of a predicted structure. IPknot decomposes a pseudoknotted structure into a set of pseudoknot-free substructures and approximates a base-pairing probability distribution that considers pseudoknots, leading to the capability of modeling a wide class of pseudoknots and running quite fast. In addition, we propose a heuristic algorithm for refining base-paring probabilities to improve the prediction accuracy of IPknot. The problem of maximizing expected accuracy is solved by using integer programming with threshold cut. We also extend IPknot so that it can predict the consensus secondary structure with pseudoknots when a multiple sequence alignment is given. IPknot is validated through extensive experiments on various datasets, showing that IPknot achieves better prediction accuracy and faster running time as compared with several competitive prediction methods.

Original languageEnglish
Article numberbtr215
JournalBioinformatics
Volume27
Issue number13
DOIs
Publication statusPublished - 2011 Jul
Externally publishedYes

Fingerprint

RNA Secondary Structure
Integer programming
Integer Programming
RNA
Prediction
Biological Phenomena
Secondary Structure
Sequence Alignment
Base Pairing
Consensus
Heuristic algorithms
Computational methods
Multiple Sequence Alignment
Probability distributions
Refining
Substructure
Pairing
Heuristic algorithm
Computational Methods
Probability Distribution

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology
  • Computational Theory and Mathematics
  • Computer Science Applications
  • Computational Mathematics
  • Statistics and Probability
  • Medicine(all)

Cite this

IPknot : Fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming. / Sato, Kengo; Kato, Yuki; Hamada, Michiaki; Akutsu, Tatsuya; Asai, Kiyoshi.

In: Bioinformatics, Vol. 27, No. 13, btr215, 07.2011.

Research output: Contribution to journalArticle

@article{83e30b06a7eb41fd836044d2a276dae8,
title = "IPknot: Fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming",
abstract = "Motivation: Pseudoknots found in secondary structures of a number of functional RNAs play various roles in biological processes. Recent methods for predicting RNA secondary structures cover certain classes of pseudoknotted structures, but only a few of them achieve satisfying predictions in terms of both speed and accuracy. Results: We propose IPknot, a novel computational method for predicting RNA secondary structures with pseudoknots based on maximizing expected accuracy of a predicted structure. IPknot decomposes a pseudoknotted structure into a set of pseudoknot-free substructures and approximates a base-pairing probability distribution that considers pseudoknots, leading to the capability of modeling a wide class of pseudoknots and running quite fast. In addition, we propose a heuristic algorithm for refining base-paring probabilities to improve the prediction accuracy of IPknot. The problem of maximizing expected accuracy is solved by using integer programming with threshold cut. We also extend IPknot so that it can predict the consensus secondary structure with pseudoknots when a multiple sequence alignment is given. IPknot is validated through extensive experiments on various datasets, showing that IPknot achieves better prediction accuracy and faster running time as compared with several competitive prediction methods.",
author = "Kengo Sato and Yuki Kato and Michiaki Hamada and Tatsuya Akutsu and Kiyoshi Asai",
year = "2011",
month = "7",
doi = "10.1093/bioinformatics/btr215",
language = "English",
volume = "27",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "13",

}

TY - JOUR

T1 - IPknot

T2 - Fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming

AU - Sato, Kengo

AU - Kato, Yuki

AU - Hamada, Michiaki

AU - Akutsu, Tatsuya

AU - Asai, Kiyoshi

PY - 2011/7

Y1 - 2011/7

N2 - Motivation: Pseudoknots found in secondary structures of a number of functional RNAs play various roles in biological processes. Recent methods for predicting RNA secondary structures cover certain classes of pseudoknotted structures, but only a few of them achieve satisfying predictions in terms of both speed and accuracy. Results: We propose IPknot, a novel computational method for predicting RNA secondary structures with pseudoknots based on maximizing expected accuracy of a predicted structure. IPknot decomposes a pseudoknotted structure into a set of pseudoknot-free substructures and approximates a base-pairing probability distribution that considers pseudoknots, leading to the capability of modeling a wide class of pseudoknots and running quite fast. In addition, we propose a heuristic algorithm for refining base-paring probabilities to improve the prediction accuracy of IPknot. The problem of maximizing expected accuracy is solved by using integer programming with threshold cut. We also extend IPknot so that it can predict the consensus secondary structure with pseudoknots when a multiple sequence alignment is given. IPknot is validated through extensive experiments on various datasets, showing that IPknot achieves better prediction accuracy and faster running time as compared with several competitive prediction methods.

AB - Motivation: Pseudoknots found in secondary structures of a number of functional RNAs play various roles in biological processes. Recent methods for predicting RNA secondary structures cover certain classes of pseudoknotted structures, but only a few of them achieve satisfying predictions in terms of both speed and accuracy. Results: We propose IPknot, a novel computational method for predicting RNA secondary structures with pseudoknots based on maximizing expected accuracy of a predicted structure. IPknot decomposes a pseudoknotted structure into a set of pseudoknot-free substructures and approximates a base-pairing probability distribution that considers pseudoknots, leading to the capability of modeling a wide class of pseudoknots and running quite fast. In addition, we propose a heuristic algorithm for refining base-paring probabilities to improve the prediction accuracy of IPknot. The problem of maximizing expected accuracy is solved by using integer programming with threshold cut. We also extend IPknot so that it can predict the consensus secondary structure with pseudoknots when a multiple sequence alignment is given. IPknot is validated through extensive experiments on various datasets, showing that IPknot achieves better prediction accuracy and faster running time as compared with several competitive prediction methods.

UR - http://www.scopus.com/inward/record.url?scp=79959465715&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79959465715&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btr215

DO - 10.1093/bioinformatics/btr215

M3 - Article

C2 - 21685106

AN - SCOPUS:79959465715

VL - 27

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 13

M1 - btr215

ER -