Fighting against uncertainty: An essential issue in bioinformatics

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the 'uncertainty' of a solution, that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimensional discrete spaces in which the number of possible discrete solutions is immense. In the analysis of biological data or the development of prediction algorithms, this uncertainty should be handled carefully and appropriately. In this review, I will explain several methods to combat this uncertainty, presenting a number of examples in bioinformatics. The methods include (i) avoiding point estimation, (ii) maximum expected accuracy (MEA) estimations and (iii) several strategies to design a pipeline involving several prediction methods. I believe that the basic concepts and ideas described in this review will be generally useful for estimation problems in various areas of bioinformatics.

Original languageEnglish
Pages (from-to)748-767
Number of pages20
JournalBriefings in Bioinformatics
Volume15
Issue number5
DOIs
Publication statusPublished - 2013 May 7
Externally publishedYes

Fingerprint

Bioinformatics
Computational Biology
Uncertainty
Sequence Alignment
RNA
Trees (mathematics)
Pipelines
Genes

Keywords

  • Bioinformatics
  • Estimation problems
  • Sequence analysis
  • Uncertainty of solutions

ASJC Scopus subject areas

  • Molecular Biology
  • Information Systems

Cite this

Fighting against uncertainty : An essential issue in bioinformatics. / Hamada, Michiaki.

In: Briefings in Bioinformatics, Vol. 15, No. 5, 07.05.2013, p. 748-767.

Research output: Contribution to journalArticle

@article{92744904b4ff412bb9299133bc72d487,
title = "Fighting against uncertainty: An essential issue in bioinformatics",
abstract = "Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the 'uncertainty' of a solution, that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimensional discrete spaces in which the number of possible discrete solutions is immense. In the analysis of biological data or the development of prediction algorithms, this uncertainty should be handled carefully and appropriately. In this review, I will explain several methods to combat this uncertainty, presenting a number of examples in bioinformatics. The methods include (i) avoiding point estimation, (ii) maximum expected accuracy (MEA) estimations and (iii) several strategies to design a pipeline involving several prediction methods. I believe that the basic concepts and ideas described in this review will be generally useful for estimation problems in various areas of bioinformatics.",
keywords = "Bioinformatics, Estimation problems, Sequence analysis, Uncertainty of solutions",
author = "Michiaki Hamada",
year = "2013",
month = "5",
day = "7",
doi = "10.1093/bib/bbt038",
language = "English",
volume = "15",
pages = "748--767",
journal = "Briefings in Bioinformatics",
issn = "1467-5463",
publisher = "Oxford University Press",
number = "5",

}

TY - JOUR

T1 - Fighting against uncertainty

T2 - An essential issue in bioinformatics

AU - Hamada, Michiaki

PY - 2013/5/7

Y1 - 2013/5/7

N2 - Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the 'uncertainty' of a solution, that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimensional discrete spaces in which the number of possible discrete solutions is immense. In the analysis of biological data or the development of prediction algorithms, this uncertainty should be handled carefully and appropriately. In this review, I will explain several methods to combat this uncertainty, presenting a number of examples in bioinformatics. The methods include (i) avoiding point estimation, (ii) maximum expected accuracy (MEA) estimations and (iii) several strategies to design a pipeline involving several prediction methods. I believe that the basic concepts and ideas described in this review will be generally useful for estimation problems in various areas of bioinformatics.

AB - Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the 'uncertainty' of a solution, that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimensional discrete spaces in which the number of possible discrete solutions is immense. In the analysis of biological data or the development of prediction algorithms, this uncertainty should be handled carefully and appropriately. In this review, I will explain several methods to combat this uncertainty, presenting a number of examples in bioinformatics. The methods include (i) avoiding point estimation, (ii) maximum expected accuracy (MEA) estimations and (iii) several strategies to design a pipeline involving several prediction methods. I believe that the basic concepts and ideas described in this review will be generally useful for estimation problems in various areas of bioinformatics.

KW - Bioinformatics

KW - Estimation problems

KW - Sequence analysis

KW - Uncertainty of solutions

UR - http://www.scopus.com/inward/record.url?scp=84921758344&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84921758344&partnerID=8YFLogxK

U2 - 10.1093/bib/bbt038

DO - 10.1093/bib/bbt038

M3 - Article

C2 - 23803300

AN - SCOPUS:84921758344

VL - 15

SP - 748

EP - 767

JO - Briefings in Bioinformatics

JF - Briefings in Bioinformatics

SN - 1467-5463

IS - 5

ER -