Generalized centroid estimators in bioinformatics

Michiaki Hamada, Hisanori Kiryu, Wataru Iwasaki, Kiyoshi Asai

研究成果: Article

11 引用 (Scopus)

抄録

In a number of estimation problems in bioinformatics, accuracy measures of the target problem are usually given, and it is important to design estimators that are suitable to those accuracy measures. However, there is often a discrepancy between an employed estimator and a given accuracy measure of the problem. In this study, we introduce a general class of efficient estimators for estimation problems on high-dimensional binary spaces, which represent many fundamental problems in bioinformatics. Theoretical analysis reveals that the proposed estimators generally fit with commonly-used accuracy measures (e.g. sensitivity, PPV, MCC and F-score) as well as it can be computed efficiently in many cases, and cover a wide range of problems in bioinformatics from the viewpoint of the principle of maximum expected accuracy (MEA). It is also shown that some important algorithms in bioinformatics can be interpreted in a unified manner. Not only the concept presented in this paper gives a useful framework to design MEA-based estimators but also it is highly extendable and sheds new light on many problems in bioinformatics.

元の言語English
記事番号e16450
ジャーナルPLoS One
6
発行部数2
DOI
出版物ステータスPublished - 2011
外部発表Yes

Fingerprint

Bioinformatics
Computational Biology
bioinformatics

ASJC Scopus subject areas

  • Agricultural and Biological Sciences(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Medicine(all)

これを引用

Generalized centroid estimators in bioinformatics. / Hamada, Michiaki; Kiryu, Hisanori; Iwasaki, Wataru; Asai, Kiyoshi.

:: PLoS One, 巻 6, 番号 2, e16450, 2011.

研究成果: Article

Hamada, Michiaki ; Kiryu, Hisanori ; Iwasaki, Wataru ; Asai, Kiyoshi. / Generalized centroid estimators in bioinformatics. :: PLoS One. 2011 ; 巻 6, 番号 2.
@article{11e4473f0079482fb8e8d31b0aace3fc,
title = "Generalized centroid estimators in bioinformatics",
abstract = "In a number of estimation problems in bioinformatics, accuracy measures of the target problem are usually given, and it is important to design estimators that are suitable to those accuracy measures. However, there is often a discrepancy between an employed estimator and a given accuracy measure of the problem. In this study, we introduce a general class of efficient estimators for estimation problems on high-dimensional binary spaces, which represent many fundamental problems in bioinformatics. Theoretical analysis reveals that the proposed estimators generally fit with commonly-used accuracy measures (e.g. sensitivity, PPV, MCC and F-score) as well as it can be computed efficiently in many cases, and cover a wide range of problems in bioinformatics from the viewpoint of the principle of maximum expected accuracy (MEA). It is also shown that some important algorithms in bioinformatics can be interpreted in a unified manner. Not only the concept presented in this paper gives a useful framework to design MEA-based estimators but also it is highly extendable and sheds new light on many problems in bioinformatics.",
author = "Michiaki Hamada and Hisanori Kiryu and Wataru Iwasaki and Kiyoshi Asai",
year = "2011",
doi = "10.1371/journal.pone.0016450",
language = "English",
volume = "6",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "2",

}

TY - JOUR

T1 - Generalized centroid estimators in bioinformatics

AU - Hamada, Michiaki

AU - Kiryu, Hisanori

AU - Iwasaki, Wataru

AU - Asai, Kiyoshi

PY - 2011

Y1 - 2011

N2 - In a number of estimation problems in bioinformatics, accuracy measures of the target problem are usually given, and it is important to design estimators that are suitable to those accuracy measures. However, there is often a discrepancy between an employed estimator and a given accuracy measure of the problem. In this study, we introduce a general class of efficient estimators for estimation problems on high-dimensional binary spaces, which represent many fundamental problems in bioinformatics. Theoretical analysis reveals that the proposed estimators generally fit with commonly-used accuracy measures (e.g. sensitivity, PPV, MCC and F-score) as well as it can be computed efficiently in many cases, and cover a wide range of problems in bioinformatics from the viewpoint of the principle of maximum expected accuracy (MEA). It is also shown that some important algorithms in bioinformatics can be interpreted in a unified manner. Not only the concept presented in this paper gives a useful framework to design MEA-based estimators but also it is highly extendable and sheds new light on many problems in bioinformatics.

AB - In a number of estimation problems in bioinformatics, accuracy measures of the target problem are usually given, and it is important to design estimators that are suitable to those accuracy measures. However, there is often a discrepancy between an employed estimator and a given accuracy measure of the problem. In this study, we introduce a general class of efficient estimators for estimation problems on high-dimensional binary spaces, which represent many fundamental problems in bioinformatics. Theoretical analysis reveals that the proposed estimators generally fit with commonly-used accuracy measures (e.g. sensitivity, PPV, MCC and F-score) as well as it can be computed efficiently in many cases, and cover a wide range of problems in bioinformatics from the viewpoint of the principle of maximum expected accuracy (MEA). It is also shown that some important algorithms in bioinformatics can be interpreted in a unified manner. Not only the concept presented in this paper gives a useful framework to design MEA-based estimators but also it is highly extendable and sheds new light on many problems in bioinformatics.

UR - http://www.scopus.com/inward/record.url?scp=79951974555&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79951974555&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0016450

DO - 10.1371/journal.pone.0016450

M3 - Article

C2 - 21365017

AN - SCOPUS:79951974555

VL - 6

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 2

M1 - e16450

ER -