Epistemology of computational biology: Mathematical models and experimental prediction as the basis of their validity

被引:37
作者
Dougherty, ER [1 ]
Braga-Neto, U
机构
[1] Fiocruz MS, Aggeu Magalhaes Res Ctr, Virol & Expt Therapy Lab, CPqAM, BR-50670420 Recife, PE, Brazil
[2] Texas A&M Univ, Dept Elect Engn, College Stn, TX 77483 USA
[3] Translat Gen Res Inst, Computat Biol Div, Phoenix, AZ 85004 USA
关键词
philosophy of science; epistemology; computational biology; classification; clustering; regulatory networks; error estimation;
D O I
10.1142/S0218339006001726
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Knowing the roles of mathematics and computation in experimental science is important for computational biology because these roles determine to a great extent how research in this field should be pursued and how it should relate to biology in general. The present paper examines the epistemology of computational biology from the perspective of modern science, the underlying principle of which is that a scientific theory must have two parts: (1) a structural model, which is a mathematical construct that aims to represent a selected portion of physical reality and (2) a well-defined procedure for relating consequences of the model to quantifiable observations. We also explore the contingency and creative nature of a scientific theory. Among the questions considered are: Can computational biology form the theoretical core of biology? What is the basis, if any, for choosing one particular model over another? And what is the role of computation in science, and in biology in particular? We examine how this broad epistemological framework applies to important statistical methodologies pertaining to computational biology, such as expression-based phenotype classification, gene regulatory networks, and clustering. We consider classification in detail, as the epistemological issues raised by classification are related to all computational-biology topics in which statistical prediction plays a key role. We pay particular attention to classifier-model validity and its relation to estimation rules.
引用
收藏
页码:65 / 90
页数:26
相关论文
共 55 条
[1]  
[Anonymous], FDN SCI
[2]   Classifier performance as a function of distributional complexity [J].
Attoor, SN ;
Dougherty, ER .
PATTERN RECOGNITION, 2004, 37 (08) :1641-1651
[3]   Clustering gene expression patterns [J].
Ben-Dor, A ;
Shamir, R ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :281-297
[4]   Tissue classification with gene expression profiles [J].
Ben-Dor, A ;
Bruhn, L ;
Friedman, N ;
Nachman, I ;
Schummer, M ;
Yakhini, Z .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :559-583
[5]   Molecular classification of cutaneous malignant melanoma by gene expression profiling [J].
Bittner, M ;
Meitzer, P ;
Chen, Y ;
Jiang, Y ;
Seftor, E ;
Hendrix, M ;
Radmacher, M ;
Simon, R ;
Yakhini, Z ;
Ben-Dor, A ;
Sampas, N ;
Dougherty, E ;
Wang, E ;
Marincola, F ;
Gooden, C ;
Lueders, J ;
Glatfelter, A ;
Pollock, P ;
Carpten, J ;
Gillanders, E ;
Leja, D ;
Dietrich, K ;
Beaudry, C ;
Berens, M ;
Alberts, D ;
Sondak, V ;
Hayward, N ;
Trent, J .
NATURE, 2000, 406 (6795) :536-540
[6]   Exact performance of error estimators for discrete classifiers [J].
Braga-Neto, U ;
Dougherty, E .
PATTERN RECOGNITION, 2005, 38 (11) :1799-1814
[7]   Is cross-validation valid for small-sample microarray classification? [J].
Braga-Neto, UM ;
Dougherty, ER .
BIOINFORMATICS, 2004, 20 (03) :374-380
[8]  
CHEN J, 2005, IEEE CIRCUITS SYSTEM, V5, P46
[9]   External control in Markovian Genetic Regulatory Networks [J].
Datta, A ;
Choudhary, A ;
Bittner, ML ;
Dougherty, ER .
MACHINE LEARNING, 2003, 52 (1-2) :169-191
[10]   A genomic regulatory network for development [J].
Davidson, EH ;
Rast, JP ;
Oliveri, P ;
Ransick, A ;
Calestani, C ;
Yuh, CH ;
Minokawa, T ;
Amore, G ;
Hinman, V ;
Arenas-Mena, C ;
Otim, O ;
Brown, CT ;
Livi, CB ;
Lee, PY ;
Revilla, R ;
Rust, AG ;
Pan, ZJ ;
Schilstra, MJ ;
Clarke, PJC ;
Arnone, MI ;
Rowen, L ;
Cameron, RA ;
McClay, DR ;
Hood, L ;
Bolouri, H .
SCIENCE, 2002, 295 (5560) :1669-1678