Microparadigms: Chains of collective reasoning in publications about molecular interactions

被引:34
作者
Rzhetsky, A [1 ]
Iossifov, I
Loh, JM
White, KP
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[2] Columbia Univ, Columbia Genome Ctr, New York, NY 10032 USA
[3] Columbia Univ, Ctr Computat Biol & Bioinformat, New York, NY 10032 USA
[4] Columbia Univ, Dept Stat, New York, NY 10032 USA
[5] Yale Univ, Dept Genet, New Haven, CT 06520 USA
关键词
Bayesian inference; quality of science; text mining; experiment interpretation; information cascade;
D O I
10.1073/pnas.0600591103
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We analyzed a very large set of molecular interactions that had been derived automatically from biological texts. We found that published statements, regardless of their verity, tend to interfere with interpretation of the subsequent experiments and, therefore, can act as scientific "microparadigms," similar to dominant scientific theories [Kuhn, T. S. (1996) The Structure of Scientific Revolutions (Univ. Chicago Press, Chicago)]. Using statistical tools, we measured the strength of the influence of a single published statement on subsequent interpretations. We call these measured values the momentums of the published statements and treat separately the majority and minority of conflicting statements about the same molecular event. Our results indicate that, when building biological models based on published experimental data, we may have to treat the data as highly dependent-ordered sequences of statements (i.e., chains of collective reasoning) rather than unordered and independent experimental observations. Furthermore, our computations indicate that our data set can be interpreted in two very different ways (two "alternative universes"): one is an "optimists' universe" with a very low incidence of false results (< 5%), and another is a "pessimists' universe" with an extraordinarily high rate of false results (> 90%). Our computations deem highly unlikely any milder intermediate explanation between these two extremes.
引用
收藏
页码:4940 / 4945
页数:6
相关论文
共 9 条
[1]   Parallel metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference [J].
Altekar, G ;
Dwarkadas, S ;
Huelsenbeck, JP ;
Ronquist, F .
BIOINFORMATICS, 2004, 20 (03) :407-415
[2]  
Anderson LR, 1997, AM ECON REV, V87, P847
[3]  
FRIEDMAN C, 2001, BIOINFORMATICS S1, V17, P74
[4]  
GEYER CJ, 1991, COMPUTING SCIENCE AND STATISTICS, P156
[5]   ASSESSING UNCERTAINTY IN PHYSICAL CONSTANTS [J].
HENRION, M ;
FISCHHOFF, B .
AMERICAN JOURNAL OF PHYSICS, 1986, 54 (09) :791-798
[6]  
Krauthammer Michael, 2002, Bioinformatics, V18 Suppl 1, pS249
[7]   GeneWays:: a system for extracting, analyzing, visualizing, and integrating molecular pathway data [J].
Rzhetsky, A ;
Iossifov, I ;
Koike, T ;
Krauthammer, M ;
Kra, P ;
Morris, M ;
Yu, H ;
Duboué, PA ;
Weng, WB ;
Wilbur, WJ ;
Hatzivassiloglou, V ;
Friedman, C .
JOURNAL OF BIOMEDICAL INFORMATICS, 2004, 37 (01) :43-53
[8]  
Spiegelhalter D., 1995, MARKOV CHAIN MONTE C
[9]   Assessment of progress over the CASP experiments [J].
Venclovas, C ;
Zemla, A ;
Fidelis, K ;
Moult, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :585-595