Advancing Science through Mining Libraries, Ontologies, and Communities

被引:12
作者
Evans, James A. [1 ,2 ]
Rzhetsky, Andrey [2 ,3 ,4 ]
机构
[1] Univ Chicago, Dept Sociol, Chicago, IL 60637 USA
[2] Univ Chicago, Computat Inst, Argonne Natl Lab, Chicago, IL 60637 USA
[3] Univ Chicago, Inst Genom & Syst Biol, Dept Med, Chicago, IL 60637 USA
[4] Univ Chicago, Inst Genom & Syst Biol, Dept Human Genet, Chicago, IL 60637 USA
关键词
MOLECULAR-INTERACTIONS; KNOWLEDGE; COLLABORATION; NETWORKS; BEHAVIOR; NAME;
D O I
10.1074/jbc.R110.176370
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Life scientists today cannot hope to read everything relevant to their research. Emerging text-mining tools can help by identifying topics and distilling statements from books and articles with increased accuracy. Researchers often organize these statements into ontologies, consistent systems of reality claims. Like scientific thinking and interchange, however, text-mined information (even when accurately captured) is complex, redundant, sometimes incoherent, and often contradictory: it is rooted in a mixture of only partially consistent ontologies. We review work that models scientific reason and suggest how computational reasoning across ontologies and the broader distribution of textual statements can assess the certainty of statements and the process by which statements become certain. With the emergence of digitized data regarding networks of scientific authorship, institutions, and resources, we explore the possibility of accounting for social dependences and cultural biases in reasoning models. Computational reasoning is starting to fill out ontologies and flag internal inconsistencies in several areas of bioscience. In the not too distant future, scientists may be able to use statements and rich models of the processes that produced them to identify underexplored areas, resurrect forgotten findings and ideas, deconvolute the spaghetti of underlying ontologies, and synthesize novel knowledge and hypotheses.
引用
收藏
页码:23659 / 23666
页数:8
相关论文
共 33 条
[1]  
[Anonymous], SUBLANGUAGE STUDIES
[2]  
[Anonymous], 2006, Infotopia: How Many Minds Produce Knowledge
[3]  
[Anonymous], 1975, MATH BIOSCI, DOI 10.1016/0025-5564(75)90047-4
[4]  
[Anonymous], 2000, Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition
[5]  
[Anonymous], 1999, The genetical theory of natural selection: a complete variorum edition
[6]   A SIMPLE-MODEL OF HERD BEHAVIOR [J].
BANERJEE, AV .
QUARTERLY JOURNAL OF ECONOMICS, 1992, 107 (03) :797-817
[7]   A THEORY OF FADS, FASHION, CUSTOM, AND CULTURAL-CHANGE AS INFORMATIONAL CASCADES [J].
BIKHCHANDANI, S ;
HIRSHLEIFER, D ;
WELCH, I .
JOURNAL OF POLITICAL ECONOMY, 1992, 100 (05) :992-1026
[8]  
Bodenreider O, 2008, Yearb Med Inform, P67
[9]   Emergent behavior of growing knowledge about molecular interactions [J].
Cokol, M ;
Iossifov, I ;
Weinreb, C ;
Rzhetsky, A .
NATURE BIOTECHNOLOGY, 2005, 23 (10) :1243-1247
[10]   Evolvability of physiological and biochemical traits: evolutionary mechanisms including and beyond single-nucleotide mutation [J].
Feder, Martin E. .
JOURNAL OF EXPERIMENTAL BIOLOGY, 2007, 210 (09) :1653-1660