Mining experimental evidence of molecular function claims from the literature

Cited by: 7
Authors
Crangle, Colleen E.
Cherry, J. Michael
Hong, Eurie L.
Zbyslaw, Alex
Affiliations
[1] Converspeech LLC, Palo Alto, CA 94301 USA
[2] Stanford Univ, Dept Genom, Stanford, CA 94025 USA
DOI
10.1093/bioinformatics/btm495
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Motivation: The rate at which gene-related findings appear in the scientific literature makes it difficult, if not impossible, for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence.
Results: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast).
Pages: 3232-3240
Page count: 9