PhenoGO: an integrated resource for the multiscale mining of clinical and biological data

被引:21
作者
Sam, Lee T. [2 ,4 ]
Mendonca, Eneida A. [2 ]
Li, Jianrong [2 ]
Blake, Judith [3 ]
Friedman, Carol [1 ]
Lussier, Yves A. [2 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10027 USA
[2] Univ Chicago, Dept Med, Ctr Biomed Informat, Chicago, IL 60637 USA
[3] Jackson Lab, Bar Harbor, ME 04609 USA
[4] Univ Michigan, Ann Arbor, MI 48109 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
PHENOTYPIC INFORMATION; GENE-FUNCTION; ONTOLOGY; NETWORK; TOOL;
D O I
10.1186/1471-2105-10-S2-S8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The evolving complexity of genome-scale experiments has increasingly centralized the role of a highly computable, accurate, and comprehensive resource spanning multiple biological scales and viewpoints. To provide a resource to meet this need, we have significantly extended the PhenoGO database with gene-disease specific annotations and included an additional ten species. This a computationally-derived resource is primarily intended to provide phenotypic context (cell type, tissue, organ, and disease) for mining existing associations between gene products and GO terms specified in the Gene Ontology Databases Automated natural language processing (BioMedLEE) and computational ontology (PhenOS) methods were used to derive these relationships from the literature, expanding the database with information from ten additional species to include over 600,000 phenotypic contexts spanning eleven species from five GO annotation databases. A comprehensive evaluation evaluating the mappings (n=300) found precision (positive predictive value) at 85%, and recall (sensitivity) at 76%. Phenotypes are encoded in general purpose ontologies such as Cell Ontology, the Unified Medical Language System, and in specialized ontologies such as the Mouse Anatomy and the Mammalian Phenotype Ontology. A web portal has also been developed, allowing for advanced filtering and querying of the database as well as download of the entire dataset http://www.phenogo.org.
引用
收藏
页数:8
相关论文
共 30 条
[1]   An ontology for cell types [J].
Bard, J ;
Rhee, SY ;
Ashburner, M .
GENOME BIOLOGY, 2005, 6 (02)
[2]   Predicting function: From genes to genomes and back [J].
Bork, P ;
Dandekar, T ;
Diaz-Lazcoz, Y ;
Eisenhaber, F ;
Huynen, M ;
Yuan, YP .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 283 (04) :707-725
[3]  
Camon Evelyn, 2004, In Silico Biology, V4, P5
[4]  
Cantor MN, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P103
[5]  
Chen LF, 2004, STUD HEALTH TECHNOL, V107, P758
[6]   The Mouse Genome Database (MGD): from genes to mice - a community resource for mouse biology [J].
Eppig, JT ;
Bult, CJ ;
Kadin, JA ;
Richardson, JE ;
Blake, JA .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D471-D475
[7]   Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes [J].
Franke, Lude ;
van Bakel, Harm ;
Fokkens, Like ;
de Jong, Edwin D. ;
Egmont-Petersen, Michael ;
Wijmenga, Cisca .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 78 (06) :1011-1025
[8]   Analysis of protein sequence and interaction data for candidate disease gene prediction [J].
George, Richard A. ;
Liu, Jason Y. ;
Feng, Lina L. ;
Bryson-Richardson, Robert J. ;
Fatkin, Diane ;
Wouters, Merridee A. .
NUCLEIC ACIDS RESEARCH, 2006, 34 (19)
[9]   The human disease network [J].
Goh, Kwang-Il ;
Cusick, Michael E. ;
Valle, David ;
Childs, Barton ;
Vidal, Marc ;
Barabasi, Albert-Laszlo .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (21) :8685-8690
[10]   WormBase: a multi-species resource for nematode biology and genomics [J].
Harris, TW ;
Chen, NS ;
Cunningham, F ;
Tello-Ruiz, M ;
Antoshechkin, I ;
Bastiani, C ;
Bieri, T ;
Blasiar, D ;
Bradnam, K ;
Chan, J ;
Chen, CK ;
Chen, WJ ;
Davis, P ;
Kenny, E ;
Kishore, R ;
Lawson, D ;
Lee, R ;
Muller, HM ;
Nakamura, C ;
Ozersky, P ;
Petcherski, A ;
Rogers, A ;
Sabo, A ;
Schwarz, EM ;
Van Auken, K ;
Wang, QH ;
Durbin, R ;
Spieth, J ;
Sternberg, PW ;
Stein, LD .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D411-D417