PhenoGO: an integrated resource for the multiscale mining of clinical and biological data

Cited by: 21
Authors
Sam, Lee T. [2, 4]
Mendonca, Eneida A. [2]
Li, Jianrong [2]
Blake, Judith [3]
Friedman, Carol [1]
Lussier, Yves A. [2]
Affiliations
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10027 USA
[2] Univ Chicago, Dept Med, Ctr Biomed Informat, Chicago, IL 60637 USA
[3] Jackson Lab, Bar Harbor, ME 04609 USA
[4] Univ Michigan, Ann Arbor, MI 48109 USA
Source
BMC BIOINFORMATICS | 2009 / Vol. 10
Keywords
PHENOTYPIC INFORMATION; GENE-FUNCTION; ONTOLOGY; NETWORK; TOOL;
DOI
10.1186/1471-2105-10-S2-S8
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
The evolving complexity of genome-scale experiments has made a highly computable, accurate, and comprehensive resource spanning multiple biological scales and viewpoints increasingly central. To meet this need, we have significantly extended the PhenoGO database with gene-disease specific annotations and included an additional ten species. This computationally derived resource is primarily intended to provide phenotypic context (cell type, tissue, organ, and disease) for mining existing associations between gene products and GO terms specified in the Gene Ontology databases. Automated natural language processing (BioMedLEE) and computational ontology (PhenOS) methods were used to derive these relationships from the literature, expanding the database with information from ten additional species to include over 600,000 phenotypic contexts spanning eleven species from five GO annotation databases. An evaluation of the mappings (n = 300) found precision (positive predictive value) of 85% and recall (sensitivity) of 76%. Phenotypes are encoded in general-purpose ontologies such as the Cell Ontology and the Unified Medical Language System, and in specialized ontologies such as the Mouse Anatomy and the Mammalian Phenotype Ontology. A web portal has also been developed, allowing advanced filtering and querying of the database as well as download of the entire dataset (http://www.phenogo.org).
Pages: 8