The Human Phenotype Ontology: Semantic Unification of Common and Rare Disease

被引:157
作者
Groza, Tudor [1 ,2 ]
Koehler, Sebastian [3 ]
Moldenhauer, Dawid [3 ,4 ]
Vasilevsky, Nicole [5 ]
Baynam, Gareth [6 ,7 ,8 ,9 ,10 ]
Zemojtel, Tomasz [3 ,11 ]
Schriml, Lynn Marie [12 ,13 ]
Kibbe, Warren Alden [14 ]
Schofield, Paul N. [15 ,16 ]
Beck, Tim [17 ]
Vasant, Drashtti [18 ]
Brookes, Anthony J. [17 ]
Zankl, Andreas [2 ,19 ,20 ]
Washington, Nicole L. [21 ]
Mungall, Christopher J. [21 ]
Lewis, Suzanna E. [21 ]
Haendel, Melissa A.
Parkinson, Helen [18 ]
Robinson, Peter N. [3 ,22 ,23 ,24 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, St Lucia, Qld 4072, Australia
[2] Garvan Inst Med Res, Sydney, NSW 2010, Australia
[3] Charite, Inst Med & Human Genet, D-13353 Berlin, Germany
[4] Univ Appl Sci, D-35390 Giessen, Germany
[5] Oregon Hlth & Sci Univ, Lib, Portland, OR 97239 USA
[6] Univ Western Australia, Sch Paediat & Child Hlth, Perth, WA 6840, Australia
[7] Murdoch Univ, Inst Immunol & Infect Dis, Perth, WA 6150, Australia
[8] Off Populat Hlth Genom, Publ Hlth & Clin Serv Div, Dept Hlth, Perth, WA 6004, Australia
[9] King Edward Mem Hosp, Genet Serv Western Australia, Perth, WA 6008, Australia
[10] Telethon Kids Inst, Perth, WA 6008, Australia
[11] Polish Acad Sci, Inst Bioorgan Chem, PL-61704 Poznan, Poland
[12] Univ Maryland, Sch Med, Dept Epidemiol & Publ Hlth, Baltimore, MD 21201 USA
[13] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[14] NCI, Ctr Biomed Informat & Informat Technol, Rockville, MD 20850 USA
[15] Univ Cambridge, Dept Physiol Dev & Neurosci, Cambridge CB2 3EG, England
[16] Jackson Lab, Bar Harbor, ME 04609 USA
[17] Univ Leicester, Dept Genet, Leicester LE1 7RH, Leics, England
[18] European Bioinformat Inst, European Mol Biol Lab, Cambridge CB10 1SD, England
[19] Childrens Hosp Westmead, Acad Dept Med Genet, Sydney, NSW 2145, Australia
[20] Univ Sydney, Discipline Genet Med, Sydney Med Sch, Sydney, NSW 2145, Australia
[21] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Genom Div, Berkeley, CA 94720 USA
[22] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
[23] Charite, Berlin Brandenburg Ctr Regenerat Therapies, D-13353 Berlin, Germany
[24] Free Univ Berlin, Dept Math & Comp Sci, Inst Bioinformat, D-14195 Berlin, Germany
基金
澳大利亚研究理事会; 英国医学研究理事会;
关键词
GENOME-WIDE ASSOCIATION; RISK LOCI; GENETIC-VARIANTS; DATABASE; SUSCEPTIBILITY; DISORDERS; MEDICINE; RESOURCE; BIOLOGY; NETWORK;
D O I
10.1016/j.ajhg.2015.05.020
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million Pub Med abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.
引用
收藏
页码:111 / 124
页数:14
相关论文
共 85 条
[1]   A New Face and New Challenges for Online Mendelian Inheritance in Man (OMIM®) [J].
Amberger, Joanna ;
Bocchini, Carol ;
Hamosh, Ada .
HUMAN MUTATION, 2011, 32 (05) :564-567
[2]   OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders [J].
Amberger, Joanna S. ;
Bocchini, Carol A. ;
Schiettecatte, Francois ;
Scott, Alan F. ;
Hamosh, Ada .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D789-D798
[3]  
[Anonymous], INTRO BIOL ONTOLOGIE
[4]   A public resource facilitating clinical use of genomes [J].
Ball, Madeleine P. ;
Thakuria, Joseph V. ;
Zaranek, Alexander Wait ;
Clegg, Tom ;
Rosenbaum, Abraham M. ;
Wu, Xiaodi ;
Angrist, Misha ;
Bhak, Jong ;
Bobe, Jason ;
Callow, Matthew J. ;
Cano, Carlos ;
Chou, Michael F. ;
Chung, Wendy K. ;
Douglas, Shawn M. ;
Estep, Preston W. ;
Gore, Athurva ;
Hulick, Peter ;
Labarga, Alberto ;
Lee, Je-Hyuk ;
Lunshof, Jeantine E. ;
Kim, Byung Chul ;
Kim, Jong-Il ;
Li, Zhe ;
Murray, Michael F. ;
Nilsen, Geoffrey B. ;
Peters, Brock A. ;
Raman, Anugraha M. ;
Rienhoff, Hugh Y. ;
Robasky, Kimberly ;
Wheeler, Matthew T. ;
Vandewege, Ward ;
Vorhaus, Daniel B. ;
Yang, Joyce L. ;
Yang, Luhan ;
Aach, John ;
Ashley, Euan A. ;
Drmanac, Radoje ;
Kim, Seong-Jin ;
Li, Jin Billy ;
Peshkin, Leonid ;
Seidman, Christine E. ;
Seo, Jeong-Sun ;
Zhang, Kun ;
Rehm, Heidi L. ;
Church, George M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (30) :11920-11927
[5]   Network medicine -: From obesity to the "Diseasome'' [J].
Barabasi, Albert-Laszlo .
NEW ENGLAND JOURNAL OF MEDICINE, 2007, 357 (04) :404-407
[6]   An ontology-based measure to compute semantic similarity in biomedicine [J].
Batet, Montserrat ;
Sanchez, David ;
Valls, Aida .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (01) :118-125
[7]   Bayesian ontology querying for accurate and noise-tolerant semantic searches [J].
Bauer, Sebastian ;
Koehler, Sebastian ;
Schulz, Marcel H. ;
Robinson, Peter N. .
BIOINFORMATICS, 2012, 28 (19) :2502-2508
[8]   Characterization of the proteome, diseases and evolution of the human postsynaptic density [J].
Bayes, Alex ;
van de Lagemaat, Louie N. ;
Collins, Mark O. ;
Croning, Mike D. R. ;
Whittle, Ian R. ;
Choudhary, Jyoti S. ;
Grant, Seth G. N. .
NATURE NEUROSCIENCE, 2011, 14 (01) :19-21
[9]   GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies [J].
Beck, Tim ;
Hastings, Robert K. ;
Gollapudi, Sirisha ;
Free, Robert C. ;
Brookes, Anthony J. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2014, 22 (07) :949-952
[10]   Carrier Testing for Severe Childhood Recessive Diseases by Next-Generation Sequencing [J].
Bell, Callum J. ;
Dinwiddie, Darrell L. ;
Miller, Neil A. ;
Hateley, Shannon L. ;
Ganusova, Elena E. ;
Mudge, Joann ;
Langley, Ray J. ;
Zhang, Lu ;
Lee, Clarence C. ;
Schilkey, Faye D. ;
Sheth, Vrunda ;
Woodward, Jimmy E. ;
Peckham, Heather E. ;
Schroth, Gary P. ;
Kim, Ryan W. ;
Kingsmore, Stephen F. .
SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (65)