Phenotypic overlap in the contribution of individual genes to CNV pathogenicity revealed by cross-species computational analysis of single-gene mutations in humans, mice and zebrafish

被引:35
作者
Doelken, Sandra C. [1 ,2 ]
Koehler, Sebastian [1 ,2 ]
Mungall, Christopher J. [3 ]
Gkoutos, Georgios V. [4 ]
Ruef, Barbara J. [5 ]
Smith, Cynthia [6 ]
Smedley, Damian [7 ]
Bauer, Sebastian [1 ]
Klopocki, Eva [1 ,2 ]
Schofield, Paul N. [6 ,8 ]
Westerfield, Monte [5 ]
Robinson, Peter N. [1 ,2 ,9 ]
Lewis, Suzanna E. [3 ]
机构
[1] Charite, Inst Med & Human Genet, D-13353 Berlin, Germany
[2] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
[3] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Genom Div, Berkeley, CA 94720 USA
[4] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 2AX, Dyfed, Wales
[5] Univ Oregon, ZFIN, Eugene, OR 97403 USA
[6] Jackson Lab, Bar Harbor, ME 04609 USA
[7] European Bioinformat Inst, Cambridge CB10 1SD, England
[8] Univ Cambridge, Dept Physiol Dev & Neurosci, Cambridge CB2 3EG, England
[9] Charite, BCRT, D-13353 Berlin, Germany
基金
美国国家卫生研究院;
关键词
ONLINE MENDELIAN INHERITANCE; COPY-NUMBER VARIATION; DEVELOPMENTAL DELAY; MICRODELETION SYNDROME; MENTAL-RETARDATION; MOUSE; DELETION; ONTOLOGIES; DISEASE; ASSOCIATION;
D O I
10.1242/dmm.010322
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Numerous disease syndromes are associated with regions of copy number variation (CNV) in the human genome and, in most cases, the pathogenicity of the CNV is thought to be related to altered dosage of the genes contained within the affected segment. However, establishing the contribution of individual genes to the overall pathogenicity of CNV syndromes is difficult and often relies on the identification of potential candidates through manual searches of the literature and online resources. We describe here the development of a computational framework to comprehensively search phenotypic information from model organisms and single-gene human hereditary disorders, and thus speed the interpretation of the complex phenotypes of CNV disorders. There are currently more than 5000 human genes about which nothing is known phenotypically but for which detailed phenotypic information for the mouse and/or zebrafish orthologs is available. Here, we present an ontology-based approach to identify similarities between human disease manifestations and the mutational phenotypes in characterized model organism genes; this approach can therefore be used even in cases where there is little or no information about the function of the human genes. We applied this algorithm to detect candidate genes for 27 recurrent CNV disorders and identified 802 gene-phenotype associations, approximately half of which involved genes that were previously reported to be associated with individual phenotypic features and half of which were novel candidates. A total of 431 associations were made solely on the basis of model organism phenotype data. Additionally, we observed a striking, statistically significant tendency for individual disease phenotypes to be associated with multiple genes located within a single CNV region, a phenomenon that we denote as pheno-clustering. Many of the clusters also display statistically significant similarities in protein function or vicinity within the protein-protein interaction network. Our results provide a basis for understanding previously un-interpretable genotype-phenotype correlations in pathogenic CNVs and for mobilizing the large amount of model organism phenotype data to provide insights into human genetic disorders.
引用
收藏
页码:358 / 372
页数:15
相关论文
共 86 条
[51]   Integrating phenotype ontologies across multiple species [J].
Mungall, Christopher J. ;
Gkoutos, Georgios V. ;
Smith, Cynthia L. ;
Haendel, Melissa A. ;
Lewis, Suzanna E. ;
Ashburner, Michael .
GENOME BIOLOGY, 2010, 11 (01)
[52]   Evolution and functional classification of vertebrate gene deserts [J].
Ovcharenko, I ;
Loots, GG ;
Nobrega, MA ;
Hardison, RC ;
Miller, W ;
Stubbs, L .
GENOME RESEARCH, 2005, 15 (01) :137-145
[53]   Evidence of a large-scale functional organization of mammalian chromosomes [J].
Petkov, PM ;
Graber, JH ;
Churchill, GA ;
DiPetrillo, K ;
King, BL ;
Paigen, K .
PLOS GENETICS, 2005, 1 (03) :312-322
[54]   MEDICAL PROGRESS Williams-Beuren Syndrome [J].
Pober, Barbara R. .
NEW ENGLAND JOURNAL OF MEDICINE, 2010, 362 (03) :239-252
[55]   Characterization of Potocki-Lupski syndrome (dup(17)(p11.2p11.2)) and delineation of a dosage-sensitive critical interval that can convey an autism phenotype [J].
Potocki, Lorraine ;
Bi, Weimin ;
Treadwell-Deering, Diane ;
Carvalho, Claudia M. B. ;
Eifert, Anna ;
Friedman, Ellen M. ;
Glaze, Daniel ;
Krull, Kevin ;
Lee, Jennifer A. ;
Lewis, Richard Alan ;
Mendoza-Londono, Roberto ;
Robbins-Furman, Patricia ;
Shaw, Chad ;
Shi, Xin ;
Weissenberger, George ;
Withers, Marjorie ;
Yatsenko, Svetlana A. ;
Zackai, Elaine H. ;
Stankiewicz, Pawel ;
Lupski, James R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :633-649
[56]   Deletions of the homeobox gene SHOX (short stature homeobox) are an important cause of growth failure in children with short stature [J].
Rappold, GA ;
Fukami, M ;
Niesler, B ;
Schiller, S ;
Zumkeller, W ;
Bettendorf, M ;
Heinrich, U ;
Vlachopapadoupoulou, E ;
Reinehr, T ;
Onigata, K ;
Ogata, T .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2002, 87 (03) :1402-1406
[57]  
Resnik P, 1995, INT JOINT CONF ARTIF, P448
[58]   Phenotypic Information in Genomic Variant Databases Enhances Clinical Care and Research: The International Standards for Cytogenomic Arrays Consortium Experience [J].
Riggs, Erin Rooney ;
Jackson, Laird ;
Miller, David T. ;
Van Vooren, Steven .
HUMAN MUTATION, 2012, 33 (05) :787-796
[59]   The Human Phenotype Ontology [J].
Robinson, P. N. ;
Mundlos, S. .
CLINICAL GENETICS, 2010, 77 (06) :525-534
[60]   The Human Phenotype Ontology: A Tool for Annotating and Analyzing Human Hereditary Disease [J].
Robinson, Peter N. ;
Koehler, Sebastian ;
Bauer, Sebastian ;
Seelow, Dominik ;
Horn, Denise ;
Mundlos, Stefan .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 83 (05) :610-615