Identifying Relationships among Genomic Disease Regions: Predicting Genes at Pathogenic SNP Associations and Rare Deletions

被引:319
作者
Raychaudhuri, Soumya [1 ,2 ,3 ]
Plenge, Robert M. [1 ,3 ]
Rossin, Elizabeth J. [1 ,2 ,4 ]
Ng, Aylwin C. Y. [5 ,6 ]
Purcell, Shaun M. [2 ,7 ,8 ]
Sklar, Pamela [2 ,7 ,8 ,9 ]
Scolnick, Edward M. [2 ,7 ,9 ]
Xavier, Ramnik J. [5 ,6 ]
Altshuler, David [1 ,2 ,10 ,11 ,12 ]
Daly, Mark J. [1 ,2 ]
机构
[1] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA USA
[2] Massachusetts Gen Hosp, Ctr Human Genet Res, Boston, MA 02114 USA
[3] Harvard Univ, Brigham & Womens Hosp, Sch Med, Div Rheumatol Allergy & Immunol, Boston, MA 02115 USA
[4] Harvard MIT Hlth Sci & Technol, Cambridge, MA USA
[5] Massachusetts Gen Hosp, Ctr Computat & Integrat Biol, Boston, MA 02114 USA
[6] Massachusetts Gen Hosp, Gastroenterol Unit, Boston, MA 02114 USA
[7] Broad Inst MIT & Harvard, Stanley Ctr Psychiat Res, Cambridge, MA USA
[8] Massachusetts Gen Hosp, Psychiat & Neurodev Genet Unit, Boston, MA 02114 USA
[9] Massachusetts Gen Hosp, Dept Psychiat, Boston, MA 02114 USA
[10] Massachusetts Gen Hosp, Dept Mol Biol, Boston, MA 02114 USA
[11] Harvard Univ, Sch Med, Dept Genet, Boston, MA USA
[12] Massachusetts Gen Hosp, Diabet Unit, Boston, MA 02114 USA
来源
PLOS GENETICS | 2009年 / 5卷 / 06期
关键词
INFORMATION-RETRIEVAL; CANDIDATE GENES; PROTEIN; LOCI; EXPRESSION; MOUSE; IMMUNITY; NETWORK; LOCALIZATION; DEFICIENT;
D O I
10.1371/journal.pgen.1000534
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Translating a set of disease regions into insight about pathogenic mechanisms requires not only the ability to identify the key disease genes within them, but also the biological relationships among those key genes. Here we describe a statistical method, Gene Relationships Among Implicated Loci (GRAIL), that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts. We first evaluated GRAIL by assessing its ability to identify subsets of highly related genes in common pathways from validated lipid and height SNP associations from recent genome-wide studies. We then tested GRAIL, by assessing its ability to separate true disease regions from many false positive disease regions in two separate practical applications in human genetics. First, we took 74 nominally associated Crohn's disease SNPs and applied GRAIL to identify a subset of 13 SNPs with highly related genes. Of these, ten convincingly validated in follow-up genotyping; genotyping results for the remaining three were inconclusive. Next, we applied GRAIL to 165 rare deletion events seen in schizophrenia cases (less than one-third of which are contributing to disease risk). We demonstrate that GRAIL is able to identify a subset of 16 deletions containing highly related genes; many of these genes are expressed in the central nervous system and play a role in neuronal synapses. GRAIL offers a statistically robust approach to identifying functionally related genes from across multiple disease regions-that likely represent key disease pathways. An online version of this method is available for public use (http://www.broad.mit.edu/mpg/grail/).
引用
收藏
页数:15
相关论文
共 55 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease [J].
Barrett, Jeffrey C. ;
Hansoul, Sarah ;
Nicolae, Dan L. ;
Cho, Judy H. ;
Duerr, Richard H. ;
Rioux, John D. ;
Brant, Steven R. ;
Silverberg, Mark S. ;
Taylor, Kent D. ;
Barmada, M. Michael ;
Bitton, Alain ;
Dassopoulos, Themistocles ;
Datta, Lisa Wu ;
Green, Todd ;
Griffiths, Anne M. ;
Kistner, Emily O. ;
Murtha, Michael T. ;
Regueiro, Miguel D. ;
Rotter, Jerome I. ;
Schumm, L. Philip ;
Steinhart, A. Hillary ;
Targan, Stephan R. ;
Xavier, Ramnik J. ;
Libioulle, Cecile ;
Sandor, Cynthia ;
Lathrop, Mark ;
Belaiche, Jacques ;
Dewit, Olivier ;
Gut, Ivo ;
Heath, Simon ;
Laukens, Debby ;
Mni, Myriam ;
Rutgeerts, Paul ;
Van Gossum, Andre ;
Zelenika, Diana ;
Franchimont, Denis ;
Hugot, Jean-Pierre ;
de Vos, Martine ;
Vermeire, Severine ;
Louis, Edouard ;
Cardon, Lon R. ;
Anderson, Carl A. ;
Drummond, Hazel ;
Nimmo, Elaine ;
Ahmad, Tariq ;
Prescott, Natalie J. ;
Onnie, Clive M. ;
Fisher, Sheila A. ;
Marchini, Jonathan ;
Ghori, Jilur .
NATURE GENETICS, 2008, 40 (08) :955-962
[4]   Filamin C interacts with the muscular dystrophy KY protein and is abnormally distributed in mouse KY deficient muscle fibres [J].
Beatham, J ;
Romero, R ;
Townsend, SKM ;
Hacker, T ;
van der Ven, PFM ;
Blanco, G .
HUMAN MOLECULAR GENETICS, 2004, 13 (22) :2863-2874
[5]   The kyphoscoliosis (ky) mouse is deficient in hypertrophic responses and is caused by a mutation in a novel muscle-specific protein [J].
Blanco, G ;
Coulton, GR ;
Biggin, A ;
Grainge, C ;
Moss, J ;
Barrett, M ;
Berquin, A ;
Maréchal, G ;
Skynner, N ;
van Mier, P ;
Nikitopoulou, A ;
Kraus, M ;
Ponting, CP ;
Mason, RM ;
Brown, SDM .
HUMAN MOLECULAR GENETICS, 2001, 10 (01) :9-16
[6]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[7]   Genetic dissection of immunity to mycobacteria: The human model [J].
Casanova, JL ;
Abel, L .
ANNUAL REVIEW OF IMMUNOLOGY, 2002, 20 :581-620
[8]   FitSNPs: highly differentially expressed genes are more likely to have variants associated with disease [J].
Chen, Rong ;
Morgan, Alex A. ;
Dudley, Joel ;
Deshpande, Tarangini ;
Li, Li ;
Kodama, Keiichi ;
Chiang, Annie P. ;
Butte, Atul J. .
GENOME BIOLOGY, 2008, 9 (12)
[9]   Mice lacking bioactive IL-12 can generate protective, antigen-specific cellular responses to mycobacterial infection only if the IL-12 p40 subunit is present [J].
Cooper, AM ;
Kipnis, A ;
Turner, J ;
Magram, J ;
Ferrante, J ;
Orme, IM .
JOURNAL OF IMMUNOLOGY, 2002, 168 (03) :1322-1327
[10]   Interleukin 12 (IL-12) is crucial to the development of protective immunity in mice intravenously infected with Mycobacterium tuberculosis [J].
Cooper, AM ;
Magram, J ;
Ferrante, J ;
Orme, IM .
JOURNAL OF EXPERIMENTAL MEDICINE, 1997, 186 (01) :39-45