Interpretation of Genomic Variants Using a Unified Biological Network Approach

被引:131
作者
Khurana, Ekta [1 ,2 ]
Fu, Yao [1 ]
Chen, Jieming [1 ,3 ]
Gerstein, Mark [1 ,2 ,4 ]
机构
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[2] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT USA
[3] Yale Univ, Integrated Grad Program Phys & Engn Biol, New Haven, CT USA
[4] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
关键词
PROTEIN; EVOLUTION; KEGG;
D O I
10.1371/journal.pcbi.1002886
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The decreasing cost of sequencing is leading to a growing repertoire of personal genomes. However, we are lagging behind in understanding the functional consequences of the millions of variants obtained from sequencing. Global system-wide effects of variants in coding genes are particularly poorly understood. It is known that while variants in some genes can lead to diseases, complete disruption of other genes, called 'loss-of-function tolerant', is possible with no obvious effect. Here, we build a systems-based classifier to quantitatively estimate the global perturbation caused by deleterious mutations in each gene. We first survey the degree to which gene centrality in various individual networks and a unified 'Multinet' correlates with the tolerance to loss-of-function mutations and evolutionary conservation. We find that functionally significant and highly conserved genes tend to be more central in physical protein-protein and regulatory networks. However, this is not the case for metabolic pathways, where the highly central genes have more duplicated copies and are more tolerant to loss-of-function mutations. Integration of three-dimensional protein structures reveals that the correlation with centrality in the protein-protein interaction network is also seen in terms of the number of interaction interfaces used. Finally, combining all the network and evolutionary properties allows us to build a classifier distinguishing functionally essential and loss-of-function tolerant genes with higher accuracy (AUC = 0.91) than any individual property. Application of the classifier to the whole genome shows its strong potential for interpretation of variants involved in Mendelian diseases and in complex disorders probed by genome-wide association studies.
引用
收藏
页数:9
相关论文
共 33 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]   Integration of protein motions with molecular networks reveals different mechanisms for permanent and transient interactions [J].
Bhardwaj, Nitin ;
Abyzov, Alexej ;
Clarke, Declan ;
Shou, Chong ;
Gerstein, Mark B. .
PROTEIN SCIENCE, 2011, 20 (10) :1745-1754
[4]   Protein evolution on a human signaling network [J].
Cui, Qinghua ;
Purisima, Enrico O. ;
Wang, Edwin .
BMC SYSTEMS BIOLOGY, 2009, 3
[5]   iPfam:: visualization of protein-protein interactions in PDB at domain and amino acid resolutions [J].
Finn, RD ;
Marshall, M ;
Bateman, A .
BIOINFORMATICS, 2005, 21 (03) :410-412
[6]   Ensembl 2012 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Gordon, Leo ;
Hendrix, Maurice ;
Hourlier, Thibaut ;
Johnson, Nathan ;
Kaehaeri, Andreas K. ;
Keefe, Damian ;
Keenan, Stephen ;
Kinsella, Rhoda ;
Komorowska, Monika ;
Koscielny, Gautier ;
Kulesha, Eugene ;
Larsson, Pontus ;
Longden, Ian ;
McLaren, William ;
Muffato, Matthieu ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Riat, Harpreet Singh ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sobral, Daniel ;
Tang, Y. Amy ;
Taylor, Kieron ;
Trevanion, Stephen ;
Vandrovcova, Jana ;
White, Simon ;
Wilson, Mark ;
Wilder, Steven P. ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Durbin, Richard ;
Fernandez-Suarez, Xose M. ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D84-D90
[7]   Architecture of the human regulatory network derived from ENCODE data [J].
Gerstein, Mark B. ;
Kundaje, Anshul ;
Hariharan, Manoj ;
Landt, Stephen G. ;
Yan, Koon-Kiu ;
Cheng, Chao ;
Mu, Xinmeng Jasmine ;
Khurana, Ekta ;
Rozowsky, Joel ;
Alexander, Roger ;
Min, Renqiang ;
Alves, Pedro ;
Abyzov, Alexej ;
Addleman, Nick ;
Bhardwaj, Nitin ;
Boyle, Alan P. ;
Cayting, Philip ;
Charos, Alexandra ;
Chen, David Z. ;
Cheng, Yong ;
Clarke, Declan ;
Eastman, Catharine ;
Euskirchen, Ghia ;
Frietze, Seth ;
Fu, Yao ;
Gertz, Jason ;
Grubert, Fabian ;
Harmanci, Arif ;
Jain, Preti ;
Kasowski, Maya ;
Lacroute, Phil ;
Leng, Jing ;
Lian, Jin ;
Monahan, Hannah ;
O'Geen, Henriette ;
Ouyang, Zhengqing ;
Partridge, E. Christopher ;
Patacsil, Dorrelyn ;
Pauli, Florencia ;
Raha, Debasish ;
Ramirez, Lucia ;
Reddy, Timothy E. ;
Reed, Brian ;
Shi, Minyi ;
Slifer, Teri ;
Wang, Jing ;
Wu, Linfeng ;
Yang, Xinqiong ;
Yip, Kevin Y. ;
Zilberman-Schapira, Gili .
NATURE, 2012, 489 (7414) :91-100
[8]   The human disease network [J].
Goh, Kwang-Il ;
Cusick, Michael E. ;
Valle, David ;
Childs, Barton ;
Vidal, Marc ;
Barabasi, Albert-Laszlo .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (21) :8685-8690
[9]   Transcription factor SIX5 is mutated in patients with branchio-oto-renal syndrome [J].
Hoskins, Bethan E. ;
Cramer, Carl H., II ;
Silvius, Derek ;
Zou, Dan ;
Raymond, Richard M., Jr. ;
Orten, Dana J. ;
Kimberling, William J. ;
Smith, Richard J. H. ;
Weil, Dominique ;
Petit, Christine ;
Otto, Edgar A. ;
Xu, Pin-Xian ;
Hildebrandt, Friedhelm .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :800-804
[10]   Lethality and centrality in protein networks [J].
Jeong, H ;
Mason, SP ;
Barabási, AL ;
Oltvai, ZN .
NATURE, 2001, 411 (6833) :41-42