Sequencing and analysis of an Irish human genome

被引:35
作者
Tong, Pin [1 ]
Prendergast, James G. D. [2 ]
Lohan, Amanda J. [1 ]
Farrington, Susan M. [2 ,3 ]
Cronin, Simon [4 ]
Friel, Nial [5 ]
Bradley, Dan G. [6 ]
Hardiman, Orla [7 ,8 ]
Evans, Alex [9 ]
Wilson, James F. [10 ]
Loftus, Brendan [1 ]
机构
[1] Univ Coll Dublin, Conway Inst, Dublin 4, Ireland
[2] Western Gen Hosp, MRC Human Genet Unit, Edinburgh EH4 2XU, Midlothian, Scotland
[3] Univ Edinburgh, Inst Genet & Mol Med, Colon Canc Genet Grp & Acad Coloproctol, Edinburgh EH4 2XU, Midlothian, Scotland
[4] Royal Coll Surgeons Ireland, Dept Clin Neurol Sci, Dublin 2, Ireland
[5] Univ Coll Dublin, Sch Math Sci, Dublin 4, Ireland
[6] Trinity Coll Dublin, Smurfit Inst Genet, Dublin 2, Ireland
[7] Beaumont Hosp, Dept Neurol, Dublin 9, Ireland
[8] Trinity Coll Dublin, Dublin 9, Ireland
[9] Univ Coll Dublin, Sch Agr Food Sci & Vet Med, Dublin 4, Ireland
[10] Univ Edinburgh, Ctr Populat Hlth Sci, Edinburgh EH8 9AG, Midlothian, Scotland
来源
GENOME BIOLOGY | 2010年 / 11卷 / 09期
基金
爱尔兰科学基金会;
关键词
WIDE ASSOCIATION; POSITIVE SELECTION; NATURAL-SELECTION; DNA; INFERENCE; LOCI; DUPLICATION; INHERITANCE; GEOGRAPHY; GENES;
D O I
10.1186/gb-2010-11-9-r91
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence. Results: Using sequence data from a branch of the European ancestral tree as yet unsequenced, we identify variants that may be specific to this population. Through comparisons with HapMap and previous genetic association studies, we identified novel disease-associated variants, including a novel nonsense variant putatively associated with inflammatory bowel disease. We describe a novel method for improving SNP calling accuracy at low genome coverage using haplotype information. This analysis has implications for future re-sequencing studies and validates the imputation of Irish haplotypes using data from the current Human Genome Diversity Cell Line Panel (HGDP CEPH). Finally, we identify gene duplication events as constituting significant targets of recent positive selection in the human lineage. Conclusions: Our findings show that there remains utility in generating whole genome sequences to illustrate both general principles and reveal specific instances of human biology. With increasing access to low cost sequencing we would predict that even armed with the resources of a small research group a number of similar initiatives geared towards answering specific biological questions will emerge.
引用
收藏
页数:14
相关论文
共 61 条
[51]   Complete Khoisan and Bantu genomes from southern Africa [J].
Schuster, Stephan C. ;
Miller, Webb ;
Ratan, Aakrosh ;
Tomsho, Lynn P. ;
Giardine, Belinda ;
Kasson, Lindsay R. ;
Harris, Robert S. ;
Petersen, Desiree C. ;
Zhao, Fangqing ;
Qi, Ji ;
Alkan, Can ;
Kidd, Jeffrey M. ;
Sun, Yazhou ;
Drautz, Daniela I. ;
Bouffard, Pascal ;
Muzny, Donna M. ;
Reid, Jeffrey G. ;
Nazareth, Lynne V. ;
Wang, Qingyu ;
Burhans, Richard ;
Riemer, Cathy ;
Wittekindt, Nicola E. ;
Moorjani, Priya ;
Tindall, Elizabeth A. ;
Danko, Charles G. ;
Teo, Wee Siang ;
Buboltz, Anne M. ;
Zhang, Zhenhai ;
Ma, Qianyi ;
Oosthuysen, Arno ;
Steenkamp, Abraham W. ;
Oostuisen, Hermann ;
Venter, Philippus ;
Gajewski, John ;
Zhang, Yu ;
Pugh, B. Franklin ;
Makova, Kateryna D. ;
Nekrutenko, Anton ;
Mardis, Elaine R. ;
Patterson, Nick ;
Pringle, Tom H. ;
Chiaromonte, Francesca ;
Mullikin, James C. ;
Eichler, Evan E. ;
Hardison, Ross C. ;
Gibbs, Richard A. ;
Harkins, Timothy T. ;
Hayes, Vanessa M. .
NATURE, 2010, 463 (7283) :943-947
[52]   The Archaeogenetics of Europe [J].
Soares, Pedro ;
Achilli, Alessandro ;
Semino, Ornella ;
Davies, William ;
Macaulays, Vincent ;
Bandelt, Hans-Juergen ;
Torroni, Antonio ;
Richards, Martin B. .
CURRENT BIOLOGY, 2010, 20 (04) :R174-R183
[53]   Structural Variation in the Human Genome and its Role in Disease [J].
Stankiewicz, Pawel ;
Lupski, James R. .
ANNUAL REVIEW OF MEDICINE, 2010, 61 :437-455
[54]   Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes [J].
Studer, Romain A. ;
Penel, Simon ;
Duret, Laurent ;
Robinson-Rechavi, Marc .
GENOME RESEARCH, 2008, 18 (09) :1393-1402
[55]  
TAJIMA F, 1989, GENETICS, V123, P585
[56]   Genome-wide association study identifies 19p13.3 (UNC13A) and 9p21.2 as susceptibility loci for sporadic amyotrophic lateral sclerosis [J].
van Es, Michael A. ;
Veldink, Jan H. ;
Saris, Christiaan G. J. ;
Blauw, Hylke M. ;
van Vught, Paul W. J. ;
Birve, Anna ;
Lemmens, Robin ;
Schelhaas, Helenius J. ;
Groen, Ewout J. N. ;
Huisman, Mark H. B. ;
van der Kooi, Anneke J. ;
de Visser, Marianne ;
Dahlberg, Caroline ;
Estrada, Karol ;
Rivadeneira, Fernando ;
Hofman, Albert ;
Zwarts, Machiel J. ;
van Doormaal, Perry T. C. ;
Rujescu, Dan ;
Strengman, Eric ;
Giegling, Ina ;
Muglia, Pierandrea ;
Tomik, Barbara ;
Slowik, Agnieszka ;
Uitterlinden, Andre G. ;
Hendrich, Corinna ;
Waibel, Stefan ;
Meyer, Thomas ;
Ludolph, Albert C. ;
Glass, Jonathan D. ;
Purcell, Shaun ;
Cichon, Sven ;
Noethen, Markus M. ;
Wichmann, H-Erich ;
Schreiber, Stefan ;
Vermeulen, Sita H. H. M. ;
Kiemeney, Lambertus A. ;
Wokke, John H. J. ;
Cronin, Simon ;
McLaughlin, Russell L. ;
Hardiman, Orla ;
Fumoto, Katsumi ;
Pasterkamp, R. Jeroen ;
Meininger, Vincent ;
Melki, Judith ;
Leigh, P. Nigel ;
Shaw, Christopher E. ;
Landers, John E. ;
Al-Chalabi, Ammar ;
Brown, Robert H., Jr. .
NATURE GENETICS, 2009, 41 (10) :1083-U53
[57]   EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates [J].
Vilella, Albert J. ;
Severin, Jessica ;
Ureta-Vidal, Abel ;
Heng, Li ;
Durbin, Richard ;
Birney, Ewan .
GENOME RESEARCH, 2009, 19 (02) :327-335
[58]  
Voight BF, 2006, PLOS BIOL, V4, P446, DOI 10.1371/journal.pbio.0040072
[59]   The diploid genome sequence of an Asian individual [J].
Wang, Jun ;
Wang, Wei ;
Li, Ruiqiang ;
Li, Yingrui ;
Tian, Geng ;
Goodman, Laurie ;
Fan, Wei ;
Zhang, Junqing ;
Li, Jun ;
Zhang, Juanbin ;
Guo, Yiran ;
Feng, Binxiao ;
Li, Heng ;
Lu, Yao ;
Fang, Xiaodong ;
Liang, Huiqing ;
Du, Zhenglin ;
Li, Dong ;
Zhao, Yiqing ;
Hu, Yujie ;
Yang, Zhenzhen ;
Zheng, Hancheng ;
Hellmann, Ines ;
Inouye, Michael ;
Pool, John ;
Yi, Xin ;
Zhao, Jing ;
Duan, Jinjie ;
Zhou, Yan ;
Qin, Junjie ;
Ma, Lijia ;
Li, Guoqing ;
Yang, Zhentao ;
Zhang, Guojie ;
Yang, Bin ;
Yu, Chang ;
Liang, Fang ;
Li, Wenjie ;
Li, Shaochuan ;
Li, Dawei ;
Ni, Peixiang ;
Ruan, Jue ;
Li, Qibin ;
Zhu, Hongmei ;
Liu, Dongyuan ;
Lu, Zhike ;
Li, Ning ;
Guo, Guangwu ;
Zhang, Jianguo ;
Ye, Jia .
NATURE, 2008, 456 (7218) :60-U1
[60]   The complete genome of an individual by massively parallel DNA sequencing [J].
Wheeler, David A. ;
Srinivasan, Maithreyan ;
Egholm, Michael ;
Shen, Yufeng ;
Chen, Lei ;
McGuire, Amy ;
He, Wen ;
Chen, Yi-Ju ;
Makhijani, Vinod ;
Roth, G. Thomas ;
Gomes, Xavier ;
Tartaro, Karrie ;
Niazi, Faheem ;
Turcotte, Cynthia L. ;
Irzyk, Gerard P. ;
Lupski, James R. ;
Chinault, Craig ;
Song, Xing-zhi ;
Liu, Yue ;
Yuan, Ye ;
Nazareth, Lynne ;
Qin, Xiang ;
Muzny, Donna M. ;
Margulies, Marcel ;
Weinstock, George M. ;
Gibbs, Richard A. ;
Rothberg, Jonathan M. .
NATURE, 2008, 452 (7189) :872-U5