Multiplex padlock targeted sequencing reveals human hypermutable CpG variations

被引:49
作者
Li, Jin Billy [1 ]
Gao, Yuan [2 ]
Aach, John [1 ]
Zhang, Kun [3 ]
Kryukov, Gregory V. [4 ]
Xie, Bin [2 ]
Ahlford, Annika [1 ]
Yoon, Jung-Ki [1 ]
Rosenbaum, Abraham M. [1 ]
Zaranek, Alexander Wait [1 ]
LeProust, Emily [5 ]
Sunyaev, Shamil R. [4 ]
Church, George M. [1 ]
机构
[1] Harvard Univ, Dept Genet, Sch Med, Boston, MA 02115 USA
[2] Virginia Commonwealth Univ, Ctr Study Biol Complex, Richmond, VA 23284 USA
[3] Univ Calif San Diego, Dept Bioengn, San Diego, CA 92093 USA
[4] Brigham & Womens Hosp, Div Genet, Dept Med, Boston, MA 02115 USA
[5] Agilent Technol, Genom Solut Unit, Santa Clara, CA 95051 USA
关键词
MUTATION-RATES; LARGE SETS; DNA; AMPLIFICATION; SELECTION; DINUCLEOTIDE; POLYMORPHISM; CONTRIBUTE; DATABASE; SCALE;
D O I
10.1101/gr.092213.109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Utilizing the full power of next-generation sequencing often requires the ability to perform large-scale multiplex enrichment of many specific genomic loci in multiple samples. Several technologies have been recently developed but await substantial improvements. We report the 10,000-fold improvement of a previously developed padlock-based approach, and apply the assay to identifying genetic variations in hypermutable CpG regions across human chromosome 21. From similar to 3 million reads derived from a single Illumina Genome Analyzer lane, similar to 94% (similar to 50,500) target sites can be observed with at least one read. The uniformity of coverage was also greatly improved; up to 93% and 57% of all targets fell within a 100- and 10-fold coverage range, respectively. Alleles at >400,000 target base positions were determined across six subjects and examined for single nucleotide polymorphisms ( SNPs), and the concordance with independently obtained genotypes was 98.4%-100%. We detected >500 SNPs not currently in dbSNP, 362 of which were in targeted CpG locations. Transitions in CpG sites were at least 13.7 times more abundant than non-CpG transitions. Fractions of polymorphic CpG sites are lower in CpG-rich regions and show higher correlation with human-chimpanzee divergence within CpG versus non-CpG sites. This is consistent with the hypothesis that methylation rate heterogeneity along chromosomes contributes to mutation rate variation in humans. Our success suggests that targeted CpG resequencing is an efficient way to identify common and rare genetic variations. In addition, the significantly improved padlock capture technology can be readily applied to other projects that require multiplex sample preparation.
引用
收藏
页码:1606 / 1615
页数:10
相关论文
共 44 条
[11]   Multigene amplification and massively parallel sequencing for cancer mutation discovery [J].
Dahl, Fredrik ;
Stenberg, Johan ;
Fredriksson, Simon ;
Welch, Katrina ;
Zhang, Michael ;
Nilsson, Mats ;
Bicknell, David ;
Bodmer, Walter F. ;
Davis, Ronald W. ;
Ji, Hanlee .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (22) :9387-9392
[12]   Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming [J].
Deng, Jie ;
Shoemaker, Robert ;
Xie, Bin ;
Gore, Athurva ;
LeProust, Emily M. ;
Antosiewicz-Bourget, Jessica ;
Egli, Dieter ;
Maherali, Nimet ;
Park, In-Hyun ;
Yu, Junying ;
Daley, George Q. ;
Eggan, Kevin ;
Hochedlinger, Konrad ;
Thomson, James ;
Wang, Wei ;
Gao, Yuan ;
Zhang, Kun .
NATURE BIOTECHNOLOGY, 2009, 27 (04) :353-360
[13]   A second generation human haplotype map of over 3.1 million SNPs [J].
Frazer, Kelly A. ;
Ballinger, Dennis G. ;
Cox, David R. ;
Hinds, David A. ;
Stuve, Laura L. ;
Gibbs, Richard A. ;
Belmont, John W. ;
Boudreau, Andrew ;
Hardenbol, Paul ;
Leal, Suzanne M. ;
Pasternak, Shiran ;
Wheeler, David A. ;
Willis, Thomas D. ;
Yu, Fuli ;
Yang, Huanming ;
Zeng, Changqing ;
Gao, Yang ;
Hu, Haoran ;
Hu, Weitao ;
Li, Chaohua ;
Lin, Wei ;
Liu, Siqi ;
Pan, Hao ;
Tang, Xiaoli ;
Wang, Jian ;
Wang, Wei ;
Yu, Jun ;
Zhang, Bo ;
Zhang, Qingrun ;
Zhao, Hongbin ;
Zhao, Hui ;
Zhou, Jun ;
Gabriel, Stacey B. ;
Barry, Rachel ;
Blumenstiel, Brendan ;
Camargo, Amy ;
Defelice, Matthew ;
Faggart, Maura ;
Goyette, Mary ;
Gupta, Supriya ;
Moore, Jamie ;
Nguyen, Huy ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Roy, Jessica ;
Stahl, Erich ;
Winchester, Ellen ;
Ziaugra, Liuda ;
Altshuler, David ;
Shen, Yan .
NATURE, 2007, 449 (7164) :851-U3
[14]   The scale of mutational variation in the murid genome [J].
Gaffney, DJ ;
Keightley, PD .
GENOME RESEARCH, 2005, 15 (08) :1086-1094
[15]   Fixing the front end [J].
Garber, Ken .
NATURE BIOTECHNOLOGY, 2008, 26 (10) :1101-1104
[16]   Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing [J].
Gnirke, Andreas ;
Melnikov, Alexandre ;
Maguire, Jared ;
Rogov, Peter ;
LeProust, Emily M. ;
Brockman, William ;
Fennell, Timothy ;
Giannoukos, Georgia ;
Fisher, Sheila ;
Russ, Carsten ;
Gabriel, Stacey ;
Jaffe, David B. ;
Lander, Eric S. ;
Nusbaum, Chad .
NATURE BIOTECHNOLOGY, 2009, 27 (02) :182-189
[17]   Multiplexed genotyping with sequence-tagged molecular inversion probes [J].
Hardenbol, P ;
Banér, J ;
Jain, M ;
Nilsson, M ;
Namsaraev, EA ;
Karlin-Neumann, GA ;
Fakhrai-Rad, H ;
Ronaghi, M ;
Willis, TD ;
Landegren, U ;
Davis, RW .
NATURE BIOTECHNOLOGY, 2003, 21 (06) :673-678
[18]   Genome-wide in situ exon capture for selective resequencing [J].
Hodges, Emily ;
Xuan, Zhenyu ;
Balija, Vivekanand ;
Kramer, Melissa ;
Molla, Michael N. ;
Smith, Steven W. ;
Middle, Christina M. ;
Rodesch, Matthew J. ;
Albert, Thomas J. ;
Hannon, Gregory J. ;
McCombie, W. Richard .
NATURE GENETICS, 2007, 39 (12) :1522-1527
[19]   Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution [J].
Hwang, DG ;
Green, P .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (39) :13994-14001
[20]   Rare independent mutations in renal salt handling genes contribute to blood pressure variation [J].
Ji, Weizhen ;
Foo, Jia Nee ;
O'Roak, Brian J. ;
Zhao, Hongyu ;
Larson, Martin G. ;
Simon, David B. ;
Newton-Cheh, Christopher ;
State, Matthew W. ;
Levy, Daniel ;
Lifton, Richard P. .
NATURE GENETICS, 2008, 40 (05) :592-599