An initial map of insertion and deletion (INDEL) variation in the human genome

Cited by: 470
Authors
Mills, Ryan E.
Luttig, Christopher T.
Larkins, Christine E.
Beauchamp, Adam
Tsui, Circe
Pittard, W. Stephen
Devine, Scott E. [1]
Affiliations
[1] Emory Univ, Sch Med, Dept Biochem, Atlanta, GA 30322 USA
[2] Emory Univ, Sch Med, Ctr Bioinformat, Atlanta, GA 30322 USA
[3] Emory Univ, Sch Med, Biochem Cell & Dev Biol Grad Program, Atlanta, GA 30322 USA
[4] Emory Univ, Sch Med, Genet & Mol Biol Grad Program, Atlanta, GA 30322 USA
DOI
10.1101/gr.4565806
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Although many studies have been conducted to identify single nucleotide polymorphisms (SNPs) in humans, few have sought to identify alternative forms of natural genetic variation, such as insertion and deletion (INDEL) polymorphisms. In this report, we describe an initial map of human INDEL variation that contains 415,436 unique INDEL polymorphisms. These INDELs were identified with a computational approach using DNA re-sequencing traces that were originally generated for SNP discovery projects. They range from 1 bp to 9989 bp in length and are split almost equally between insertions and deletions, relative to the chimpanzee genome sequence. Five major classes of INDELs were identified: (1) insertions and deletions of single base pairs, (2) monomeric base pair expansions, (3) multi-base pair expansions of 2-15 bp repeat units, (4) transposon insertions, and (5) INDELs containing random DNA sequences. Our INDELs are distributed throughout the human genome with an average density of one INDEL per 7.2 kb of DNA. Variation hotspots were identified with up to 48-fold regional increases in INDEL and/or SNP variation compared with the averages for the chromosomes on which they lie. Over 148,000 INDELs (35.7%) were identified within known genes, and 5542 of these INDELs were located in the promoters and exons of genes, where gene function would be expected to be most strongly influenced. All INDELs in this study have been deposited into dbSNP and have been integrated into maps of human genetic variation that are available to the research community.
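The headline figures in the abstract can be cross-checked with simple arithmetic. The sketch below is only an illustrative consistency check, not part of the authors' analysis: the function and constant names are hypothetical, and the only inputs are the three numbers quoted above (415,436 INDELs, one INDEL per 7.2 kb, 35.7% within known genes).

```python
# Figures quoted in the abstract (the names are illustrative assumptions).
TOTAL_INDELS = 415_436      # unique INDEL polymorphisms in the map
KB_PER_INDEL = 7.2          # average spacing: one INDEL per 7.2 kb
GENIC_FRACTION = 0.357      # share of INDELs within known genes


def implied_span_gb(total_indels: int, kb_per_indel: float) -> float:
    """Genome span (in Gb) implied by the INDEL count and average density."""
    return total_indels * kb_per_indel * 1_000 / 1e9


def implied_count(total_indels: int, fraction: float) -> int:
    """Number of INDELs implied by a stated fraction of the total."""
    return round(total_indels * fraction)


span_gb = implied_span_gb(TOTAL_INDELS, KB_PER_INDEL)
genic_indels = implied_count(TOTAL_INDELS, GENIC_FRACTION)

print(f"Implied genome span: {span_gb:.2f} Gb")
print(f"Implied genic INDELs: {genic_indels}")
```

Under these inputs the implied span is about 2.99 Gb, which is in line with the size of the human genome, and the implied genic count is roughly 148,300, which agrees with the abstract's "over 148,000".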
Pages: 1182-1190
Page count: 9