An initial map of insertion and deletion (INDEL) variation in the human genome

Cited by: 470
Authors
Mills, Ryan E.
Luttig, Christopher T.
Larkins, Christine E.
Beauchamp, Adam
Tsui, Circe
Pittard, W. Stephen
Devine, Scott E. [1]
Affiliations
[1] Emory Univ, Sch Med, Dept Biochem, Atlanta, GA 30322 USA
[2] Emory Univ, Sch Med, Ctr Bioinformat, Atlanta, GA 30322 USA
[3] Emory Univ, Sch Med, Biochem Cell & Dev Biol Grad Program, Atlanta, GA 30322 USA
[4] Emory Univ, Sch Med, Genet & Mol Biol Grad Program, Atlanta, GA 30322 USA
DOI
10.1101/gr.4565806
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Although many studies have sought to identify single nucleotide polymorphisms (SNPs) in humans, few have sought to identify alternative forms of natural genetic variation, such as insertion and deletion (INDEL) polymorphisms. In this report, we describe an initial map of human INDEL variation that contains 415,436 unique INDEL polymorphisms. These INDELs were identified with a computational approach using DNA re-sequencing traces that were originally generated for SNP discovery projects. They range from 1 bp to 9989 bp in length and are split almost equally between insertions and deletions, relative to the chimpanzee genome sequence. Five major classes of INDELs were identified: (1) insertions and deletions of single base pairs, (2) monomeric base pair expansions, (3) multi-base pair expansions of 2-15 bp repeat units, (4) transposon insertions, and (5) INDELs containing random DNA sequences. Our INDELs are distributed throughout the human genome with an average density of one INDEL per 7.2 kb of DNA. Variation hotspots were identified with up to 48-fold regional increases in INDEL and/or SNP variation compared with the chromosomal averages for the same chromosomes. Over 148,000 INDELs (35.7%) were identified within known genes, and 5542 of these INDELs were located in the promoters and exons of genes, where they would be expected to have the greatest influence on gene function. All INDELs in this study have been deposited into dbSNP and have been integrated into maps of human genetic variation that are available to the research community.
Pages: 1182-1190
Page count: 9