What Is a Microsatellite: A Computational and Experimental Definition Based upon Repeat Mutational Behavior at A/T and GT/AC Repeats
被引:101
作者:
Kelkar, Yogeshwar D.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Kelkar, Yogeshwar D.
[1
,2
]
Strubczewski, Noelle
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Coll Med, Gittlen Canc Res Fdn, Dept Pathol, Hershey, PA USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Strubczewski, Noelle
[1
,3
]
Hile, Suzanne E.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Coll Med, Gittlen Canc Res Fdn, Dept Pathol, Hershey, PA USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Hile, Suzanne E.
[1
,3
]
Chiaromonte, Francesca
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Dept Stat, University Pk, PA 16802 USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Chiaromonte, Francesca
[1
,4
]
Eckert, Kristin A.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Coll Med, Gittlen Canc Res Fdn, Dept Pathol, Hershey, PA USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Eckert, Kristin A.
[1
,3
]
Makova, Kateryna D.
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
Makova, Kateryna D.
[1
,2
]
机构:
[1] Penn State Univ, Ctr Med Genom, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[3] Penn State Univ, Coll Med, Gittlen Canc Res Fdn, Dept Pathol, Hershey, PA USA
[4] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
Microsatellites are abundant in eukaryotic genomes and have high rates of strand slippage-induced repeat number alterations. They are popular genetic markers, and their mutations are associated with numerous neurological diseases. However, the minimal number of repeats required to constitute a microsatellite has been debated, and a definition of a microsatellite that considers its mutational behavior has been lacking. To define a microsatellite, we investigated slippage dynamics for a range of repeat sizes, utilizing two approaches. Computationally, we assessed length polymorphism at repeat loci in ten ENCODE regions resequenced in four human populations, assuming that the occurrence of polymorphism reflects strand slippage rates. Experimentally, we determined the in vitro DNA polymerase-mediated strand slippage error rates as a function of repeat number. In both approaches, we compared strand slippage rates at tandem repeats with the background slippage rates. We observed two distinct modes of mutational behavior. At small repeat numbers, slippage rates were low and indistinguishable from background measurements. A marked transition in mutability was observed as the repeat array lengthened, such that slippage rates at large repeat numbers were significantly higher than the background rates. For both mononucleotide and dinucleotide microsatellites studied, the transition length corresponded to a similar number of nucleotides (approximately 10). Thus, microsatellite threshold is determined not by the presence/absence of strand slippage at repeats but by an abrupt alteration in slippage rates relative to background. These findings have implications for understanding microsatellite mutagenesis, standardization of genome-wide microsatellite analyses, and predicting polymorphism levels of individual microsatellite loci.