Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties

被引:78
作者
Bacolla, Albino [1 ]
Larson, Jacquelynn E. [1 ]
Collins, Jack R. [2 ]
Li, Jian [3 ,4 ]
Milosavljevic, Aleksandar [3 ,4 ]
Stenson, Peter D. [5 ]
Cooper, David N. [5 ]
Wells, Robert D. [1 ]
机构
[1] Texas A&M Univ, Hlth Sci Ctr, Ctr Genome Res, Inst Biosci & Technol, Houston, TX 77030 USA
[2] NCI, Adv Biomed Comp Ctr, Adv Technol Program, SAIC Frederick Inc, Frederick, MD 21702 USA
[3] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[4] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[5] Cardiff Univ, Inst Med Genet, Sch Med, Cardiff CF14 4XN, S Glam, Wales
关键词
D O I
10.1101/gr.078303.108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e. g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra-and trinucleotide repeats revealed an inverse correlation between the stability of folded-back hairpin and quadruplex structures and the sequence representation for repeats >= 30 bp in length in nine vertebrate genomes. Alternatively, the predicted energies of base-stacking interactions correlated directly with the longest length distributions in vertebrate genomes. Genome-wide analyses indicated that unstable sequences, such as CAG: CTG and CCG: CGG, were over-represented in coding regions and that micro/ minisatellites were recruited in genes involved in transcription and signaling pathways, particularly in the nervous system. Microsatellite instability (MSI) is a hallmark of cancer, and length polymorphism within genes can confer susceptibility to inherited disease. Sequences that manifest the highest MSI values also displayed the strongest base-stacking interactions; analyses of 62 tri-and tetranucleotide repeat-containing genes associated with human genetic disease revealed enrichments similar to those noted for micro/ minisatellite-containing genes. We conclude that DNA structure and base-stacking determined the number and length distributions of microsatellite repeats in vertebrate genomes over evolutionary time and that micro/ minisatellites have been recruited to participate in both gene and protein function.
引用
收藏
页码:1545 / 1553
页数:9
相关论文
共 58 条
[1]  
Ahrendt SA, 2000, CANCER RES, V60, P2488
[2]   The ERCC1/XPF endonuclease is required for efficient single-strand annealing and gene conversion in mammalian cells [J].
Al-Minawi, Ali Z. ;
Saleh-Gohari, Nasrollah ;
Helleday, Thomas .
NUCLEIC ACIDS RESEARCH, 2008, 36 (01) :1-9
[3]  
[Anonymous], 1980, BIOPHYS CHEM
[4]   THERMODYNAMICS OF ONE-STRANDED HELIX-COIL EQUILIBRIUM IN POLYADENYLIC ACID [J].
APPLEQUI.J ;
DAMLE, V .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1966, 88 (17) :3895-&
[5]   Non-B DNA conformations, genomic rearrangements, and human disease [J].
Bacolla, A ;
Wells, RD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (46) :47411-47414
[6]   Recombinant human DNA (cytosine-5) methyltransferase -: III.: Allosteric control, reaction order, and influence of plasmid topology and triplet repeat length on methylation of the fragile X CGG•CCG sequence [J].
Bacolla, A ;
Pradhan, S ;
Larson, JE ;
Roberts, RJ ;
Wells, RD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (21) :18605-18613
[7]   Long homopurine•homopyrimidine sequences are characteristic of genes expressed in brain and the pseudoautosomal region [J].
Bacolla, Albino ;
Collins, Jack R. ;
Gold, Bert ;
Chuzhanova, Nadia ;
Yi, Ming ;
Stephens, Robert M. ;
Stefanov, Stefan ;
Olsh, Adam ;
Jakupciak, John P. ;
Dean, Michael ;
Lempicki, Richard A. ;
Cooper, David N. ;
Wells, Robert D. .
NUCLEIC ACIDS RESEARCH, 2006, 34 (09) :2663-2675
[8]   Transcription increases the deletion frequency of long CTG center dot CAG triplet repeats from plasmids in Escherichia coli [J].
Bowater, RP ;
Jaworski, A ;
Larson, JE ;
Parniewski, P ;
Wells, RD .
NUCLEIC ACIDS RESEARCH, 1997, 25 (14) :2861-2868
[9]   Expansion of GAA trinucleotide repeats in mammals [J].
Clark, RM ;
Bhaskar, SS ;
Miyahara, M ;
Dalgliesh, GL ;
Bidichandani, SI .
GENOMICS, 2006, 87 (01) :57-67
[10]   An exhaustive DNA micro-satellite map of the human genome using high performance computing [J].
Collins, JR ;
Stephens, RM ;
Gold, B ;
Long, B ;
Dean, M ;
Burt, SK .
GENOMICS, 2003, 82 (01) :10-19