Comparative analysis of amino acid repeats in rodents and humans

Cited by: 138
Authors
Albà, MM
Guigó, R
Affiliations
[1] Univ Pompeu Fabra, Inst Municipal Invest Med, Dept Ciencias Expt & Salut, Grp Recerca Informat Biomed, Barcelona 08003, Spain
[2] Ctr Regulacio Genom, Barcelona 08003, Spain
DOI
10.1101/gr.1925704
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Amino acid tandem repeats, also called homopolymeric tracts, are extremely abundant in eukaryotic proteins. To gain insight into the genome-wide evolution of these regions in mammals, we analyzed the repeat content in a large data set of rat-mouse-human orthologs. Our results show that human proteins contain more amino acid repeats than rodent proteins and that trinucleotide repeats are also more abundant in human coding sequences. Using the human species as an outgroup, we were able to address differences in repeat loss and repeat gain in the rat and mouse lineages. In this data set, mouse proteins contain substantially more repeats than rat proteins, which can be at least partly attributed to a higher repeat loss in the rat lineage. The data are consistent with a role for trinucleotide slippage in the generation of novel amino acid repeats. We confirm the previously observed functional bias of proteins with repeats, with overrepresentation of transcription factors and DNA-binding proteins. We show that genes encoding amino acid repeats tend to have an unusually high GC content, and that differences in coding GC content among orthologs are directly related to the presence/absence of repeats. We propose that the different GC-content isochore structure in rodents and humans may result in an increased amino acid repeat prevalence in the human lineage.
Pages: 549-554
Page count: 6