Trinucleotide repeats and long homopeptides in genes and proteins associated with nervous system disease and development

被引:158
作者
Karlin, S
Burge, C
机构
[1] Department of Mathematics, Stanford University, Stanford
关键词
D O I
10.1073/pnas.93.4.1560
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Several human neurological disorders are associated with proteins containing abnormally long runs of glutamine residues. Strikingly, most of these proteins contain two or more additional long runs of amino acids other than glutamine. We screened the current human, mouse, Drosophila, yeast, and Escherichia coli protein sequence data bases and identified all proteins containing multiple long homopeptides. This search found multiple long homopeptides in about 12% of Drosophila proteins but in only about 1.7% of human, mouse, and yeast proteins and none among E. coli proteins. Most of these sequences show other unusual sequence features, including multiple charge clusters and excessive counts of homopeptides of length greater than or equal to two amino acid residues. Intriguingly, a large majority of the identified Drosophila proteins are essential developmental proteins and, in particular, most play a role in central nervous system development. Almost half of the human and mouse proteins identified are homeotic homologs. The role of long homopeptides in fine-tuning protein conformation for multiple functional activities is discussed. The relative contributions of strand slippage and of dynamic mutation are also addressed. Several new experiments are proposed.
引用
收藏
页码:1560 / 1565
页数:6
相关论文
共 22 条
  • [1] VERY LONG CHARGE RUNS IN SYSTEMIC LUPUS ERYTHEMATOSUS-ASSOCIATED AUTOANTIGENS
    BRENDEL, V
    DOHLMAN, J
    BLAISDELL, BE
    KARLIN, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (04) : 1536 - 1540
  • [2] METHODS AND ALGORITHMS FOR STATISTICAL-ANALYSIS OF PROTEIN SEQUENCES
    BRENDEL, V
    BUCHER, P
    NOURBAKHSH, IR
    BLAISDELL, BE
    KARLIN, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (06) : 2002 - 2006
  • [3] ASSOCIATION OF CHARGE CLUSTERS WITH FUNCTIONAL DOMAINS OF CELLULAR TRANSCRIPTION FACTORS
    BRENDEL, V
    KARLIN, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (15) : 5698 - 5702
  • [4] ANALYSIS OF SP1 INVIVO REVEALS MULTIPLE TRANSCRIPTIONAL DOMAINS, INCLUDING A NOVEL GLUTAMINE-RICH ACTIVATION MOTIF
    COUREY, AJ
    TJIAN, R
    [J]. CELL, 1988, 55 (05) : 887 - 898
  • [5] INACTIVATION OF THE MOUSE HUNTINGTONS-DISEASE GENE HOMOLOG HDH
    DUYAO, MP
    AUERBACH, AB
    RYAN, A
    PERSICHETTI, F
    BARNES, GT
    MCNEIL, SM
    GE, P
    VONSATTEL, JP
    GUSELLA, JF
    JOYNER, AL
    MACDONALD, ME
    [J]. SCIENCE, 1995, 269 (5222) : 407 - 410
  • [6] ACROSIN AND THE ACROSOME IN HUMAN SPERMATOGENESIS
    FLORKEGERLOFF, S
    TOPFERPETERSEN, E
    MULLERESTERL, W
    SCHILL, WB
    ENGEL, W
    [J]. HUMAN GENETICS, 1983, 65 (01) : 61 - 67
  • [7] TRANSCRIPTIONAL ACTIVATION MODULATED BY HOMOPOLYMERIC GLUTAMINE AND PROLINE STRETCHES
    GERBER, HP
    SEIPEL, K
    GEORGIEV, O
    HOFFERER, M
    HUG, M
    RUSCONI, S
    SCHAFFNER, W
    [J]. SCIENCE, 1994, 263 (5148) : 808 - 811
  • [8] HUMAN GENETIC-DISEASES DUE TO CODON REITERATION - RELATIONSHIP TO AN EVOLUTIONARY MECHANISM
    GREEN, H
    [J]. CELL, 1993, 74 (06) : 955 - 956
  • [9] HOPKIN K, 1995, J NIH RES, V1, P45
  • [10] 2 TYPES OF INACTIVATION IN SHAKER K+ CHANNELS - EFFECTS OF ALTERATIONS IN THE CARBOXY-TERMINAL REGION
    HOSHI, T
    ZAGOTTA, WN
    ALDRICH, RW
    [J]. NEURON, 1991, 7 (04) : 547 - 556