Amino acid runs in eukaryotic proteomes and disease associations

被引:174
作者
Karlin, S [1 ]
Brocchieri, L
Bergman, A
Mrázek, J
Gentles, AJ
机构
[1] Stanford Univ, Dept Math, Stanford, CA 94305 USA
[2] Stanford Univ, Ctr Computat Genet & Biol Modelling, Stanford, CA 94305 USA
关键词
D O I
10.1073/pnas.012608599
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a comparative proteome analysis of the five complete eukaryoticgenomes (human, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana), focusing on individual and multiple amino acid runs, charge and hydrophobic runs. We found that human proteins with multiple long runs are often associated with diseases, these include long glutamine runs that induce neurological disorders, various cancers, categories of leukemias (mostly involving chromosomal translocations), and an abundance of Ca2+ and K+ channel proteins. Many human proteins with multiple runs function in development and/or transcription regulation and are Drosophila homeotic homologs. A large number of these proteins are expressed in the nervous system. More than 80% of Drosophila proteins with multiple runs seem to function in transcription regulation. The most frequent amino acid runs in Drosophila sequences occur for glutamine, alanine, and serine, whereas human sequences highlight glutamate, proline, and leucine. The most frequent runs in yeast are of serine, glutamine, and acidic residues. Compared with the other eukaryotic proteomes, amino acid runs are significantly more abundant in the fly. This finding might be interpreted in terms of innate differences in DNA-replication processes, repair mechanisms, DNA-modification systems, and mutational biases. There are striking differences in amino acid runs for glutamine, asparagine, and leucine among the five proteomes.
引用
收藏
页码:333 / 338
页数:6
相关论文
共 32 条
  • [1] Alberts B., 1994, MOL BIOL CELL
  • [2] [Anonymous], GENOME BIOL
  • [3] Impairment of the ubiquitin-proteasome system by protein aggregation
    Bence, NF
    Sampat, RM
    Kopito, RR
    [J]. SCIENCE, 2001, 292 (5521) : 1552 - 1555
  • [4] UNITS OF DNA-REPLICATION IN DROSOPHILA-MELANOGASTER CHROMOSOMES
    BLUMENTH.AB
    KRIEGSTE.HJ
    HOGNESS, DS
    [J]. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 1973, 38 : 205 - 223
  • [5] Chai YH, 1999, J NEUROSCI, V19, P10338
  • [6] Trinucleotide repeats: Mechanisms and pathophysiology
    Cummings, CJ
    Zoghbi, HY
    [J]. ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2000, 1 : 281 - 328
  • [7] NEITHER ENHANCED REMOVAL OF CYCLOBUTANE PYRIMIDINE DIMERS NOR STRAND-SPECIFIC REPAIR IS FOUND AFTER TRANSCRIPTION INDUCTION OF THE BETA(3)-TUBULIN GENE IN A DROSOPHILA EMBRYONIC-CELL LINE KC
    DECOCK, JGR
    KLINK, EC
    FERRO, W
    LOHMAN, PHM
    EEKEN, JCJ
    [J]. MUTATION RESEARCH, 1992, 293 (01): : 11 - 20
  • [8] HUMAN GENETIC-DISEASES DUE TO CODON REITERATION - RELATIONSHIP TO AN EVOLUTIONARY MECHANISM
    GREEN, H
    [J]. CELL, 1993, 74 (06) : 955 - 956
  • [9] SEQUENCE-DEPENDENT DNA-STRUCTURE - THE ROLE OF BASE STACKING INTERACTIONS
    HUNTER, CA
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1993, 230 (03) : 1025 - 1054
  • [10] QUANTILE DISTRIBUTIONS OF AMINO-ACID USAGE IN PROTEIN CLASSES
    KARLIN, S
    BLAISDELL, BE
    BUCHER, P
    [J]. PROTEIN ENGINEERING, 1992, 5 (08): : 729 - 738