SEQUENCES WITH UNUSUAL AMINO-ACID COMPOSITIONS

被引:111
作者
WOOTTON, JC
机构
[1] National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, 8th Floor
关键词
D O I
10.1016/S0959-440X(94)90111-2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Amino acid sequences of very non-random composition ('low-complexity' segments) are abundant in natural proteins. From recent statistical analyses of protein sequence databases, approximately 15% of the residues occur in segments of extreme compositional bias, and approximately 34% of proteins have at least one such interspersed segment. Sequences of many elongated non-globular domains also have non-random compositional bias, and these regions increase the proportion of residues in statistically deviant segments to approximately 25% of the database. In contrast, less than 1% of residues in known ordered crystal structures are in segments of reduced complexity. Increasingly, low-complexity segments have been implicated in crucial biological functions, shown by genetic engineering and mutagenesis experiments, variations in human disease and locations of autoimmune epitopes, but relatively little is known about their range of possible molecular structures, dynamics and interactions.
引用
收藏
页码:413 / 421
页数:9
相关论文
共 56 条
[1]   ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES [J].
ALTSCHUL, SF ;
BOGUSKI, MS ;
GISH, W ;
WOOTTON, JC .
NATURE GENETICS, 1994, 6 (02) :119-129
[2]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK, RECENT DEVELOPMENTS [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3093-3096
[3]   THE PIR-INTERNATIONAL DATABASES [J].
BARKER, W ;
GEORGE, DG ;
MEWES, HW ;
PFEIFFER, F ;
TSUGITA, A .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3089-3092
[4]   3-DIMENSIONAL STRUCTURE OF THE ALKALINE PROTEASE OF PSEUDOMONAS-AERUGINOSA - A 2-DOMAIN PROTEIN WITH A CALCIUM-BINDING PARALLEL-BETA ROLL MOTIF [J].
BAUMANN, U ;
WU, S ;
FLAHERTY, KM ;
MCKAY, DB .
EMBO JOURNAL, 1993, 12 (09) :3357-3364
[5]   GENBANK [J].
BENSON, D ;
LIPMAN, DJ ;
OSTELL, J .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :2963-2965
[6]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[7]   VERY LONG CHARGE RUNS IN SYSTEMIC LUPUS ERYTHEMATOSUS-ASSOCIATED AUTOANTIGENS [J].
BRENDEL, V ;
DOHLMAN, J ;
BLAISDELL, BE ;
KARLIN, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (04) :1536-1540
[8]  
BRENDEL V, 1989, P NATL ACAD SCI USA, V86, P5696
[9]   THE CLASSIFICATION AND ORIGINS OF PROTEIN FOLDING PATTERNS [J].
CHOTHIA, C ;
FINKELSTEIN, AV .
ANNUAL REVIEW OF BIOCHEMISTRY, 1990, 59 :1007-1039
[10]  
CHOTHIA C, 1992, NATURE, V357, P453