A METHOD TO RECOGNIZE DISTANT REPEATS IN PROTEIN SEQUENCES

Cited by: 49
Authors
HERINGA, J
ARGOS, P
Affiliation
[1] European Molecular Biology Laboratory, Heidelberg
Source
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 1993 / Vol. 17 / Issue 04
Keywords
PROTEIN SEQUENCE; SEQUENCE REPEATS; SEQUENCE ALIGNMENTS;
DOI
10.1002/prot.340170407
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
An automated algorithm is presented that delineates protein sequence fragments which display similarity. The method incorporates a selection of a number of local nonoverlapping sequence alignments with the highest similarity scores and a graph-theoretical approach to elucidate the consistent start and end points of the fragments comprising one or more ensembles of related subsequences. The procedure allows the simultaneous identification of different types of repeats within one sequence. A multiple alignment of the resulting fragments is performed and a consensus sequence derived from the ensemble(s). Finally, a profile is constructed from the multiple alignment to detect possible and more distant members within the sequence. The method tolerates mutations in the repeats as well as insertions and deletions. The sequence spans between the various repeats or repeat clusters may be of different lengths. The technique has been applied to a number of proteins where the repeating fragments have been derived from information additional to the protein sequences. (C) 1993 Wiley-Liss, Inc.
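As a rough illustration of the first step the abstract describes (finding a high-scoring local alignment between two regions of the same sequence), the sketch below compares a sequence against itself with a Smith-Waterman-style recurrence. It is not the authors' procedure. The function name `best_self_alignment`, the identity-based scoring, and the linear gap penalty are assumptions made for this example. It also returns only the single best alignment and does not check that the two fragments are disjoint, whereas the published method selects several nonoverlapping alignments and then applies a graph-theoretical step.

```python
def best_self_alignment(seq, match=2, mismatch=-1, gap=-2):
    """Best local alignment of seq against itself, off the main diagonal.

    Hypothetical helper for illustration only. Returns
    (score, (start1, end1), (start2, end2)) with 0-based half-open spans.
    """
    n = len(seq)
    # H[i][j] = best score of a local alignment ending at seq[i-1], seq[j-1]
    H = [[0] * (n + 1) for _ in range(n + 1)]
    best = (0, 0, 0)  # (score, i, j)
    for i in range(1, n + 1):
        # Only j > i: skips the trivial alignment of the sequence with itself
        for j in range(i + 1, n + 1):
            s = match if seq[i - 1] == seq[j - 1] else mismatch
            H[i][j] = max(0,
                          H[i - 1][j - 1] + s,   # align two residues
                          H[i - 1][j] + gap,     # gap in the second fragment
                          H[i][j - 1] + gap)     # gap in the first fragment
            if H[i][j] > best[0]:
                best = (H[i][j], i, j)

    score, i, j = best
    end1, end2 = i, j
    # Trace back until the score drops to zero to find the fragment starts
    while i > 0 and j > i and H[i][j] > 0:
        s = match if seq[i - 1] == seq[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + s:
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    return score, (i, end1), (j, end2)


# Toy sequence with one exact internal repeat ("GDELLKAW")
toy = "MKT" + "GDELLKAW" + "PPSR" + "GDELLKAW" + "QQ"
score, span1, span2 = best_self_alignment(toy)
print(score, toy[span1[0]:span1[1]], toy[span2[0]:span2[1]])
```

A realistic implementation would replace the identity score with an amino acid exchange matrix and repeat the search to collect several top-scoring nonoverlapping alignments before any consistency analysis.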
Pages: 391-411
Number of pages: 21