YEAST CHROMOSOME-III - NEW GENE FUNCTIONS

被引:91
作者
KOONIN, EV
BORK, P
SANDER, C
机构
[1] EUROPEAN MOLEC BIOL LAB, D-69012 HEIDELBERG, GERMANY
[2] MAX DELBRUCK CTR MOLEC MED, D-13125 BERLIN, GERMANY
关键词
COMPUTER METHODS; GENOME ANALYSIS; PREDICTION OF PROTEIN FUNCTION; PREDICTION OF PROTEIN STRUCTURE; PROTEIN SEQUENCE ANALYSIS;
D O I
10.1002/j.1460-2075.1994.tb06287.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One year after the release of the sequence of yeast chromosome III, we have re-examined its open reading frames (ORFs) by computer methods. More than 61% of the 171 probable gene products have significant sequence similarities in the current databases; as many as 54% have already known functions or are related to functionally characterized proteins, allowing partial prediction of protein function, 11 percentage points more than reported a year ago; 19% are similar to proteins of known three-dimensional structure, allowing model building by homology. The most interesting new identifications include a sugar kinase distantly related to ribokinases, a phosphatidyl serine synthetase, a putative transcription regulator, a flavodoxin-like protein, and a zinc finger protein belonging to a distinct subfamily. Several ORFs have similarities to uncharacterized proteins, resulting in new families 'in search of a function'. About 54% of ORFs match sequences from other phyla, including numerous fragments in the database of expressed sequence tags (ESTs). Most significant similarities to ESTs are with proteins in conserved families widely represented in the databases. About 30% of ORFs contain one or more predicted transmembrane segments. The increase in the power of functional and structural prediction comes from improvements in sequence analysis and from richer databases and is expected to facilitate substantially the experimental effort in characterizing the function of new gene products.
引用
收藏
页码:493 / 503
页数:11
相关论文
共 58 条
[1]  
ADAMS MD, 1993, NATURE, V3, P266
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]  
AMAKASU H, 1993, GENETICS, V134, P675
[4]   THE PROSITE DICTIONARY OF SITES AND PATTERNS IN PROTEINS, ITS CURRENT STATUS [J].
BAIROCH, A .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3097-3103
[5]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK, RECENT DEVELOPMENTS [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3093-3096
[6]   THE PIR-INTERNATIONAL DATABASES [J].
BARKER, W ;
GEORGE, DG ;
MEWES, HW ;
PFEIFFER, F ;
TSUGITA, A .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :3089-3092
[7]   GENBANK [J].
BENSON, D ;
LIPMAN, DJ ;
OSTELL, J .
NUCLEIC ACIDS RESEARCH, 1993, 21 (13) :2963-2965
[8]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[9]  
BORK P, 1993, PROTEIN SCI, V2, P31
[10]   COMPREHENSIVE SEQUENCE-ANALYSIS OF THE 182 PREDICTED OPEN READING FRAMES OF YEAST CHROMOSOME-III [J].
BORK, P ;
OUZOUNIS, C ;
SANDER, C ;
SCHARF, M ;
SCHNEIDER, R ;
SONNHAMMER, E .
PROTEIN SCIENCE, 1992, 1 (12) :1677-1690