IPSA - INDUCTIVE PROTEIN-STRUCTURE ANALYSIS

被引:7
作者
SCHULZEKREMER, S
KING, RD
机构
[1] FREE UNIV BERLIN,DEPT CRYSTALLOG,W-1000 BERLIN 33,GERMANY
[2] UNIV STRATHCLYDE,DEPT STAT & DATA MODELLING,GLASGOW G1 1XW,SCOTLAND
来源
PROTEIN ENGINEERING | 1992年 / 5卷 / 05期
关键词
HELIX PAIRS CLASSIFICATION; INTEGRATION OF INFORMATION; LONG-RANGE INTERACTIONS; MACHINE LEARNING; SUPER-SECONDARY STRUCTURE;
D O I
10.1093/protein/5.5.377
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Inductive Structure Protein Analysis (IPSA) project presents a new method for investigating protein structure. IPSA includes the creation of a new database which was designed specifically for the analysis of protein structure by statistics and machine learning. The Protein Representation Language (PRL) database includes explicit and symbolic representations of geometrical, topological and chemo-physical information about secondary structures and the relationships between secondary structures. The IPSA methodology consists of: the use of PRL information to produce a new database of examples of secondary structures which associate together (examples of possible super-secondary structures); then the use of a variety of clustering techniques to produce a consensus clustering of these examples (super-secondary structures); these super-secondary structures are finally examined to uncover any biological features of significance. We have applied this method to find simple super-secondary structures consisting of pairs of alpha-helices. We found four well-defined super-secondary structures, one formed exclusively by long range interactions, and another in association with an additional element of secondary structure (alpha-t-alpha-motif). Examinations were carried out using homologous pairs and conformational fits which confirm our clustering.
引用
收藏
页码:377 / 390
页数:14
相关论文
共 29 条
[1]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[2]  
CHEESEMAN P, 1988, 5TH P INT C MACH LEA, P56
[3]  
CHOTHIA C, 1984, ANNU REV BIOCHEM, V53, P537
[4]   HELIX TO HELIX PACKING IN PROTEINS [J].
CHOTHIA, C ;
LEVITT, M ;
RICHARDSON, D .
JOURNAL OF MOLECULAR BIOLOGY, 1981, 145 (01) :215-250
[5]   HOW BIG IS THE UNIVERSE OF EXONS [J].
DORIT, RL ;
SCHOENBACH, L ;
GILBERT, W .
SCIENCE, 1990, 250 (4986) :1377-1382
[6]   3-DIMENSIONAL STRUCTURE OF MEMBRANE AND SURFACE-PROTEINS [J].
EISENBERG, D .
ANNUAL REVIEW OF BIOCHEMISTRY, 1984, 53 :595-623
[7]  
EISENBERG D, 1982, FARADAY S CHEM SOC, V17, P109, DOI DOI 10.1039/FS9821700109
[8]  
Fisher D. H., 1987, Machine Learning, V2, P139, DOI 10.1023/A:1022852608280
[9]   THE PROTEIN IDENTIFICATION RESOURCE (PIR) [J].
GEORGE, DG ;
BARKER, WC ;
HUNT, LT .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :11-15
[10]   FURTHER DEVELOPMENTS OF PROTEIN SECONDARY STRUCTURE PREDICTION USING INFORMATION-THEORY - NEW PARAMETERS AND CONSIDERATION OF RESIDUE PAIRS [J].
GIBRAT, JF ;
GARNIER, J ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (03) :425-443