MACHINE DISCOVERY OF PROTEIN MOTIFS

被引:10
作者
CONKLIN, D
机构
关键词
PROTEIN TERTIARY STRUCTURE; MACHINE DISCOVERY; RELATIONAL LEARNING; KNOWLEDGE REPRESENTATION; DESCRIPTION LOGICS; INFORMATION RETRIEVAL; KNOWLEDGE DISCOVERY IN DATABASES;
D O I
10.1007/BF00993382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The investigation of relations between protein tertiary structure and amino acid sequence is a topic of tremendous importance in molecular biology. The automated discovery of recurrent patterns of structure and sequence is an essential part of this investigation. These patterns, known as protein motifs, are abstractions of fragments drawn from proteins of known sequence and tertiary structure. This paper has two objectives. The first is to introduce and define protein motifs, and provide a survey of previous research on protein motif discovery. The second is to present and apply a novel approach to protein motif representation and discovery, which is based on a spatial description logic and the symbolic machine learning paradigm of structured concept formation. A large database of protein fragments is processed using this approach, and several interesting and significant protein motifs are discovered.
引用
收藏
页码:125 / 150
页数:26
相关论文
共 58 条
  • [1] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [2] Bisson G, 1992, P AAAI 1992, P82
  • [3] KNOWLEDGE-BASED PREDICTION OF PROTEIN STRUCTURES AND THE DESIGN OF NOVEL MOLECULES
    BLUNDELL, TL
    SIBANDA, BL
    STERNBERG, MJE
    THORNTON, JM
    [J]. NATURE, 1987, 326 (6111) : 347 - 352
  • [4] CHOTHIA C, 1992, NATURE, V357, P544
  • [5] ON THE PREDICTION OF PROTEIN-STRUCTURE - THE SIGNIFICANCE OF THE ROOT-MEAN-SQUARE DEVIATION
    COHEN, FE
    STERNBERG, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1980, 138 (02) : 321 - 333
  • [6] COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT
    COLLOCH, N
    ETCHEBEST, C
    THOREAU, E
    HENRISSAT, B
    MORNON, JP
    [J]. PROTEIN ENGINEERING, 1993, 6 (04): : 377 - 382
  • [7] CONKLIN D, 1994, 2ND P INT C INT SYST, P96
  • [8] Conklin D., 1992, P 9 INT C MACH LEARN, P111
  • [9] CONKLIN D, 1992, 9TH P INT C ML92, P11
  • [10] CONKLIN D, 1995, THESIS QUEENS U CANA