SnapDRAGON: a method to delineate protein structural domains from sequence data

被引：68

作者：

George, RA ^{[1
]}

Heringa, J ^{[1
]}

机构：

[1] Natl Inst Med Res, Div Math Biol, London NW7 1AA, England

来源：

JOURNAL OF MOLECULAR BIOLOGY | 2002年 / 316卷 / 03期

基金：

英国医学研究理事会;

关键词：

protein; domain; boundaries; prediction; folding;

D O I：

10.1006/jmbi.2001.5387

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

We describe a method to identify protein domain boundaries from sequence information alone based on the assumption that hydrophobic residues cluster together in space. SnapDRAGON is a suite of programs developed to predict domain boundaries based on the consistency observed in a set of alternative ab initio three-dimensional (3D) models generated for a given protein multiple sequence alignment. This is achieved by running a distance geometry-based folding technique in conjunction with a 3D-domain assignment algorithm. The overall accuracy of our method in predicting the number of domains for a non-redundant data set of 414 multiple alignments, representing 185 single and 231 multiple-domain proteins, is 72.4%. Using domain linker regions observed in the tertiary structures associated with each query alignment as the standard of truth, inter-domain boundary positions are delineated with an accuracy of 63.9% for proteins comprising continuous domains only, and 35.4% for proteins with discontinuous domains. Overall, domain boundaries are delineated with an accuracy of 51.8%. The prediction accuracy values are independent of the pair-wise sequence similarities within each of the alignments. These results demonstrate the capability of our method to delineate domains in protein sequences associated with a wide variety of structural domain organisation. (C) 2002 Elsevier Science Ltd.

引用

页码：839 / 851

页数：13

共 69 条

[11] Bonneau R, 2001, PROTEINS, V43, P1, DOI 10.1002/1097-0134(20010401)43:1<1::AID-PROT1012>3.0.CO
[12] 2-A
[13] SHUFFLED DOMAINS IN EXTRACELLULAR PROTEINS
BORK, P
[J]. FEBS LETTERS, 1991, 286 (1-2) : 47 - 54
[14] THE PREDICTION OF PROTEIN DOMAINS
BUSETTA, B
BARRANS, Y
[J]. BIOCHIMICA ET BIOPHYSICA ACTA, 1984, 790 (02) : 117 - 124
[15] Creighton T.E., 1993, PROTEINS STRUCTURE M, V2nd
[16] Crippen G. M., 1988, DISTANCE GEOMETRY MO
[17] Polymer principles and protein folding
Dill, KA
[J]. PROTEIN SCIENCE, 1999, 8 (06) : 1166 - 1180
[18] THEORY FOR THE FOLDING AND STABILITY OF GLOBULAR-PROTEINS
DILL, KA
[J]. BIOCHEMISTRY, 1985, 24 (06) : 1501 - 1509
[19] STRUCTURE OF PAPAIN
DRENTH, J
JANSONIUS, JN
KOEKOEK, R
SWEN, HM
WOLTHERS, BG
[J]. NATURE, 1968, 218 (5145) : 929 - +
[20] ANTIBODY STRUCTURE AND MOLECULAR IMMUNOLOGY
EDELMAN, GM
[J]. SCIENCE, 1973, 180 (4088) : 830 - 840

← 1 2 3 4 5 6 7 →