Improving the performance of DomainParser for structural domain partition using neural network

被引:52
作者
Guo, JT
Xu, D
Kim, D
Xu, Y
机构
[1] Oak Ridge Natl Lab, Div Life Sci, Prot Informat Grp, Oak Ridge, TN 37830 USA
[2] Oak Ridge Natl Lab, Div Math & Comp Sci, Oak Ridge, TN 37830 USA
关键词
D O I
10.1093/nar/gkg189
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural domains are considered as the basic units of protein folding, evolution, function and design. Automatic decomposition of protein structures into structural domains, though after many years of investigation, remains a challenging and unsolved problem. Manual inspection still plays a key role in domain decomposition of a protein structure. We have previously developed a computer program, DomainParser, using network flow algorithms. The algorithm partitions a protein structure into domains accurately when the number of domains to be partitioned is known. However the performance drops when this number is unclear (the overall performance is 74.5% over a set of 1317 protein chains). Through utilization of various types of structural information including hydrophobic moment profile, we have developed an effective method for assessing the most probable number of domains a structure may have. The core of this method is a neural network, which is trained to discriminate correctly partitioned domains from incorrectly partitioned domains. When compared with the manual decomposition results given in the SCOP database, our new algorithm achieves higher decomposition accuracy (81.9%) on the same data set.
引用
收藏
页码:944 / 952
页数:9
相关论文
共 30 条
[1]  
[Anonymous], [No title captured]
[2]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[3]   THE HELICAL HYDROPHOBIC MOMENT - A MEASURE OF THE AMPHIPHILICITY OF A HELIX [J].
EISENBERG, D ;
WEISS, RM ;
TERWILLIGER, TC .
NATURE, 1982, 299 (5881) :371-374
[4]  
Eisenberg D., 1982, FARADAY S CHEM SOC, V17, P109, DOI DOI 10.1039/FS9821700109
[5]  
Ford L. R, 1962, FLOWS NETWORKS
[6]  
HOBOHM U, 1992, PROTEIN SCI, V1, P409
[7]  
Holm L, 1998, PROTEINS, V33, P88, DOI 10.1002/(SICI)1097-0134(19981001)33:1<88::AID-PROT8>3.0.CO
[8]  
2-H
[9]   PARSER FOR PROTEIN-FOLDING UNITS [J].
HOLM, L ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (03) :256-268
[10]   IDENTIFICATION AND ANALYSIS OF DOMAINS IN PROTEINS [J].
ISLAM, SA ;
LUO, JC ;
STERNBERG, MJE .
PROTEIN ENGINEERING, 1995, 8 (06) :513-525