Automated prediction of domain boundaries in CASP6 targets using Ginzu and RosettaDOM

Cited by: 63

Authors:

Kim, DE ^{[1]}

Chivian, D ^{[1]}

Malmström, L ^{[1]}

Baker, D ^{[1]}

Affiliation:

[1] Univ Washington, Dept Biochem, Seattle, WA 98195 USA

Source:

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS | 2005 / Vol. 61

Keywords:

domain prediction; domain parsing; domain identification; CASP; CAFASP; Rosetta; Robetta; protein structure prediction; ab initio modeling; de novo modeling; template-based modeling; comparative modeling; homology modeling

DOI:

10.1002/prot.20737

Chinese Library Classification (CLC) number:

Q5 [Biochemistry]; Q7 [Molecular Biology]

Discipline classification code:

071010; 081704

Abstract:

Domain boundary prediction is an important step in both experimental and computational protein structure characterization. We have developed two fully automated domain parsing methods: the first, Ginzu, which we have described previously, utilizes information from homologous sequences and structures, while the second, RosettaDOM, which has not been described previously, uses only information in the query sequence. Ginzu iteratively assigns domains by homology to structures and sequence families using successively less confident methods. RosettaDOM uses the Rosetta de novo structure prediction method to build three-dimensional models, and then applies Taylor's structure-based domain assignment method to parse the models into domains. Domain boundaries observed repeatedly in the models are predicted to be domain boundaries for the protein. Interestingly, RosettaDOM produced quite good domain predictions for proteins of a size typically considered to be beyond the reach of de novo structure prediction methods. For remote fold recognition targets and new folds, both Ginzu and RosettaDOM produced promising results, and in some cases where one method failed to detect the correct domain boundary, it was correctly identified by the other method. We describe here the successes and failures using both methods, and address the possibility of incorporating both protocols into an improved hybrid method.
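
The idea that boundaries "observed repeatedly in the models" become the prediction can be illustrated with a small voting sketch. This is not the RosettaDOM implementation: it assumes the per-model boundary positions have already been produced by some structure-based parser, and the function name `consensus_boundaries` and the parameters `window` and `min_fraction` are hypothetical choices made for illustration only. The record above does not state the criteria actually used.

```python
def consensus_boundaries(model_boundaries, seq_len, window=10, min_fraction=0.5):
    """Return boundary positions supported by many models.

    model_boundaries: one list of boundary residue numbers (1-based) per model;
                      an empty list means that model was parsed as one domain.
    window:           hypothetical tolerance, in residues, within which two
                      boundaries are treated as the same cut.
    min_fraction:     hypothetical fraction of models that must agree.
    """
    n_models = len(model_boundaries)
    if n_models == 0:
        return []

    # support[p] = number of models with a boundary within `window` of residue p
    support = [0] * (seq_len + 1)
    for cuts in model_boundaries:
        covered = set()
        for c in cuts:
            lo = max(1, c - window)
            hi = min(seq_len, c + window)
            covered.update(range(lo, hi + 1))
        for p in covered:  # each model votes at most once per residue
            support[p] += 1

    threshold = min_fraction * n_models
    predicted = []
    pos = 1
    while pos <= seq_len:
        if support[pos] >= threshold:
            # Walk to the end of this contiguous run of well-supported residues.
            start = pos
            while pos <= seq_len and support[pos] >= threshold:
                pos += 1
            run = range(start, pos)
            best = max(support[p] for p in run)
            peak = [p for p in run if support[p] == best]
            predicted.append(peak[len(peak) // 2])  # middle of the best-supported stretch
        else:
            pos += 1
    return predicted


if __name__ == "__main__":
    # Five made-up models: three agree on a cut near residue 100, one finds no cut,
    # and one places a cut at 250 that no other model supports.
    models = [[100], [102], [98], [], [250]]
    print(consensus_boundaries(models, seq_len=300))
```

With these made-up inputs, the three boundaries near residue 100 reinforce each other and yield a single predicted cut, while the isolated boundary at 250 falls below the agreement threshold and is dropped.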

Pages: 193-200

Page count: 8
