A hint to search for metalloproteins in gene banks

Cited by: 98
Authors
Andreini, C
Bertini, I [1]
Rosato, A
Affiliations
[1] Univ Florence, CERM, I-50019 Sesto Fiorentino, Italy
[2] Univ Florence, Dept Chem, I-50019 Sesto Fiorentino, Italy
Keywords
DOI
10.1093/bioinformatics/bth095
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: With the advent of genome sequencing, a huge database of protein primary sequences has been accumulating. In parallel, a number of tools to investigate and expand upon this information, e.g. reconstructing and building relationships between protein families and superfamilies, have been developed. Metalloproteins are proteins capable of binding one or more metal ions, which are required for their biological function, for regulation of their activities or for structural purposes. Sometimes, metal binding can be observed in vitro but is not physiologically relevant. At present, there is a lack of specific tools for identifying metalloproteins in databases of gene sequences.
Results: In the present work, an approach exploiting metal-binding patterns (MBPs) of metalloproteins present in the Protein Data Bank (PDB) to search gene banks for new metalloproteins is presented and applied to copper proteins. Nearly 100 different MBPs have been identified and then used for subsequent applications. The ensemble of sequences of the whole PDB is used to assess the potential and limits of the method and to identify levels of confidence for the predictions output by the search. It appears that copper-binding capabilities are identified with a confidence >90% when the percentage of identical amino acids aligned around the MBP by PHI-BLAST is at least 20% with respect to the entire protein domain length. If this percentage is between 10% and 20%, the level of confidence is ~50%. Application of the methodology to the entire genome sequences of Pyrococcus furiosus, Escherichia coli, Drosophila melanogaster and Homo sapiens suggests some differentiation between prokaryotes and eukaryotes.
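The confidence levels quoted in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the inclusive treatment of the 10% and 20% boundaries, and the label returned below 10% (a range for which the abstract states no confidence level) are all assumptions made for illustration. The inputs are taken to be the number of identical amino acids aligned around the MBP and the length of the entire protein domain.

```python
def percent_identity(identical_residues: int, domain_length: int) -> float:
    """Percentage of identical aligned residues relative to the whole domain length."""
    if domain_length <= 0:
        raise ValueError("domain_length must be positive")
    if not 0 <= identical_residues <= domain_length:
        raise ValueError("identical_residues must lie between 0 and domain_length")
    return 100.0 * identical_residues / domain_length


def copper_binding_confidence(pct: float) -> str:
    """Map a percent identity to the confidence level reported in the abstract.

    Boundary handling (>= 20, >= 10) is an assumption; the abstract gives
    no confidence figure for values below 10%.
    """
    if pct >= 20.0:
        return ">90%"
    if pct >= 10.0:
        return "~50%"
    return "not stated"


# Hypothetical hit: 30 identical residues over a 120-residue domain (25%).
pct = percent_identity(30, 120)
print(f"{pct:.1f}% identity, confidence {copper_binding_confidence(pct)}")
```

In such a sketch the percentage is normalized by the domain length rather than by the alignment length, following the abstract's wording "with respect to the entire protein domain length".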
Pages: 1373-1380
Number of pages: 8