Statistical modeling, phylogenetic analysis and structure prediction of a protein splicing domain common to inteins and hedgehog proteins

被引:64
作者
Dalgaard, JZ
Moser, MJ
Hughey, R
Mian, IS
机构
[1] UNIV CALIF BERKELEY, LAWRENCE BERKELEY LAB, DIV LIFE SCI, BERKELEY, CA 94720 USA
[2] UNIV CALIF SANTA CRUZ, BASKIN CTR COMP ENGN & INFORMAT SCI, SANTA CRUZ, CA 95064 USA
关键词
hidden Markov model; intein; hedgehog; endonuclease; klbA domain; protein splicing;
D O I
10.1089/cmb.1997.4.193
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Inteins, introns spliced at the protein level, and the hedgehog family of proteins involved in eucaryotic development both undergo autocatalytic proteolysis. Here, a specific and sensitive hidden Markov model (HMM) of a protein splicing domain shared by inteins and the hedgehog proteins has been trained and employed for further analysis. The HMM characterizes the common features of this domain including the position where a site-specific DNA endonuclease domain is inserted in the majority of the inteins, The HMM was used to identify several new putative inteins, such as that in the Methanococcus jannaschii klbA protein, and to generate a multiple sequence alignment of sequences possessing this domain, Phylogenetic analysis suggests that hedgehog proteins evolved from inteins, Secondary and tertiary structure predictions suggest that the domain has a structure similar to a beta-sandwich, Similarities between the serine protease cleavage mechanism and the protein splicing reaction mechanism are discussed, Examination of the locations of inteins indicates that they are not inserted randomly in an extein, but are often inserted at functionally important positions in the host proteins, A specific and sensitive HMM for a domain present in klbA proteins identified several additional bacterial and archaeal family members, and analysis of the site of insertion of the intein suggests residues that may be functionally important, This domain may play a role in formation of surface-associated protein complexes.
引用
收藏
页码:193 / 214
页数:22
相关论文
共 78 条
[1]  
ADACHI J, 1995, THESIS I STAT MATH T
[2]   AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE [J].
ALTSCHUL, SF .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) :555-565
[3]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[4]   The PROSITE database, its status in 1995 [J].
Bairoch, A ;
Bucher, P ;
Hofmann, K .
NUCLEIC ACIDS RESEARCH, 1996, 24 (01) :189-196
[5]   HIDDEN MARKOV-MODELS OF BIOLOGICAL PRIMARY SEQUENCE INFORMATION [J].
BALDI, P ;
CHAUVIN, Y ;
HUNKAPILLER, T ;
MCCLURE, MA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (03) :1059-1063
[6]  
BARRETT C, 1996, IN PRESS CABIOS
[7]   STRUCTURAL AND FUNCTIONAL-RELATIONSHIPS BETWEEN PROKARYOTIC AND EUKARYOTIC DNA-POLYMERASES [J].
BERNAD, A ;
ZABALLOS, A ;
SALAS, M ;
BLANCO, L .
EMBO JOURNAL, 1987, 6 (13) :4219-4225
[8]   COMPILATION, ALIGNMENT, AND PHYLOGENETIC-RELATIONSHIPS OF DNA-POLYMERASES [J].
BRAITHWAITE, DK ;
ITO, J .
NUCLEIC ACIDS RESEARCH, 1993, 21 (04) :787-802
[9]  
Brenner SE, 1996, METHOD ENZYMOL, V266, P635
[10]  
Brown M, 1993, Proc Int Conf Intell Syst Mol Biol, V1, P47