GIBBS MOTIF SAMPLING - DETECTION OF BACTERIAL OUTER-MEMBRANE PROTEIN REPEATS

被引:289
作者
NEUWALD, AF
LIU, JS
LAWRENCE, CE
机构
[1] STANFORD UNIV, DEPT STAT, STANFORD, CA 94305 USA
[2] NEW YORK STATE DEPT HLTH, WADSWORTH CTR LABS & RES, BIOMETR LAB, ALBANY, NY 12201 USA
关键词
BAYESIAN INFERENCE; MULTIPLE ALIGNMENT ALGORITHMS; OUTER MEMBRANE PROTEINS; PATTERN RECOGNITION; PORINS; PROTEIN MOTIFS; STATISTICAL SIGNIFICANCE; WILCOXON SIGNED RANK TEST;
D O I
10.1002/pro.5560040820
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles.
引用
收藏
页码:1618 / 1632
页数:15
相关论文
共 61 条
  • [1] Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ, Basic local alignment search tool, J Mol Biol, 215, pp. 403-410, (1990)
  • [2] Bairoch A, Boeckmann B., The SWISS‐PROT protein sequence data bank, Nucleic Acids Res, 20, pp. 2019-2022, (1992)
  • [3] Baldi P, Chauvin Y, McClure M, Hunkapiller T., Hidden Markov models of biological primary sequence information, Proc Natl Acad Sci USA, 91, pp. 1059-1063, (1994)
  • [4] Barker WC, George DG, Mewes HW, Pfeiffer F, Tsugita A., The PIR‐international databases, Nucleic Acids Res, 21, pp. 3089-3092, (1993)
  • [5] Bennet PB, Makita N, George AL, A molecular basis for gating mode transitions in human skeletal muscle Na<sup>+</sup> channels, FEBS Letters, 326, pp. 21-24, (1993)
  • [6] Benson D, Lipman DJ, Ostell J., GenBank, Nucleic Acids Res, 21, pp. 2963-2965, (1993)
  • [7] Bork P, Holm L, Sander C., The immunoglobulin fold: Structural classification, sequence patterns and common core, J Mol Biol, 242, pp. 309-320, (1994)
  • [8] Bosch D, Scholten M, Verhagen C, Tommassen J., The role of the carboxy‐terminal membrane‐spanning fragment in the biogenesis of Escherichia coli K12 outer membrane protein PhoE, Mol Gen Genet, 216, pp. 144-148, (1989)
  • [9] Brennan RG, Matthews BW, The helix‐turn‐helix DNA binding motif, J Biol Chem, 264, pp. 1903-1906, (1989)
  • [10] Cardon LR, Stormo GD, Expectation maximization algorithm for identifying protein‐binding sites with variable lengths from unaligned DNA fragments, J Mol Biol, 225, pp. 159-170, (1992)