Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method

Cited by: 397

Authors:

Peters, B [1]

Sette, A [1]

Affiliations:

[1] La Jolla Inst Allergy & Immunol, San Diego, CA 92109 USA

Source:

BMC BIOINFORMATICS | 2005 / Vol. 6 / Issue 1

DOI:

10.1186/1471-2105-6-132

Chinese Library Classification number:

Q5 [Biochemistry];

Discipline classification codes:

071010; 081704

Abstract:

Background: Many processes in molecular biology involve the recognition of short sequences of nucleic or amino acids, such as the binding of immunogenic peptides to major histocompatibility complex (MHC) molecules. From experimental data, a model of the sequence specificity of these processes can be constructed, such as a sequence motif, a scoring matrix or an artificial neural network. The purpose of these models is two-fold. First, they can provide a summary of experimental results, allowing for a deeper understanding of the mechanisms involved in sequence recognition. Second, such models can be used to predict the experimental outcome for as yet untested sequences. We previously reported the development of a method to generate such models, called the Stabilized Matrix Method (SMM). This method has been successfully applied to predicting peptide binding to MHC molecules, peptide transport by the transporter associated with antigen presentation (TAP) and proteasomal cleavage of protein sequences.

Results: Herein we report the implementation of the SMM algorithm as a publicly available software package. Specific features determining the type of problems the method is most appropriate for are discussed. Advantageous features of the package are: (1) the output generated is easy to interpret, (2) input and output are both quantitative, (3) specific computational strategies to handle experimental noise are built in, (4) the algorithm is designed to effectively handle bounded experimental data, (5) experimental data from randomized peptide libraries and conventional peptides can easily be combined, and (6) it is possible to incorporate pair interactions between positions of a sequence.

Conclusion: Making the SMM publicly available enables bioinformaticians and experimental biologists to easily access it, to compare its performance to other prediction methods, and to extend it to other applications.

Pages: 9
