MAAP: Malarial adhesins and adhesin-like proteins predictor

被引:29
作者
Ansari, Faraz Alam [1 ]
Kumar, Naveen [1 ]
Subramanyam, Mekapati Bala [1 ]
Gnanamani, Muthiah [1 ]
Ramachandran, Srinivasan [1 ]
机构
[1] GN Ramachandran Knowledge Ctr Genome Informat, Inst Integrat Biol, Delhi 110007, India
关键词
support vector machine; compositional property; Plasmodium; Plasmodium falciparum; vaccine targets;
D O I
10.1002/prot.21568
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Malaria caused by protozoan parasites belonging to the genus Plasmodium is a dreaded disease, second only to tuberculosis. The emergence of parasites resistant to commonly used drugs and the lack of availability of vaccines aggravates the problem. One of the preventive approaches targets adhesion of parasites to host cells and tissues. Adhesion of parasites is mediated by proteins called adhesins. Abrogation of adhesion by either immunizing the host with adhesins or inhibiting the interaction using structural analogs of host cell receptors holds the potential to develop novel preventive strategies. The availability of complete genome sequence offers new opportunities for identifying adhesin and adhesin-like proteins. Development of computational algorithms can simplify this task and accelerate experimental characterization of the predicted adhesins from complete genomes. A curated positive dataset of experimentally known adhesins from Plasmodium species was prepared by careful examination of literature reports. "Controversial" or "hypothetical" adhesins were excluded. The negative dataset consisted of proteins representing various intracellular functions including information processing, metabolism, and interface (transporters). We did not include proteins likely to be on the surface with unknown adhesin properties or which are linked even indirectly to the adhesion process in either of the training sets. A nonhomology-based approach using 420 compositional properties of amino acid dipeptide and multiplet frequencies was used to develop MAAP Web server with Support Vector Machine (SVM) model classifier as its engine for the prediction of malarial adhesins and adhesin-like proteins. The MAAP engine has six SVM classifier models identifled through an exhaustive search from 728 kernel parameters set. These models displayed an efficiency (Mathews correlation coefficient) of 0.860-0.967. The final prediction P-maap score is the maximum score attained by a given sequence in any of the six models. The results of MAAP runs on complete proteomes of Plasmodium species revealed that in Plasmodium falciparum at P-maap scores above 0.0, we observed a sensitivity of 100% with two false positives. In P. vivax and P. yoelii an optimal threshold P-maap score of 0.7 was optimal with very few false positives (upto 5). Several new predictions were obtained. This list includes hypothetical protein PF14_0040, interspersed repeat antigen, STEVOR, liver stage antigen, SURFIN, RIFIN, stevor (3D7-stevorT3-2), mature parasite-infected erythrocyte surface antigen or P. falciparum erythrocyte membrane protein 2, merozoite surface protein 6 in P. falciparum, circumsporozoite proteins, microneme protein-1, Vir18, Vir12-like, Vir12, Vir18-like, Vir18-related and Vir4 in P. vivax, circumsporozoite protein/thrombospondin related anonymous proteins, 28 kDa ookinete surface protein, yir1, and yir4 of P. yoelii. Among these, a few proteins identified by MAAP were matched with those identified by other groups using different experimental and theoretical strategies. Most other interspersed repeat proteins in Plasmodium species had lower P-maap scores. These new predictions could serve as new leads for further experimental characterization (MAAP webserver. http://maap.igib.res.in).
引用
收藏
页码:659 / 666
页数:8
相关论文
共 42 条
[1]
SYNTHETIC PEPTIDES FROM THE CIRCUMSPOROZOITE PROTEINS OF PLASMODIUM-FALCIPARUM AND PLASMODIUM-KNOWLESI RECOGNIZE THE HUMAN HEPATOMA-CELL LINE HEPG2-A16 INVITRO [J].
ALEY, SB ;
BATES, MD ;
TAM, JP ;
HOLLINGDALE, MR .
JOURNAL OF EXPERIMENTAL MEDICINE, 1986, 164 (06) :1915-1922
[2]
PlasmoDB:: the Plasmodium genome resource.: A database integrating experimental and computational data [J].
Bahl, A ;
Brunk, B ;
Crabtree, J ;
Fraunholz, MJ ;
Gajria, B ;
Grant, GR ;
Ginsburg, H ;
Gupta, D ;
Kissinger, JC ;
Labo, P ;
Li, L ;
Mailman, MD ;
Milgram, AJ ;
Pearson, DS ;
Roos, DS ;
Schug, J ;
Stoeckert, CJ ;
Whetzel, P .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :212-215
[3]
PLASMODIUM-VIVAX - A GLIMPSE INTO THE UNIQUE AND SHARED BIOLOGY OF THE MEROZOITE [J].
BARNWELL, JW ;
GALINSKI, MR .
ANNALS OF TROPICAL MEDICINE AND PARASITOLOGY, 1995, 89 (02) :113-120
[4]
Plasmodium falciparum erythrocyte membrane protein 1 is a parasitized erythrocyte receptor for adherence to CD36, thrombospondin, and intercellular adhesion molecule 1 [J].
Baruch, DI ;
Gormley, JA ;
Ma, C ;
Howard, RJ ;
Pasloske, BL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (08) :3497-3502
[5]
Do malaria ookinete surface proteins P25 and P28 mediate parasite entry into mosquito midgut epithelial cells? [J].
Baton, LA ;
Ranford-Cartwright, LC .
MALARIA JOURNAL, 2005, 4 (1)
[6]
Improved prediction of signal peptides: SignalP 3.0 [J].
Bendtsen, JD ;
Nielsen, H ;
von Heijne, G ;
Brunak, S .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 340 (04) :783-795
[7]
Analysis and prediction of affinity of TAP binding peptides using cascade SVM [J].
Bhasin, M ;
Raghava, GPS .
PROTEIN SCIENCE, 2004, 13 (03) :596-607
[8]
METHODS AND ALGORITHMS FOR STATISTICAL-ANALYSIS OF PROTEIN SEQUENCES [J].
BRENDEL, V ;
BUCHER, P ;
NOURBAKHSH, IR ;
BLAISDELL, BE ;
KARLIN, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (06) :2002-2006
[9]
Mutually exclusive expression of virulence genes by malaria parasites is regulated independently of antigen production [J].
Dzikowski, Ron ;
Frank, Matthias ;
Deitsch, Kirk .
PLOS PATHOGENS, 2006, 2 (03) :184-194
[10]
Genome sequence of the human malaria parasite Plasmodium falciparum [J].
Gardner, MJ ;
Hall, N ;
Fung, E ;
White, O ;
Berriman, M ;
Hyman, RW ;
Carlton, JM ;
Pain, A ;
Nelson, KE ;
Bowman, S ;
Paulsen, IT ;
James, K ;
Eisen, JA ;
Rutherford, K ;
Salzberg, SL ;
Craig, A ;
Kyes, S ;
Chan, MS ;
Nene, V ;
Shallom, SJ ;
Suh, B ;
Peterson, J ;
Angiuoli, S ;
Pertea, M ;
Allen, J ;
Selengut, J ;
Haft, D ;
Mather, MW ;
Vaidya, AB ;
Martin, DMA ;
Fairlamb, AH ;
Fraunholz, MJ ;
Roos, DS ;
Ralph, SA ;
McFadden, GI ;
Cummings, LM ;
Subramanian, GM ;
Mungall, C ;
Venter, JC ;
Carucci, DJ ;
Hoffman, SL ;
Newbold, C ;
Davis, RW ;
Fraser, CM ;
Barrell, B .
NATURE, 2002, 419 (6906) :498-511