FungalRV: adhesin prediction and immunoinformatics portal for human fungal pathogens

Cited by: 58
Authors
Chaudhuri, Rupanjali [1]
Ansari, Faraz Alam [1]
Raghunandanan, Muthukurussi Varieth [1]
Ramachandran, Srinivasan [1]
Affiliations
[1] CSIR, Inst Genom & Integrat Biol, GN Ramachandran Knowledge Ctr Genome Informat, Delhi 110007, India
Keywords
B-CELL EPITOPES; ASPERGILLUS-FUMIGATUS; COCCIDIOIDES-IMMITIS; PROTEIN; GENE; BINDING; IDENTIFICATION; PEPTIDES; DATABASE; DOMAIN;
DOI
10.1186/1471-2164-12-192
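Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 [Microbiology]; 090105 [Crop Production Systems and Ecological Engineering];
Abstract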
Background: The availability of sequence data for human pathogenic fungi creates opportunities to develop bioinformatics tools and resources for vaccine development to benefit at-risk patients.
Description: We have developed a fungal adhesin predictor and an immunoinformatics database of predicted adhesins. Based on a literature search and domain analysis, we prepared a positive dataset comprising adhesin protein sequences from the human fungal pathogens Candida albicans, Candida glabrata, Aspergillus fumigatus, Coccidioides immitis, Coccidioides posadasii, Histoplasma capsulatum, Blastomyces dermatitidis, Pneumocystis carinii, Pneumocystis jirovecii and Paracoccidioides brasiliensis. The negative dataset consisted of proteins with a high probability of functioning intracellularly. We used 3945 compositional properties, including frequencies of mono, doublet, triplet, and multiplets of amino acids and hydrophobic properties, as input features of protein sequences to a Support Vector Machine. The best classifiers were identified through an exhaustive search of 588 parameters, selecting those with the best Matthews Correlation Coefficient (MCC) and the lowest coefficient of variation among the 3-fold cross-validation datasets. The "FungalRV adhesin predictor" was built on three models whose average MCC was in the range 0.89-0.90, with a coefficient of variation across the three-fold cross-validation datasets in the range 1.2%-2.74% at a threshold score of 0. We obtained an overall MCC value of 0.8702 considering all 8 pathogens, namely C. albicans, C. glabrata, A. fumigatus, B. dermatitidis, C. immitis, C. posadasii, H. capsulatum and P. brasiliensis, thus showing high sensitivity and specificity at a threshold of 0.511. For P. brasiliensis, the algorithm achieved a sensitivity of 66.67%. A total of 307 fungal adhesins and adhesin-like proteins were predicted from the entire proteomes of eight human pathogenic fungal species. The immunoinformatics analysis data on these proteins were organized for easy analysis through a user interface, and a Web interface was developed for analysis by users. The predicted adhesin sequences were processed through 18 immunoinformatics algorithms, and these data have been organized into a MySQL backend. A user-friendly interface has been developed for experimental researchers to retrieve information from the database.
Conclusion: The FungalRV webserver, which facilitates the discovery process for novel human pathogenic fungal adhesin vaccines, has been developed.
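The quantities named in the abstract (amino acid mono and doublet frequencies, the Matthews Correlation Coefficient, and the coefficient of variation across cross-validation folds) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the 20-letter alphabet, the use of the population standard deviation, and the per-fold confusion-matrix counts are all assumptions made for illustration, and the sketch covers only 420 of the 3945 compositional properties mentioned above.

```python
import math
from collections import Counter
from itertools import product
from statistics import mean, pstdev

# Assumed alphabet: the 20 standard amino acids (hypothetical choice).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def kmer_frequencies(seq, k):
    """Relative frequency of every possible k-mer (k=1 mono, k=2 doublet)."""
    kmers = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    total = len(seq) - k + 1  # number of overlapping windows
    counts = Counter(seq[i:i + k] for i in range(total))
    return [counts[km] / total if total > 0 else 0.0 for km in kmers]


def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when a margin is empty
    return (tp * tn - fp * fn) / denom


def coefficient_of_variation(values):
    """Population standard deviation over the mean, as a percentage."""
    return 100.0 * pstdev(values) / mean(values)


if __name__ == "__main__":
    # Feature vector: 20 mono + 400 doublet frequencies for a toy sequence.
    features = kmer_frequencies("MKTAYIAKQR", 1) + kmer_frequencies("MKTAYIAKQR", 2)
    print(len(features))

    # Hypothetical (tp, tn, fp, fn) counts for three cross-validation folds.
    folds = [(45, 48, 2, 5), (46, 47, 3, 4), (44, 49, 1, 6)]
    fold_mccs = [mcc(*f) for f in folds]
    print(mean(fold_mccs), coefficient_of_variation(fold_mccs))
```

A classifier chosen in the way the abstract describes would favour a high mean MCC together with a low coefficient of variation, meaning it performs consistently across folds rather than well on only one of them.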
Pages: 14