Data Mining Methods for Omics and Knowledge of Crude Medicinal Plants toward Big Data Biology

被引:31
作者
Afendi, Farit M. [1 ,2 ]
Ono, Naoaki [1 ]
Nakamura, Yukiko [1 ]
Nakamura, Kensuke [4 ]
Darusman, Latifah K. [3 ]
Kibinge, Nelson [1 ]
Morita, Aki Hirai [1 ]
Tanaka, Ken [5 ]
Horai, Hisayuki [6 ]
Altaf-Ul-Amin, Md. [1 ]
Kanaya, Shigehiko [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara, Ikoma 6300101, Japan
[2] Bogor Agr Univ, Dept Stat, Bogor 16680, Indonesia
[3] Bogor Agr Univ, Biopharmaca Res Ctr, Bogor 16151, Indonesia
[4] Maebashi Inst Technol, Maebashi, Gunma 3710816, Japan
[5] Univ Toyama, Inst Nat Med, Dept Med Resources, Toyama 9300194, Japan
[6] Ibaraki Natl Coll Technol, Dept Elect & Comp Engn, Hitachinaka, Ibaraki 3128508, Japan
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2013年 / 4卷 / 05期
关键词
D O I
10.5936/csbj.201301010
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular biological data has rapidly increased with the recent progress of the Omics fields, e.g., genomics, transcriptomics, proteomics and metabolomics that necessitates the development of databases and methods for efficient storage, retrieval, integration and analysis of massive data. The present study reviews the usage of KNApSAcK Family DB in metabolomics and related area, discusses several statistical methods for handling multivariate data and shows their application on Indonesian blended herbal medicines (Jamu) as a case study. Exploration using Biplot reveals many plants are rarely utilized while some plants are highly utilized toward specific efficacy. Furthermore, the ingredients of Jamu formulas are modeled using Partial Least Squares Discriminant Analysis (PLS-DA) in order to predict their efficacy. The plants used in each Jamu medicine served as the predictors, whereas the efficacy of each Jamu provided the responses. This model produces 71.6% correct classification in predicting efficacy. Permutation test then is used to determine plants that serve as main ingredients in Jamu formula by evaluating the significance of the PLS-DA coefficients. Next, in order to explain the role of plants that serve as main ingredients in Jamu medicines, information of pharmacological activity of the plants is added to the predictor block. Then N-PLS-DA model, multiway version of PLS-DA, is utilized to handle the three-dimensional array of the predictor block. The resulting N-PLS-DA model reveals that the effects of some pharmacological activities are specific for certain efficacy and the other activities are diverse toward many efficacies. Mathematical modeling introduced in the present study can be utilized in global analysis of big data targeting to reveal the underlying biology.
引用
收藏
页数:14
相关论文
共 150 条
[1]   Data integration and network reconstruction with ∼omics data using Random Forest regression in potato [J].
Acharjee, Animesh ;
Kloosterman, Bjorn ;
de Vos, Ric C. H. ;
Werij, Jeroen S. ;
Bachem, Christian W. B. ;
Visser, Richard G. F. ;
Maliepaard, Chris .
ANALYTICA CHIMICA ACTA, 2011, 705 (1-2) :56-63
[2]  
Adnyana IK, 2007, EVALUATIONOF PHARM E
[3]  
Adnyana LK, 2007, EVALUATIONOF PHARM E
[4]  
Afendi Farit Mochamad, 2012, Current Pharmacogenomics & Personalized Medicine, V10, P111
[5]   KNApSAcK Family Databases: Integrated Metabolite-Plant Species Databases for Multifaceted Plant Research [J].
Afendi, Farit Mochamad ;
Okada, Taketo ;
Yamazaki, Mami ;
Hirai-Morita, Aki ;
Nakamura, Yukiko ;
Nakamura, Kensuke ;
Ikeda, Shun ;
Takahashi, Hiroki ;
Altaf-Ul-Amin, Md. ;
Darusman, Latifah K. ;
Saito, Kazuki ;
Kanaya, Shigehiko .
PLANT AND CELL PHYSIOLOGY, 2012, 53 (02) :e1
[6]  
Afendi FM, 2011, 7 AS CROP SCI ASS C
[7]  
Afendi FM, 2010, IEEE INT C DAT MIN W
[8]  
Afendi FM, 2011, 2 INT S TEM MAY 25 2
[9]   Phytoalexins in defense against pathogens [J].
Ahuja, Ishita ;
Kissen, Ralph ;
Bones, Atle M. .
TRENDS IN PLANT SCIENCE, 2012, 17 (02) :73-90
[10]  
Akiyama Kenji, 2008, In Silico Biology, V8, P339