Biomarker discovery, disease classification, and similarity query processing on high-throughput MS/MS data of inborn errors of metabolism

被引:20
作者
Baumgartner, C [1 ]
Baumgartner, D
机构
[1] Univ Hlth Sci Med Informat & Technol, Inst Biomed Engn, Res Grp Clin Bioinformat, Eduard Wallnofer Zentrum 1, A-6060 Hall In Tyrol, Austria
[2] Innsbruck Med Univ, Dept Pediat, A-6020 Innsbruck, Austria
关键词
biomarker discovery; disease classification; similarity query processing; tandem mass spectrometry; metabolic disorders;
D O I
10.1177/1087057105280518
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In newborn errors of metabolism, biomarkers are urgently needed for disease screening, diagnosis, and monitoring of therapeutic interventions. This article describes a 2-step approach to discover metabolic markers, which involves (1) the identification of marker candidates and (2) the prioritization of them based on expert knowledge of disease metabolism. For step 1, the authors developed a new algorithm, the biomarker identifier (BMI), to identify markers from quantified diseased versus normal tandem mass spectrometry data sets. BMI produces a ranked list of marker candidates and discards irrelevant metabolites based on a quality measure, taking into account the discriminatory performance, discriminatory space, and variance of metabolites' concentrations at the state of disease. To determine the ability of identified markers to classify subjects, the authors compared the discriminatory performance of several machine-learning paradigms and described a retrieval technique that searches and classifies abnormal metabolic profiles from a screening database. Seven inborn errors of metabolism-phenylketonuria (PKU), glutaric acidemia type I (GA-I), 3-methylcrotonylglycinemia deficiency (3-MCCD), methylmalonic acidernia (MMA), propionic acidernia (PA), medium-chain acyl CoA dehydrogenase deficiency (MCADD), and 3-OH long-chain acyl CoA dehydrogenase deficiency (LCHADD)-were investigated. All primarily prioritized marker candidates could be confirmed by literature. Some novel secondary candidates were identified (i.e., C 16:1 and C4DC for PKU, C4DC for GA-I, and C18:1 for MCADD), which require further validation to confirm their biochemical role during, health and disease.
引用
收藏
页码:90 / 99
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2004, KERNEL METHODS PATTE
[2]  
[Anonymous], 2021, Bayesian Data Analysis
[3]   Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[4]   An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers [J].
Ball, G ;
Mian, S ;
Holding, F ;
Allibone, RO ;
Lowe, J ;
Ali, S ;
Li, G ;
McCardle, S ;
Ellis, IO ;
Creaser, C ;
Rees, RC .
BIOINFORMATICS, 2002, 18 (03) :395-404
[5]   Modelling of classification rules on metabolic patterns including machine learning and expert knowledge [J].
Baumgartner, C ;
Böhm, C ;
Baumgartner, D .
JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (02) :89-98
[6]   Supervised machine learning techniques for the classification of metabolic disorders in newborns [J].
Baumgartner, C ;
Böhm, C ;
Baumgartner, D ;
Marini, G ;
Weinberger, K ;
Olgemöller, B ;
Liebl, B ;
Roscher, AA .
BIOINFORMATICS, 2004, 20 (17) :2985-2996
[7]  
Blau N, 2001, METABOLIC MOL BASES
[8]   Laboratory integration and utilization of tandem mass spectrometry in neonatal screening: a model for clinical mass spectrometry in the next millennium [J].
Chace, DH ;
DiPerna, JC ;
Naylor, EW .
ACTA PAEDIATRICA, 1999, 88 :45-47
[9]  
Charrow J., 2000, GENET MED, V2, P267, DOI [10.1097/00125817-200007000-00011, DOI 10.1097/00125817-200007000-00011]
[10]  
Charrow Joel, 2000, Genetics in Medicine, V2, P267