Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data

被引:202
作者
Alakwaa, Fadhl M. [1 ]
Chaudhary, Kumardeep [1 ]
Garmire, Lana X. [1 ,2 ]
机构
[1] Univ Hawaii, Canc Ctr, Epidemiol Program, Honolulu, HI 96813 USA
[2] Univ Hawaii Manoa, Mol Biosci & Bioengn Grad Program, Honolulu, HI 96822 USA
关键词
breast cancer; metabolomics; estrogen receptor; deep learning bioinformatics; GENE-EXPRESSION; ABC TRANSPORTERS; SUBTYPES; METABOLISM; SURVIVAL; RESISTANCE; GLUTAMINE; PATHWAY; RACE;
D O I
10.1021/acs.jproteome.7b00595
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER-) to test the accuracies of feed forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER- patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification.
引用
收藏
页码:337 / 347
页数:11
相关论文
共 61 条
[1]
Serine and glycine metabolism in cancer [J].
Amelio, Ivano ;
Cutruzzola, Francesca ;
Antonov, Alexey ;
Agostini, Massimiliano ;
Melino, Gerry .
TRENDS IN BIOCHEMICAL SCIENCES, 2014, 39 (04) :191-198
[2]
Deep learning for computational biology [J].
Angermueller, Christof ;
Parnamaa, Tanel ;
Parts, Leopold ;
Stegle, Oliver .
MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
[3]
[Anonymous], 2004, P 10 ACM SIGKDD INT
[4]
[Anonymous], 2017, AB BREAST CANC
[5]
[Anonymous], 2017, BREAST CANC PREV CON
[6]
[Anonymous], 2000, J. Official Statistics
[7]
Arab A., 2016, ADV BIOMED RES, V5, P115, DOI DOI 10.4103/2277-9175.185573
[8]
The forkhead transcription factor FOXM1 promotes endocrine resistance and invasiveness in estrogen receptor-positive breast cancer by expansion of stem-like cancer cells [J].
Bergamaschi, Anna ;
Madak-Erdogan, Zeynep ;
Kim, Yu Jin ;
Choi, Yoon-La ;
Lu, Hailing ;
Katzenellenbogen, Benita S. .
BREAST CANCER RESEARCH, 2014, 16 (05)
[9]
Glutamate enrichment as new diagnostic opportunity in breast cancer [J].
Budczies, Jan ;
Pfitzner, Berit M. ;
Gyoerffy, Balazs ;
Winzer, Klaus-Juergen ;
Radke, Cornelia ;
Dietel, Manfred ;
Fiehn, Oliver ;
Denkert, Carsten .
INTERNATIONAL JOURNAL OF CANCER, 2015, 136 (07) :1619-1628
[10]
Comparative metabolomics of estrogen receptor positive and estrogen receptor negative breast cancer: alterations in glutamine and beta-alanine metabolism [J].
Budczies, Jan ;
Brockmoeller, Scarlet F. ;
Mueller, Berit M. ;
Barupal, Dinesh K. ;
Richter-Ehrenstein, Christiane ;
Kleine-Tebbe, Anke ;
Griffin, Julian L. ;
Oresic, Matej ;
Dietel, Manfred ;
Denkert, Carsten ;
Fiehn, Oliver .
JOURNAL OF PROTEOMICS, 2013, 94 :279-288