class prediction;
cross-validation;
feature selection;
frequency of selection;
stable feature set;
GENE SELECTION;
CANCER CLASSIFICATION;
PERSONALIZED MEDICINE;
MICROARRAY;
VALIDATION;
TUMOR;
ALGORITHMS;
PREDICTION;
DIAGNOSIS;
PATTERNS;
DOI:
10.1093/bib/bbp016
Chinese Library Classification (CLC) number:
Q5 [Biochemistry];
Discipline classification codes:
071010 ;
081704 ;
Abstract:
Recent developments in high-throughput technology have accelerated interest in molecular biomarker classifiers for safety assessment, disease diagnostics and prognostics, and prediction of response for patient assignment. This article reviews and evaluates some important aspects and key issues in the development of biomarker classifiers. Development of a biomarker classifier for high-throughput data involves two components: (i) model building and (ii) performance assessment. This article focuses on feature selection in model building and cross-validation for performance assessment. A frequency approach to feature selection is presented and compared to the conventional approach in terms of predictive accuracy and stability of the selected feature set. The two approaches are compared based on four biomarker classifiers, each combining a different feature selection method with a well-known classification algorithm. In each of the four classifiers, the feature set selected by the frequency approach is more stable than the feature set selected by the conventional approach.
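The abstract does not spell out how the frequency approach works, so the following is only a minimal sketch under stated assumptions: features are ranked inside each training fold of repeated cross-validation, the top k are selected per fold, and each feature's frequency of selection is the fraction of folds in which it was chosen. The scoring function, the toy data, the value of k, and the 0.8 frequency cutoff are all illustrative choices, not the authors' settings.

```python
import random
from collections import Counter
from statistics import mean, pstdev


def filter_score(values, labels):
    """Absolute class-mean difference scaled by the summed class spreads.
    A simple stand-in filter score (an assumption, not the paper's method)."""
    a = [v for v, lab in zip(values, labels) if lab == 0]
    b = [v for v, lab in zip(values, labels) if lab == 1]
    return abs(mean(a) - mean(b)) / (pstdev(a) + pstdev(b) + 1e-12)


def top_k_features(X, y, rows, k):
    """Rank features using only the given training rows; return the k best indices."""
    labels = [y[i] for i in rows]
    n_features = len(X[0])
    scores = [filter_score([X[i][j] for i in rows], labels) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)[:k]


def selection_frequency(X, y, k=3, n_folds=5, n_repeats=10, seed=0):
    """Fraction of training folds (over repeated CV) in which each feature is selected."""
    rng = random.Random(seed)
    counts = Counter()
    n = len(X)
    for _ in range(n_repeats):
        order = list(range(n))
        rng.shuffle(order)
        folds = [order[f::n_folds] for f in range(n_folds)]
        for held_out in folds:
            held = set(held_out)
            # Selection sees training rows only, so held-out samples stay unseen.
            train = [i for i in order if i not in held]
            counts.update(top_k_features(X, y, train, k))
    total = n_repeats * n_folds
    return {j: c / total for j, c in counts.items()}


def make_toy_data(n=40, p=20, seed=1):
    """Hypothetical data: only features 0 and 1 differ between the two classes."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n)]
    X = [[rng.gauss(3.0 * y[i] if j < 2 else 0.0, 1.0) for j in range(p)]
         for i in range(n)]
    return X, y


X, y = make_toy_data()
freq = selection_frequency(X, y, k=3)
# Keep features chosen in at least 80% of folds (the cutoff is an arbitrary choice).
stable = sorted(j for j, f in freq.items() if f >= 0.8)
```

In this sketch the conventional approach would correspond to a single call to `top_k_features` on all samples. Because selection is repeated inside every training fold, the same loop could also score a classifier on each held-out fold without letting the selection step see those samples.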
Authors and affiliations:
Baek, Songjoon (US FDA, Natl Ctr Toxicol Res, Biometry Branch, Div Personalized Nutr & Med, Jefferson, AR 72079 USA);
Moon, Hojin (Calif State Univ Long Beach, Dept Math & Stat, Long Beach, CA 90840 USA);
Ahn, Hongshik (SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA);
Kodell, Ralph L. (Univ Arkansas Med Sci, Dept Biostat, Little Rock, AR 72205 USA);
Lin, Chien-Ju (US FDA, Natl Ctr Toxicol Res, Biometry Branch, Div Personalized Nutr & Med, Jefferson, AR 72079 USA);
Chen, James J. (US FDA, Natl Ctr Toxicol Res, Biometry Branch, Div Personalized Nutr & Med, Jefferson, AR 72079 USA)