A Pipeline for Automated Analysis of Flow Cytometry Data: Preliminary Results on Lymphoma Sub-Type Diagnosis

被引:7
作者
Bashashati, Ali [1 ]
Lo, Kenneth [2 ]
Gottardo, Raphael [3 ]
Gascoyne, Randy D. [4 ,5 ]
Weng, Andrew [4 ,5 ]
Brinkman, Ryan [1 ,6 ]
机构
[1] British Columbia Canc Res Ctr, Vancouver, BC V5Z 1L3, Canada
[2] Univ British Columbia, Dept Stat, Vancouver, BC V5Z 1M9, Canada
[3] Univ Montreal, Inst Rec Clin Montreal, Dept Biochem, Montreal, PQ H3C 3J7, Canada
[4] British Columbia Canc Agcy, Vancouver, BC, Canada
[5] Univ British Columbia, Dept Pathol, Lab Med, Vancouver, BC V5Z 1M9, Canada
[6] Univ British Columbia, Dept Med Genet, Vancouver, BC V5Z 1M9, Canada
来源
2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20 | 2009年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/IEMBS.2009.5332710
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Flow cytometry (FCM) is widely used in health research and is a technique to measure cell properties such as phenotype, cytokine expression, etc., for up to millions of cells from a sample. FCM data analysis is a highly tedious, subjective and manually time-consuming (to the level of impracticality for some data) process that is based on intuition rather than standardized statistical inference. This study proposes a pipeline for automatic analysis of FCM data. The proposed pipeline identifies biomarkers that correlate with physiological/pathological conditions and classifies the samples to specific pathological/physiological entities. The pipeline utilizes a model-based clustering approach to identify cell populations that share similar biological functions. Support vector machine (SVM) and random forest (RF) classifiers were then used to classify the samples and identify biomarkers associated with disease status. The performance of the proposed data analysis pipeline has been evaluated on lymphoma patients. Preliminary results show more than 90% accuracy in differentiating between some sub-types of lymphoma. The proposed pipeline also finds biologically meaningful biomarkers that differ between lymphoma sub-types.
引用
收藏
页码:4945 / +
页数:2
相关论文
共 17 条
[1]   DNA histogram analysis for node-negative breast cancer [J].
Bagwell, CB .
CYTOMETRY PART A, 2004, 58A (01) :76-78
[2]   PATTERN SORTING - A COMPUTER-CONTROLLED MULTIDIMENSIONAL SORTING METHOD USING K-D TREES [J].
BIGOS, M ;
PARKS, DR ;
MOORE, WA ;
HERZENBERG, LA ;
HERZENBERG, LA .
CYTOMETRY, 1994, 16 (04) :357-363
[3]   Impact of flow cytometry on the diagnosis and characterization of lymphomas, chronic lymphoproliferative disorders and plasma cell neoplasias [J].
Braylan, RC .
CYTOMETRY PART A, 2004, 58A (01) :57-61
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[6]  
COUSAR JB, 1995, AM J CLIN PATHOL, V104, P126
[7]   Modern histological classification of low grade B-cell lymphomas [J].
Dogan, A .
BEST PRACTICE & RESEARCH CLINICAL HAEMATOLOGY, 2005, 18 (01) :11-26
[8]  
Frankel DS, 1996, CYTOMETRY, V23, P290, DOI 10.1002/(SICI)1097-0320(19960401)23:4<290::AID-CYTO5>3.0.CO
[9]  
2-L
[10]  
Hengel RL, 2001, CLIN LAB MED, V21, P841