flowClust: a Bioconductor package for automated gating of flow cytometry data

被引:126
作者
Lo, Kenneth [1 ]
Hahne, Florian [2 ]
Brinkman, Ryan R. [3 ]
Gottardo, Raphael [4 ,5 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z2, Canada
[2] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
[3] BC Canc Res Ctr, Terry Fox Lab, Vancouver, BC V5Z 1L3, Canada
[4] Inst Rech Clin Montreal, Montreal, PQ H2W 1R7, Canada
[5] Univ Montreal, Dept Biochim, Montreal, PQ H3T 1J4, Canada
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
BIOINFORMATICS; DIAGNOSIS;
D O I
10.1186/1471-2105-10-145
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: As a high-throughput technology that offers rapid quantification of multidimensional characteristics for millions of cells, flow cytometry (FCM) is widely used in health research, medical diagnosis and treatment, and vaccine development. Nevertheless, there is an increasing concern about the lack of appropriate software tools to provide an automated analysis platform to parallelize the high-throughput data-generation platform. Currently, to a large extent, FCM data analysis relies on the manual selection of sequential regions in 2-D graphical projections to extract the cell populations of interest. This is a time-consuming task that ignores the high-dimensionality of FCM data. Results: In view of the aforementioned issues, we have developed an R package called flowClust to automate FCM analysis. flowClust implements a robust model-based clustering approach based on multivariate t mixture models with the Box-Cox transformation. The package provides the functionality to identify cell populations whilst simultaneously handling the commonly encountered issues of outlier identification and data transformation. It offers various tools to summarize and visualize a wealth of features of the clustering results. In addition, to ensure its convenience of use, flowClust has been adapted for the current FCM data format, and integrated with existing Bioconductor packages dedicated to FCM analysis. Conclusion: flowClust addresses the issue of a dearth of software that helps automate FCM analysis with a sound theoretical foundation. It tends to give reproducible results, and helps reduce the significant subjectivity and human time cost encountered in FCM analysis. The package contributes to the cytometry community by offering an efficient, automated analysis platform which facilitates the active, ongoing technological advancement.
引用
收藏
页数:8
相关论文
共 42 条
[1]  
[Anonymous], PROGRAMMING DATA GUI
[2]  
[Anonymous], 2006, TECHNICAL REPORT
[3]   DNA histogram analysis for node-negative breast cancer [J].
Bagwell, CB .
CYTOMETRY PART A, 2004, 58A (01) :76-78
[4]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[5]   AN ANALYSIS OF TRANSFORMATIONS REVISITED [J].
BICKEL, PJ ;
DOKSUM, KA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (374) :296-311
[6]   Impact of flow cytometry on the diagnosis and characterization of lymphomas, chronic lymphoproliferative disorders and plasma cell neoplasias [J].
Braylan, RC .
CYTOMETRY PART A, 2004, 58A (01) :57-61
[7]   High-content flow cytometry and temporal data analysis for defining a cellular signature graft-versus-host disease [J].
Brinkman, Ryan Remy ;
Gasparetto, Maura ;
Lee, Shang-Jung Jessica ;
Ribickas, Albert J. ;
Perkins, Janelle ;
Janssen, William ;
Smiley, Renee ;
Smith, Clay .
BIOLOGY OF BLOOD AND MARROW TRANSPLANTATION, 2007, 13 (06) :691-700
[8]   Beyond six colors: A new era in flow cytometry [J].
De Rosa, Stephen C. ;
Brenchley, Jason M. ;
Roederer, Mario .
NATURE MEDICINE, 2003, 9 (01) :112-117
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   How many clusters? Which clustering method? Answers via model-based cluster analysis [J].
Fraley, C ;
Raftery, AE .
COMPUTER JOURNAL, 1998, 41 (08) :578-588