CCAST: A Model-Based Gating Strategy to Isolate Homogeneous Subpopulations in a Heterogeneous Population of Single Cells

被引:15
作者
Anchang, Benedict [1 ]
Do, Mary T. [1 ]
Zhao, Xi [1 ]
Plevritis, Sylvia K. [1 ]
机构
[1] Stanford Univ, Dept Radiol, Ctr Canc Syst Biol, Stanford, CA 94305 USA
关键词
FLOW-CYTOMETRY DATA; CELLULAR HIERARCHY; CANCER;
D O I
10.1371/journal.pcbi.1003664
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
A model-based gating strategy is developed for sorting cells and analyzing populations of single cells. The strategy, named CCAST, for Clustering, Classification and Sorting Tree, identifies a gating strategy for isolating homogeneous subpopulations from a heterogeneous population of single cells using a data-derived decision tree representation that can be applied to cell sorting. Because CCAST does not rely on expert knowledge, it removes human bias and variability when determining the gating strategy. It combines any clustering algorithm with silhouette measures to identify underlying homogeneous subpopulations, then applies recursive partitioning techniques to generate a decision tree that defines the gating strategy. CCAST produces an optimal strategy for cell sorting by automating the selection of gating markers, the corresponding gating thresholds and gating sequence; all of these parameters are typically manually defined. Even though CCAST is optimized for cell sorting, it can be applied for the identification and analysis of homogeneous subpopulations among heterogeneous single cell data. We apply CCAST on single cell data from both breast cancer cell lines and normal human bone marrow. On the SUM159 breast cancer cell line data, CCAST indicates at least five distinct cell states based on two surface markers (CD24 and EPCAM) and provides a gating sorting strategy that produces more homogeneous subpopulations than previously reported. When applied to normal bone marrow data, CCAST reveals an efficient strategy for gating T-cells without prior knowledge of the major T-cell subtypes and the markers that best define them. On the normal bone marrow data, CCAST also reveals two major mature B-cell subtypes, namely CD123+ and CD123- cells, which were not revealed by manual gating but show distinct intracellular signaling responses. More generally, the CCAST framework could be used on other biological and non-biological high dimensional data types that are mixtures of unknown homogeneous subpopulations.
引用
收藏
页数:14
相关论文
共 28 条
[1]
Aghaeepour N, 2013, NAT METHODS, V10, P228, DOI [10.1038/nmeth.2365, 10.1038/NMETH.2365]
[2]
RchyOptimyx: Cellular hierarchy optimization for flow cytometry [J].
Aghaeepour, Nima ;
Jalali, Adrin ;
O'Neill, Kieran ;
Chattopadhyay, Pratip K. ;
Roederer, Mario ;
Hoos, Holger H. ;
Brinkman, Ryan R. .
CYTOMETRY PART A, 2012, 81A (12) :1022-1030
[3]
Early immunologic correlates of HIV protection can be identified from computational analysis of complex multivariate T-cell flow cytometry assays* [J].
Aghaeepour, Nima ;
Chattopadhyay, Pratip K. ;
Ganesan, Anuradha ;
O'Neill, Kieran ;
Zare, Habil ;
Jalali, Adrin ;
Hoos, Holger H. ;
Roederer, Mario ;
Brinkman, Ryan R. .
BIOINFORMATICS, 2012, 28 (07) :1009-1016
[4]
[Anonymous], J MACHINE LEARING RE
[5]
Bashashati Ali., 2009, Advances in Bioinformatics, V2009, P1, DOI DOI 10.1155/2009/584603
[6]
An EM-Like Algorithm for Semi- and Nonparametric Estimation in Multivariate Mixtures [J].
Benaglia, Tatiana ;
Chauveau, Didier ;
Hunter, David R. .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (02) :505-526
[7]
Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum [J].
Bendall, Sean C. ;
Simonds, Erin F. ;
Qiu, Peng ;
Amir, El-ad D. ;
Krutzik, Peter O. ;
Finck, Rachel ;
Bruggner, Robert V. ;
Melamed, Rachel ;
Trejo, Angelica ;
Ornatsky, Olga I. ;
Balderas, Robert S. ;
Plevritis, Sylvia K. ;
Sachs, Karen ;
Pe'er, Dana ;
Tanner, Scott D. ;
Nolan, Garry P. .
SCIENCE, 2011, 332 (6030) :687-696
[8]
Mixture modeling approach to flow cytometry data [J].
Boedigheimer, Michael J. ;
Ferbas, John .
CYTOMETRY PART A, 2008, 73A (05) :421-429
[9]
Del Giudice I, 2004, HAEMATOLOGICA, V89, P303
[10]
Ellis B, 2013, FLOWCORE BASIC STRUC