Computational assignment Of cell-cycle stage from single-cell transcriptome data

被引:270
作者
Scialdone, Antonio [1 ,2 ]
Natarajan, Kedar N. [1 ,2 ]
Saraiva, Luis R. [1 ,2 ]
Proserpio, Valentina [1 ,2 ]
Teichmann, Sarah A. [1 ,2 ]
Stegle, Oliver [2 ]
Marioni, John C. [1 ,2 ]
Buettner, Florian [2 ,3 ]
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Cambridge CB10 1SD, England
[3] Helmholtz Zentrum Munchen, Inst Computat Biol, D-85764 Neuherberg, Germany
基金
欧洲研究理事会; 英国医学研究理事会;
关键词
Single cell RNA-seq; Computational biology; Cell cycle; Machine learning; RNA-SEQ; GENE-EXPRESSION; HETEROGENEITY; DYNAMICS;
D O I
10.1016/j.ymeth.2015.06.021
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
The transcriptome of single cells can reveal important information about cellular states and heterogeneity within populations of cells. Recently, single-cell RNA-sequencing has facilitated expression profiling of large numbers of single cells in parallel. To fully exploit these data, it is critical that suitable computational approaches are developed. One key challenge, especially pertinent when considering dividing populations of cells, is to understand the cell-cycle stage of each captured cell. Here we describe and compare five established supervised machine learning methods and a custom-built predictor for allocating cells to their cell-cycle stage on the basis of their transcriptome. In particular, we assess the impact of different normalisation strategies and the usage of prior knowledge on the predictive power of the classifiers. We tested the methods on previously published datasets and found that a PCA-based approach and the custom predictor performed best. Moreover, our analysis shows that the performance depends strongly on normalisation and the usage of prior knowledge. Only by leveraging prior knowledge in form of cell-cycle annotated genes and by preprocessing the data using a rank-based normalisation, is it possible to robustly capture the transcriptional cell-cycle signature across different cell types, organisms and experimental protocols. (C) 2015 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.orgilicenses/by/4.0/).
引用
收藏
页码:54 / 61
页数:8
相关论文
共 34 条
[1]
switchBox: an R package for k-Top Scoring Pairs classifier development [J].
Afsari, Bahman ;
Fertig, Elana J. ;
Geman, Donald ;
Marchionni, Luigi .
BIOINFORMATICS, 2015, 31 (02) :273-274
[2]
HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[3]
Cell cycle regulation during early mouse embryogenesis [J].
Artus, Jerome ;
Cohen-Tannoudji, Michel .
MOLECULAR AND CELLULAR ENDOCRINOLOGY, 2008, 282 (1-2) :78-86
[4]
Genome-wide transcriptional analysis of the human cell cycle identifies genes differentially regulated in normal and cancer cells [J].
Bar-Joseph, Ziv ;
Siegfried, Zahava ;
Brandeis, Michael ;
Brors, Benedikt ;
Lu, Yong ;
Eils, Roland ;
Dynlacht, Brian D. ;
Simon, Itamar .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (03) :955-960
[5]
Brennecke P, 2013, NAT METHODS, V10, P1093, DOI [10.1038/NMETH.2645, 10.1038/nmeth.2645]
[6]
Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells [J].
Buettner, Florian ;
Natarajan, Kedar N. ;
Casale, F. Paolo ;
Proserpio, Valentina ;
Scialdone, Antonio ;
Theis, Fabian J. ;
Teichmann, Sarah A. ;
Marioni, John C. ;
Stegie, Oliver .
NATURE BIOTECHNOLOGY, 2015, 33 (02) :155-160
[7]
Cell cycle in mouse development [J].
Ciemerych, MA ;
Sicinski, P .
ONCOGENE, 2005, 24 (17) :2877-2898
[8]
SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[9]
Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells [J].
Deng, Qiaolin ;
Ramskold, Daniel ;
Reinius, Bjorn ;
Sandberg, Rickard .
SCIENCE, 2014, 343 (6167) :193-196
[10]
Cyclebase.org: version 2.0, an updated comprehensive, multi-species repository of cell cycle experiments and derived analysis results [J].
Gauthier, Nicholas Paul ;
Jensen, Lars Juhl ;
Wernersson, Rasmus ;
Brunak, Soren ;
Jensen, Thomas S. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D699-D702