Embracing the dropouts in single-cell RNA-seq analysis

被引:230
作者
Qiu, Peng [1 ,2 ]
机构
[1] Georgia Inst Technol, Dept Biomed Engn, 950 Atlantic Dr NW, Atlanta, GA 30332 USA
[2] Emory Univ, 950 Atlantic Dr NW, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
DIFFERENTIATION; RECONSTRUCTION; DYNAMICS;
D O I
10.1038/s41467-020-14976-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
One primary reason that makes single-cell RNA-seq analysis challenging is dropouts, where the data only captures a small fraction of the transcriptome of each cell. Almost all computational algorithms developed for single-cell RNA-seq adopted gene selection, dimension reduction or imputation to address the dropouts. Here, an opposite view is explored. Instead of treating dropouts as a problem to be fixed, we embrace it as a useful signal. We represent the dropout pattern by binarizing single-cell RNA-seq count data, and present a co-occurrence clustering algorithm to cluster cells based on the dropout pattern. We demonstrate in multiple published datasets that the binary dropout pattern is as informative as the quantitative expression of highly variable genes for the purpose of identifying cell types. We expect that recognizing the utility of dropouts provides an alternative direction for developing computational algorithms for single-cell RNA-seq analysis.
引用
收藏
页数:9
相关论文
共 41 条
[1]
An Introduction to the Analysis of Single-Cell RNA-Sequencing Data [J].
AlJanahi, Aisha A. ;
Danielsen, Mark ;
Dunbar, Cynthia E. .
MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT, 2018, 10 :189-196
[2]
M3Drop: dropout-based feature selection for scRNASeq [J].
Andrews, Tallulah S. ;
Hemberg, Martin .
BIOINFORMATICS, 2019, 35 (16) :2865-2867
[3]
Biology of oligodendrocyte and myelin in the mammalian central nervous system [J].
Baumann, N ;
Pham-Dinh, D .
PHYSIOLOGICAL REVIEWS, 2001, 81 (02) :871-927
[4]
Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[5]
Integrating single-cell transcriptomic data across different conditions, technologies, and species [J].
Butler, Andrew ;
Hoffman, Paul ;
Smibert, Peter ;
Papalexi, Efthymia ;
Satija, Rahul .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :411-+
[6]
Single-cell trajectories reconstruction, exploration and mapping of omics data with STREAM [J].
Chen, Huidong ;
Albergante, Luca ;
Hsu, Jonathan Y. ;
Lareau, Caleb A. ;
Lo Bosco, Giosue ;
Guan, Jihong ;
Zhou, Shuigeng ;
Gorban, Alexander N. ;
Bauer, Daniel E. ;
Aryee, Martin J. ;
Langenau, David M. ;
Zinovyev, Andrei ;
Buenrostro, Jason D. ;
Yuan, Guo-Cheng ;
Pinello, Luca .
NATURE COMMUNICATIONS, 2019, 10 (1)
[7]
Duo Angelo, 2018, F1000Res, V7, P1141, DOI 10.12688/f1000research.15666.3
[8]
Ensuring respect for human rights in employment [J].
Friedman, S .
INDUSTRIAL RELATIONS RESEARCH ASSOCIATION SERIES, PROCEEDINGS, 2001, :1-13
[9]
Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors [J].
Haghverdi, Laleh ;
Lun, Aaron T. L. ;
Morgan, Michael D. ;
Marioni, John C. .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :421-+
[10]
CEL-Seq2: sensitive highly-multiplexed single-cell RNA-Seq [J].
Hashimshony, Tamar ;
Senderovich, Naftalie ;
Avital, Gal ;
Klochendler, Agnes ;
de Leeuw, Yaron ;
Anavy, Leon ;
Gennert, Dave ;
Li, Shuqiang ;
Livak, Kenneth J. ;
Rozenblatt-Rosen, Orit ;
Dor, Yuval ;
Regev, Aviv ;
Yanai, Itai .
GENOME BIOLOGY, 2016, 17