Stability and aggregation of ranked gene lists

Cited by: 121
Authors
Boulesteix, Anne-Laure [1]
Slawski, Martin [1]
Affiliations
[1] Univ Munich, Fac Med, D-80539 Munich, Germany
Keywords
Univariate analysis; differential expression; top-list; ranking; variability; bootstrap; differentially expressed genes; microarray data; Bioconductor package; t-test; reproducibility; classification; meta-analysis; framework; selection
DOI
10.1093/bib/bbp034
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Ranked gene lists are highly unstable in the sense that similar measures of differential gene expression may yield very different rankings, and that a small change in the data set usually affects the resulting gene list considerably. Stability issues have long been under-considered in the literature, but they have become a hot topic in the last few years, perhaps as a consequence of increasing skepticism about the reproducibility and clinical applicability of molecular research findings. In this article, we review existing approaches for assessing the stability of ranked gene lists and the related problem of aggregation, give some practical recommendations, and warn against potential misuse of these methods. This overview is illustrated through an application to a recent leukemia data set using the freely available Bioconductor package GeneSelector.
Pages: 556-568
Page count: 13
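
The kind of instability the abstract describes (a small change in the data altering the top of the list) can be illustrated with a minimal sketch. This is not the GeneSelector API and not necessarily the procedure used in the article: the function names (`simulate`, `top_k`, `bootstrap_overlap`), the choice of a Welch t-statistic as the ranking criterion, the stratified bootstrap, and the simulated data are all assumptions made for illustration only.

```python
import random


def mean_var(x):
    """Sample mean and unbiased sample variance (needs at least 2 values)."""
    n = len(x)
    m = sum(x) / n
    v = sum((xi - m) ** 2 for xi in x) / (n - 1)
    return m, v


def welch_t(x, y):
    """Welch t-statistic; returns 0.0 if both groups have zero variance."""
    mx, vx = mean_var(x)
    my, vy = mean_var(y)
    se = (vx / len(x) + vy / len(y)) ** 0.5
    return (mx - my) / se if se > 0 else 0.0


def top_k(expr, labels, k):
    """Set of the k gene indices with the largest absolute t-statistic."""
    scores = []
    for g, row in enumerate(expr):
        a = [v for v, lab in zip(row, labels) if lab == 0]
        b = [v for v, lab in zip(row, labels) if lab == 1]
        scores.append((abs(welch_t(a, b)), g))
    scores.sort(reverse=True)
    return {g for _, g in scores[:k]}


def bootstrap_overlap(expr, labels, k, n_boot, rng):
    """Mean fraction of the original top-k list recovered across bootstrap resamples."""
    reference = top_k(expr, labels, k)
    idx0 = [i for i, lab in enumerate(labels) if lab == 0]
    idx1 = [i for i, lab in enumerate(labels) if lab == 1]
    total = 0.0
    for _ in range(n_boot):
        # Stratified resampling with replacement keeps both group sizes fixed.
        sample = rng.choices(idx0, k=len(idx0)) + rng.choices(idx1, k=len(idx1))
        boot_expr = [[row[i] for i in sample] for row in expr]
        boot_labels = [labels[i] for i in sample]
        total += len(reference & top_k(boot_expr, boot_labels, k)) / k
    return total / n_boot


def simulate(n_genes, n_per_group, n_true, shift, rng):
    """Toy data: Gaussian noise, with the first n_true genes shifted in group 1."""
    labels = [0] * n_per_group + [1] * n_per_group
    expr = []
    for g in range(n_genes):
        delta = shift if g < n_true else 0.0
        expr.append([rng.gauss(0.0, 1.0) + (delta if lab == 1 else 0.0)
                     for lab in labels])
    return expr, labels


if __name__ == "__main__":
    rng = random.Random(0)
    expr, labels = simulate(n_genes=200, n_per_group=10, n_true=10, shift=1.0, rng=rng)
    score = bootstrap_overlap(expr, labels, k=10, n_boot=30, rng=rng)
    print(f"mean top-10 overlap with the original list: {score:.2f}")
```

An overlap close to 1 would indicate a top-list that is largely reproduced under resampling; values well below 1 correspond to the instability discussed in the abstract. The overlap fraction is only one of several possible stability measures, and it is used here purely because it is simple to compute.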