Multivariate search for differentially expressed gene combinations

被引:31
作者
Xiao, YH
Frisina, R
Gordon, A
Klebanov, L
Yakovlev, A
机构
[1] Univ Rochester, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
[2] Univ Rochester, Dept Otolaryngol, Rochester, NY 14642 USA
[3] Univ Rochester, Dept Neurobiol & Anat, Rochester, NY 14642 USA
[4] Univ Rochester, Dept Biomed Engn, Rochester, NY 14642 USA
[5] Charles Univ Prague, Dept Probabil & Stat, CZ-18675 Prague 8, Czech Republic
关键词
D O I
10.1186/1471-2105-5-164
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: To identify differentially expressed genes, it is standard practice to test a two-sample hypothesis for each gene with a proper adjustment for multiple testing. Such tests are essentially univariate and disregard the multidimensional structure of microarray data. A more general two-sample hypothesis is formulated in terms of the joint distribution of any sub-vector of expression signals. Results: By building on an earlier proposed multivariate test statistic, we propose a new algorithm for identifying differentially expressed gene combinations. The algorithm includes an improved random search procedure designed to generate candidate gene combinations of a given size. Cross-validation is used to provide replication stability of the search procedure. A permutation two-sample test is used for significance testing. We design a multiple testing procedure to control the family-wise error rate (FWER) when selecting significant combinations of genes that result from a successive selection procedure. A target set of genes is composed of all significant combinations selected via random search. Conclusions: A new algorithm has been developed to identify differentially expressed gene combinations. The performance of the proposed search-and-testing procedure has been evaluated by computer simulations and analysis of replicated Affymetrix gene array data on age-related changes in gene expression in the inner ear of CBA mice.
引用
收藏
页数:10
相关论文
共 33 条
[1]   A simulated annealing algorithm for maximum likelihood pedigree reconstruction [J].
Almudevar, A .
THEORETICAL POPULATION BIOLOGY, 2003, 63 (02) :63-75
[2]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[3]  
[Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
[4]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[5]   Multivariate approach for selecting sets of differentially expressed genes [J].
Chilingaryan, A ;
Gevorgyan, N ;
Vardanyan, A ;
Jones, D ;
Szabo, A .
MATHEMATICAL BIOSCIENCES, 2002, 176 (01) :59-69
[6]  
CORSO JF, 1980, AUDIOLOGY, V19, P221
[7]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[8]   Speech recognition in noise and presbycusis: Relations to possible neural mechanisms [J].
Frisina, DR ;
Frisina, RD .
HEARING RESEARCH, 1997, 106 (1-2) :95-104
[9]  
Frisina DR, 2001, FUNCTIONAL NEUROBIOL, P565, DOI DOI 10.1016/B978-012351830-9/50041-X
[10]  
FRISINA DR, 2001, FUNCTIONAL NEUROBIOL, P531