Processing and modeling genome-wide expression data using singular value decomposition

被引:27
作者
Alter, O [1 ]
Brown, PO [1 ]
Botstein, D [1 ]
机构
[1] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
来源
MICROARRAYS: OPTICAL TECHNOLOGIES AND INFORMATICS | 2001年 / 4266卷
关键词
singular value decomposition (SVD); genome-wide expression data; DNA microarrays;
D O I
10.1117/12.427986
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
We describe the use of singular value decomposition in transforming genome-wide expression data from genes x arrays space to reduced diagonalized "eigengenes" x "eigenarrays" space, where the eigengenes (or eigenarrays) are unique orthonormal superpositions of the genes (or arrays). Normalizing the data by filtering out the eigengenes (and eigenarrays) that are inferred to represent additive or multiplicative noise, experimental artifacts, or even irrelevant biological processes enables meaningful comparison of the expression of different genes across different arrays in different experiments. Sorting the data according to the eigengenes and eigenarrays gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype, respectively. After normalization and sorting, the significant eigengenes and eigenarrays can be associated with observed genome-wide effects of regulators, or with measured samples, in which these regulators are overactive or underactive, respectively.
引用
收藏
页码:171 / 186
页数:16
相关论文
共 17 条
  • [1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [2] Singular value decomposition for genome-wide expression data processing and modeling
    Alter, O
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) : 10101 - 10106
  • [3] Anderson T., 1984, INTRO MULTIVARIATE S
  • [4] [Anonymous], 1996, MATRIX COMPUTATION
  • [5] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [6] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [7] MULTIPLEXED BIOCHEMICAL ASSAYS WITH BIOLOGICAL CHIPS
    FODOR, SPA
    RAVA, RP
    HUANG, XHC
    PEASE, AC
    HOLMES, CP
    ADAMS, CL
    [J]. NATURE, 1993, 364 (6437) : 555 - 556
  • [8] Statistical analysis of array expression data as applied to the problem of tamoxifen resistance
    Hilsenbeck, SG
    Friedrichs, WE
    Schiff, R
    O'Connell, P
    Hansen, RK
    Osborne, CK
    Fuqua, SAW
    [J]. JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1999, 91 (05) : 453 - 459
  • [9] Fundamental patterns underlying gene expression profiles: Simplicity from complexity
    Holter, NS
    Mitra, M
    Maritan, A
    Cieplak, M
    Banavar, JR
    Fedoroff, NV
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (15) : 8409 - 8414
  • [10] Mallat, 1999, WAVELET TOUR SIGNAL