Large-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression

被引:184
作者
Ewing, RM [1 ]
Ben Kahla, A [1 ]
Poirot, O [1 ]
Lopez, F [1 ]
Audic, S [1 ]
Claverie, JM [1 ]
机构
[1] CNRS, Struct & Genet Informat Lab, Unite Mixte Rech 1889, F-13402 Marseille 20, France
关键词
D O I
10.1101/gr.9.10.950
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large, publicly available collections of expressed sequence tags (ESTs) have been generated From Arabidopsis thaliana and rice (Oryza sativa). A potential, but relatively unexplored application of this data is in the study of plant gene expression, Other EST data, mainly From human and mouse, have been successfully used to point out genes exhibiting tissue- or disease-specific expression, as well as For identification of alternative transcripts. In this report, we go a step further in showing that computer analyses of plant EST data can be used to generate evidence of correlated expression patterns of genes across various tissues. Furthermore, tissue types and organs can be classified with respect to one another on the basis of their global gene expression patterns, As in previous studies, expression profiles are first estimated from EST counts. By clustering gene expression profiles or whole cDNA library profiles, we show that genes with similar functions, or cDNA libraries expected to share patterns of gene expression, are grouped together. Promising uses of this technique include functional genomics, in which evidence of correlated expression might complement Cor substitute for) those of sequence similarity in the annotation of anonymous genes and identification of surrogate markers. The analysis presented here combines the application of a correlation-based clustering method with a graphical color map allowing intuitive visualization of patterns within a large table of expression measurements.
引用
收藏
页码:950 / 959
页数:10
相关论文
共 64 条
  • [1] Toward the development of a gene index to the human genome: An assessment of the nature of high-throughput EST sequence data
    Aaronson, JS
    Eckman, B
    Blevins, RA
    Borkowski, JA
    Myerson, J
    Imran, S
    Elliston, KO
    [J]. GENOME RESEARCH, 1996, 6 (09): : 829 - 845
  • [2] SEQUENCE IDENTIFICATION OF 2,375 HUMAN BRAIN GENES
    ADAMS, MD
    DUBNICK, M
    KERLAVAGE, AR
    MORENO, R
    KELLEY, JM
    UTTERBACK, TR
    NAGLE, JW
    FIELDS, C
    VENTER, JC
    [J]. NATURE, 1992, 355 (6361) : 632 - 634
  • [3] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [4] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [5] A comparison of selected mRNA and protein abundances in human liver
    Anderson, L
    Seilhamer, J
    [J]. ELECTROPHORESIS, 1997, 18 (3-4) : 533 - 537
  • [6] Arber E.A. N., 1907, Journal of the Linnean Society of London, Botany, V38, P29, DOI DOI 10.1111/J.1095-8339.1907.TB01074.X
  • [7] The significance of digital gene expression profiles
    Audic, S
    Claverie, JM
    [J]. GENOME RESEARCH, 1997, 7 (10): : 986 - 995
  • [8] The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998
    Bairoch, A
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 38 - 42
  • [9] DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS
    BOGUSKI, MS
    LOWE, TMJ
    TOLSTOSHEV, CM
    [J]. NATURE GENETICS, 1993, 4 (04) : 332 - 333
  • [10] BOWMAN JL, 1993, DEVELOPMENT, V119, P721