Computational methods for the identification of differential and coordinated gene expression

被引:210
作者
Claverie, JM [1 ]
机构
[1] CNRS, UMR 1889, Struct & Genet Informat Lab, F-13402 Marseille 20, France
关键词
D O I
10.1093/hmg/8.10.1821
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With the first complete 'draft' of the human genome sequence expected for Spring 2000, the three basic challenges for today's bioinformatics are more than ever: (i) finding the genes; (ii) locating their coding regions; and (iii) predicting their functions. However, our capacity for interpreting vertebrate genomic and transcript (cDNA) sequences using experimental or computational means very much lags behind our raw sequencing power, If the performances of current programs in identifying internal coding exons are goad, the precise 5'-->3' delineation of transcription units land promoters) still requires additional experiments, Similarly, functional predictions made with reference to previously characterized homologues are leaving >50% of human genes unannotated or classified in uninformative categories ('kinase', 'ATP-binding', etc.), In the context of functional genomics, large-scale gene expression studies using massive cDNA tag sequencing, two-dimensional gel proteome analysis or microarray technologies are the only approaches providing genome-scale experimental information at a pace consistent with the progress of sequencing, Given the difficulty and cost of characterizing genes one by one, academic and industrial researchers are increasingly relying on those methods to prioritize their studies and choose their targets, The study of expression patterns can also provide some insight into the function, reveal regulatory pathways, indicate side effects of drugs or serve as a diagnostic tool, In this article, I review the theoretical and computational approaches used to: (i) identify genes differentially expressed (across cell types, developmental stages, pathological conditions, etc.); (ii) identify genes expressed in a coordinated manner across a set of conditions; and (iii) delineate clusters of genes sharing coherent expression features, eventually defining global biological pathways.
引用
收藏
页码:1821 / 1832
页数:12
相关论文
共 108 条
[1]   Toward the development of a gene index to the human genome: An assessment of the nature of high-throughput EST sequence data [J].
Aaronson, JS ;
Eckman, B ;
Blevins, RA ;
Borkowski, JA ;
Myerson, J ;
Imran, S ;
Elliston, KO .
GENOME RESEARCH, 1996, 6 (09) :829-845
[2]   SEQUENCE IDENTIFICATION OF 2,375 HUMAN BRAIN GENES [J].
ADAMS, MD ;
DUBNICK, M ;
KERLAVAGE, AR ;
MORENO, R ;
KELLEY, JM ;
UTTERBACK, TR ;
NAGLE, JW ;
FIELDS, C ;
VENTER, JC .
NATURE, 1992, 355 (6361) :632-634
[3]  
ADAMS MD, 1995, NATURE, V377, P3
[4]   cDNA libraries from single human preimplantation embryos [J].
Adjaye, J ;
Daniels, R ;
Bolton, V ;
Monk, M .
GENOMICS, 1997, 46 (03) :337-344
[5]   A comparison of selected mRNA and protein abundances in human liver [J].
Anderson, L ;
Seilhamer, J .
ELECTROPHORESIS, 1997, 18 (3-4) :533-537
[6]  
ANDERSON NL, 1984, CLIN CHEM, V30, P2031
[7]  
[Anonymous], P ACM RECOMB 11 14 A
[8]   A test case of correlation metric construction of a reaction pathway from measurements [J].
Arkin, A ;
Shen, PD ;
Ross, J .
SCIENCE, 1997, 277 (5330) :1275-1279
[9]   PRINTS prepares for the new millennium [J].
Attwood, TK ;
Flower, DR ;
Lewis, AP ;
Mabey, JE ;
Morgan, SR ;
Scordis, P ;
Selley, JN ;
Wright, W .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :220-225
[10]   Visualizing the competitive recognition of TATA-boxes in vertebrate promoters [J].
Audic, S ;
Claverie, JM .
TRENDS IN GENETICS, 1998, 14 (01) :10-11