Exhaustive mining of EST libraries for genes differentially expressed in normal and tumour tissues

被引:106
作者
Schmitt, AO [1 ]
Specht, T [1 ]
Beckmann, G [1 ]
Dahl, E [1 ]
Pilarsky, CP [1 ]
Hinzmann, B [1 ]
Rosenthal, A [1 ]
机构
[1] MetaGen Gesell Genomforsch mbH, D-14195 Berlin, Germany
关键词
D O I
10.1093/nar/27.21.4251
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A four-step procedure for the efficient and systematic mining of whole EST libraries for differentially expressed genes is presented. After eliminating redundant entries from the EST library under investigation (step 1), contigs of maximal length are built upon each remaining EST using about 4000000 public and proprietary ESTs (step 2), These putative genes are compared against a database comprising ESTs from 16 different tissues (both normal and tumour affected) to determine whether or not they are differentially expressed (step 3; electronic northern). Fisher's exact test is used to assess the significance of differential expression. In step 4, an attempt is made to characterise the contigs obtained in the assembly through database comparison. A case study of the CGAP library NCI_CGAP_Br1.1, a library made from three (well, moderately, and poorly differentiated) invasive ductal breast tumours (2126 ESTs in total) was carried out. Of the maximal contigs, 139 were found to be significantly (alpha = 0.05) overexpressed in breast tumour tissue, while 13 appeared to be down-regulated.
引用
收藏
页码:4251 / 4260
页数:10
相关论文
共 43 条
[1]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[2]   H19 overexpression in breast adenocarcinoma stromal cells is associated with tumor values and steroid receptor status but independent of p53 and Ki-67 expression [J].
Adriaenssens, E ;
Dumont, L ;
Lottin, S ;
Bolle, D ;
Leprêtre, A ;
Delobelle, A ;
Bouali, F ;
Dugimont, T ;
Coll, J ;
Curgy, JJ .
AMERICAN JOURNAL OF PATHOLOGY, 1998, 153 (05) :1597-1607
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]   A comparison of selected mRNA and protein abundances in human liver [J].
Anderson, L ;
Seilhamer, J .
ELECTROPHORESIS, 1997, 18 (3-4) :533-537
[6]   The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995
[7]   Identification and mapping of human cDNAs homologous to Drosophila mutant genes through EST database searching [J].
Banfi, S ;
Borsani, G ;
Rossi, E ;
Bernard, L ;
Guffanti, A ;
Rubboli, F ;
Marchitiello, A ;
Giglio, S ;
Coluccia, E ;
Zollo, M ;
Zuffardi, O ;
Ballabio, A .
NATURE GENETICS, 1996, 13 (02) :167-174
[8]   GenBank [J].
Benson, DA ;
Boguski, MS ;
Lipman, DJ ;
Ostell, J ;
Ouellette, BFF .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :1-7
[9]   ESTABLISHING A HUMAN TRANSCRIPT MAP [J].
BOGUSKI, MS ;
SCHULER, GD .
NATURE GENETICS, 1995, 10 (04) :369-371
[10]   A new DNA sequence assembly program [J].
Bonfield, JK ;
Smith, KF ;
Staden, R .
NUCLEIC ACIDS RESEARCH, 1995, 23 (24) :4992-4999