Extracting knowledge from genomic experiments by incorporating the biomedical literature

Times cited: 5
Authors
Sluka, JP
Institution
Source
METHODS OF MICROARRAY DATA ANALYSIS II | 2002
Keywords
gene expression analysis; literature; DNA microarray; PDQ_MED; text mining
DOI
10.1007/0-306-47598-7_14
Chinese Library Classification (CLC) number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
We present a technique to extract relevant information from the literature to aid in the analysis of a typical genomics data set. Analysis was conducted using PDQ_MED, a program based on the assumption that if two genes are found to be related under an experimental paradigm, such as a gene chip experiment, then any literature that relates the two genes is of interest. PDQ_MED searches MEDLINE for abstracts that contain two or more of the terms in the user's query set. For this paper, we used PDQ_MED to analyze 160 genes up-regulated in acute myeloid leukemia (AML) from the NCI-60 dataset. PDQ_MED executed 12,880 queries to MEDLINE and identified nearly 300,000 abstracts that refer to at least one of the 160 terms. PDQ_MED identified and analyzed a set of 81 terms that can be grouped together via the literature. In addition, there is literature directly linking 52 of the terms with AML. Overall, the literature analysis identified 1028 sentences that directly relate two or more of the query genes.
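The query count in the abstract can be checked with a short sketch. It assumes, as an inference that the abstract does not state, that the 12,880 queries consist of one query per single term plus one query per unordered pair of terms; for 160 terms that gives 160 + 12,720 = 12,880. The function name `build_queries` and the placeholder term names are hypothetical and are not part of PDQ_MED.

```python
from itertools import combinations
from math import comb


def build_queries(terms):
    """Return one tuple per single term and one per unordered pair of terms.

    Assumption: this single-plus-pairwise layout is inferred from the
    reported query count; it is not confirmed as PDQ_MED's actual scheme.
    """
    singles = [(t,) for t in terms]
    pairs = list(combinations(terms, 2))
    return singles + pairs


# Hypothetical placeholder identifiers standing in for the 160 query genes.
terms = [f"gene{i}" for i in range(1, 161)]
queries = build_queries(terms)

n_pairs = comb(len(terms), 2)   # 160 * 159 / 2 = 12720 pairwise queries
print(len(terms) + n_pairs)     # 12880
print(len(queries))             # 12880
```

Under this assumption the pairwise queries dominate the workload, which grows quadratically with the number of terms in the query set.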
Pages: 195-209
Page count: 15