A combined approach to data mining of textual and structured data to identify cancer-related targets

被引:53
作者
Pospisil, Pavel
Iyer, Lakshmanan K.
Adelstein, S. James
Kassis, Amin I.
机构
[1] Harvard Univ, Sch Med, Dept Radiol, Boston, MA 02115 USA
[2] Harvard Univ, Bauer Ctr Genom Res, Cambridge, MA USA
来源
BMC BIOINFORMATICS | 2006年 / 7卷
关键词
D O I
10.1186/1471-2105/7/354
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: We present an effective, rapid, systematic data mining approach for identifying genes or proteins related to a particular interest. A selected combination of programs exploring PubMed abstracts, universal gene/protein knowledge bases (LSGraph and Ingenuity Pathway Analysis) was assembled to distinguish enzymes with hydrolytic activities that are expressed in the extracellular space of cancer cells. Proteins were identified with respect to six types of cancer occurring in the prostate, breast, lung, colon, ovary, and pancreas. Results: The data mining method identified previously undetected targets. Our combined strategy applied to each cancer type identified a minimum of 375 proteins expressed within the extracellular space and/or attached to the plasma membrane. The method led to the recognition of human cancer-related hydrolases (on average, similar to 35 per cancer type), among which were prostatic acid phosphatase, prostate-specific antigen, and sulfatase 1. Conclusion: The combined data mining of several databases overcame many of the limitations of querying a single database and enabled the facile identification of gene products. In the case of cancer-related targets, it produced a list of putative extracellular, hydrolytic enzymes that merit additional study as candidates for cancer radioimaging and radiotherapy. The proposed data mining strategy is of a general nature and can be applied to other biological databases for understanding biological functions and diseases.
引用
收藏
页数:11
相关论文
共 41 条
[1]  
Aggarwal Kunal, 2003, Briefings in Functional Genomics & Proteomics, V2, P175, DOI 10.1093/bfgp/2.3.175
[2]   Systems biology and the molecular circuits of cancer [J].
Alberghina, L ;
Chiaradonna, F ;
Vanoni, M .
CHEMBIOCHEM, 2004, 5 (10) :1322-1333
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   ALKALINE-PHOSPHATASE ACTIVITY IN HUMAN BLADDER TUMOR-CELL LINES [J].
BENHAM, F ;
COTTELL, DC ;
FRANKS, LM ;
WILSON, PD .
JOURNAL OF HISTOCHEMISTRY & CYTOCHEMISTRY, 1977, 25 (04) :266-274
[5]   HUMAN CELL-LINES EXPRESSING INTESTINAL ALKALINE-PHOSPHATASE [J].
BENHAM, FJ ;
HARRIS, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1979, 76 (08) :4016-4019
[6]   ALKALINE-PHOSPHATASE EXPRESSION IN HUMAN CELL-LINES DERIVED FROM VARIOUS MALIGNANCIES [J].
BENHAM, FJ ;
FOGH, J ;
HARRIS, H .
INTERNATIONAL JOURNAL OF CANCER, 1981, 27 (05) :637-644
[7]   Do you do text? [J].
Blaschke, C ;
Yeh, A ;
Camon, E ;
Colosimo, M ;
Apweiler, R ;
Hirschman, L ;
Valencia, A .
BIOINFORMATICS, 2005, 21 (23) :4199-4200
[8]  
Chaussabel Damien, 2004, Am J Pharmacogenomics, V4, P383, DOI 10.2165/00129785-200404060-00005
[9]   ENZYMATIC-ACTIVITY OF PROSTATE-SPECIFIC ANTIGEN AND ITS REACTIONS WITH EXTRACELLULAR SERINE PROTEINASE-INHIBITORS [J].
CHRISTENSSON, A ;
LAURELL, CB ;
LILJA, H .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1990, 194 (03) :755-763
[10]   Human glandular kallikrein 2 (hK2) expression in prostatic intraepithelial neoplasia and adenocarcinoma: A novel prostate cancer marker [J].
Darson, MF ;
Pacelli, A ;
Roche, P ;
Rittenhouse, HG ;
Wolfert, RL ;
Young, CYF ;
Klee, GG ;
Tindall, DJ ;
Bostwick, DG .
UROLOGY, 1997, 49 (06) :857-862