Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data

被引:22
作者
Maglietta, R.
Piepoli, A.
Catalano, D.
Licciulli, F.
Carella, M.
Liuni, S.
Pesole, G.
Perri, F.
Ancona, N.
机构
[1] CNR, Ist Studi Sistemi Intelligent Automa, I-70126 Bari, Italy
[2] Casa Sollievo Sofferenza Osped, IRCCS, Unit Operat Gastroenterol, I-71013 San Giovanni Rotonda, Italy
[3] CNR, Ist Technol Biomed, Sez Bari, I-70126 Bari, Italy
[4] Casa Sollievo Sofferenza Osped, IRCCS, Serv Genet Med, I-71013 San Giovanni Rotonda, Italy
[5] Univ Bari, Dipartimento Biochim & Biol Mol, I-70126 Bari, Italy
关键词
D O I
10.1093/bioinformatics/btm289
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A major challenge in current biomedical research is the identification of cellular processes deregulated in a given pathology through the analysis of gene expression profiles. To this end, predefined lists of genes, coding specific functions, are compared with a list of genes ordered according to their values of differential expression measured by suitable univariate statistics. Results: We propose a statistically well-founded method for measuring the relevance of predefined lists of genes and for assessing their statistical significance starting from their raw expression levels as recorded on the microarray. We use prediction accuracy as a measure of relevance of the list. The rationale is that a functional category, coded through a list of genes, is perturbed in a given pathology if it is possible to correctly predict the occurrence of the disease in new subjects on the basis of the expression levels of the genes belonging to the list only. The accuracy is estimated with multiple random validation strategy and its statistical significance is assessed against a couple of null hypothesis, by using two independent permutation tests. The utility of the proposed methodology is illustrated by analyzing the relevance of Gene Ontology terms belonging to biological process category in colon and prostate cancer, by using three different microarray data sets and by comparing it with current approaches. Availability: Source code for the algorithms is available from author upon request. Contact: ancona@ba.issia.cnr.it Supplementary information: Colon cancer data set and a complete description of experimental results are available at: ftp://bioftp:76bioftpxxx@marx.ba.issia.cnr.it/supp-info.htm
引用
收藏
页码:2063 / 2072
页数:10
相关论文
共 48 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
ANAND GM, 2002, LAB INVEST, V82, P1725
[3]   Regularized Least Squares cancer classifiers from DNA microarray data [J].
Ancona, N ;
Maglietta, R ;
D'Addabbo, A ;
Liuni, S ;
Pesole, G .
BMC BIOINFORMATICS, 2005, 6 (Suppl 4)
[4]   On the statistical assessment of classifiers using DNA microarray data [J].
Ancona, N. ;
Maglietta, R. ;
Piepoli, A. ;
D'Addabbo, A. ;
Cotugno, R. ;
Savino, M. ;
Liuni, S. ;
Carella, M. ;
Pesole, G. ;
Perri, F. .
BMC BIOINFORMATICS, 2006, 7 (1)
[5]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[6]   Stage II colon cancer prognosis prediction by tumor gene expression profiling [J].
Barrier, Alain ;
Boelle, Pierre-Yves ;
Roser, Francois ;
Gregg, Jennifer ;
Tse, Chantal ;
Brault, Didier ;
Lacaine, Francois ;
Houry, Sidney ;
Huguier, Michel ;
Franc, Brigitte ;
Flahault, Antoine ;
Lemoine, Antoinette ;
Dudoit, Sandrine .
JOURNAL OF CLINICAL ONCOLOGY, 2006, 24 (29) :4685-4691
[7]   Significance analysis of functional categories in gene expression studies: a structured permutation approach [J].
Barry, WT ;
Nobel, AB ;
Wright, FA .
BIOINFORMATICS, 2005, 21 (09) :1943-1949
[8]   Mechanisms linking diet and colorectal cancer: The possible role of insulin resistance [J].
Bruce, WR ;
Wolever, TMS ;
Giacca, A .
NUTRITION AND CANCER-AN INTERNATIONAL JOURNAL, 2000, 37 (01) :19-26
[9]  
Bruce WR, 2000, CANCER EPIDEM BIOMAR, V9, P1271
[10]   Overexpression of fatty acid synthase in ulcerative colitis [J].
Consolazio, A ;
Alò, PL ;
Rivera, M ;
Iacopini, F ;
Paoluzi, OA ;
Crispino, P ;
Pica, R ;
Paoluzi, P .
AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2006, 126 (01) :113-118