Identification of gene transcript signatures predictive for estrogen receptor and lymph node status using a stepwise forward selection artificial neural network modelling approach

被引:41
作者
Lancashire, Lee J. [1 ,2 ]
Rees, Robert C. [2 ]
Ball, Graham R. [2 ]
机构
[1] Univ Manchester, Paterson Inst Canc Res, Manchester M20 4BX, Lancs, England
[2] Nottingham Trent Univ, Sch Biomed & Nat Sci, Nottingham NG11 8NS, England
关键词
artificial neural networks; predictive modelling; gene expression; breast cancer;
D O I
10.1016/j.artmed.2008.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The advent of microarrays has attracted considerable interest from biologists due to the potential for high throughput analysis of hundreds of thousands of gene transcripts. Subsequent analysis of the data may identify specific features which correspond to characteristics of interest within the population, for example, analysis of gene expression profiles in cancer patients to identify molecular signatures corresponding with prognostic outcome. These high throughput technologies have resulted in an unprecedented rate of data generation, often of high complexity, highlighting the need for novel data analysis methodologies that will cope with data of this nature. Methods: Stepwise methods using artificial neural networks (ANNs) have been developed to identify an optimal subset of predictive gene transcripts from highly dimensional microarray data. Here these methods have been applied to a gene microarray dataset to identify and validate gene signatures corresponding with estrogen receptor and lymph node status in breast cancer. Results: Many gene transcripts were identified whose expression could differentiate patients to very high accuracies based upon firstly whether they were positive or negative for estrogen receptor, and secondly whether metastasis to the axillary lymph node had occurred. A number of these genes had been previously reported to have a role in cancer. Significantly fewer genes were used compared to other previous studies. The models using the optimal gene subsets were internally validated using an extensive random sample cross-validation procedure and externally validated using a follow up dataset from a different cohort of patients on a newer array chip containing the same and additional probe sets. Here, the models retained high accuracies, emphasising the potential power of this approach in analysing complex systems. These findings show how the proposed method allows for the rapid analysis and subsequent detailed interrogation of gene expression signatures to provide a further understanding of the underlying molecular mechanisms that could be important in determining novel prognostic markers associated with cancer. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:99 / 111
页数:13
相关论文
共 48 条
[1]   Predicting biomarkers for ovarian cancer using gene-expression microarrays [J].
Adib, TR ;
Henderson, S ;
Perrett, C ;
Hewitt, D ;
Bourmpoulia, D ;
Ledermann, J ;
Boshoff, C .
BRITISH JOURNAL OF CANCER, 2004, 90 (03) :686-692
[2]  
[Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
[3]  
[Anonymous], 1997, Machine Learning
[4]   An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers [J].
Ball, G ;
Mian, S ;
Holding, F ;
Allibone, RO ;
Lowe, J ;
Ali, S ;
Li, G ;
McCardle, S ;
Ellis, IO ;
Creaser, C ;
Rees, RC .
BIOINFORMATICS, 2002, 18 (03) :395-404
[5]   Gene-expression profiles predict survival of patients with lung adenocarcinoma [J].
Beer, DG ;
Kardia, SLR ;
Huang, CC ;
Giordano, TJ ;
Levin, AM ;
Misek, DE ;
Lin, L ;
Chen, GA ;
Gharib, TG ;
Thomas, DG ;
Lizyness, ML ;
Kuick, R ;
Hayasaka, S ;
Taylor, JMG ;
Iannettoni, MD ;
Orringer, MB ;
Hanash, S .
NATURE MEDICINE, 2002, 8 (08) :816-824
[6]  
BERRY MW, 2004, P 4 SIAM INT C DAT M, P517
[7]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[8]   Relationship between intratumoral expression of genes coding for xenobiotic-metabolizing enzymes and benefit from adjuvant tamoxifen in estrogen receptor alpha-positive postmenopausal breast carcinoma [J].
Bièche, I ;
Girault, I ;
Urbain, E ;
Tozlu, S ;
Lidereau, R .
BREAST CANCER RESEARCH, 2004, 6 (03) :R252-R263
[9]  
Bishop CM., 1995, Neural networks for pattern recognition
[10]  
Calin G, 1998, CANCER RES, V58, P3777