Assessing optimal neural network architecture for identifying disease-associated multi-marker genotypes using a permutation test, and application to calpain 10 polymorphisms associated with diabetes

Cited by: 33
Authors
North, BV
Curtis, D
Cassell, PG
Hitman, GA
Sham, PC
Affiliations
[1] Barts & London Queen Marys Sch Med & Dent, Acad Dept Psychiat, London E1 1BB, England
[2] Barts & London Queen Marys Sch Med & Dent, Acad Dept Diabet & Metab Med, London E1 1BB, England
[3] Inst Psychiat, Dept Psychol Med, London SE5 8AF, England
DOI
10.1046/j.1469-1809.2003.00030.x
Chinese Library Classification (CLC) number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Biallelic markers, such as single nucleotide polymorphisms (SNPs), provide greater information for localising disease loci when treated as multilocus haplotypes, but often haplotypes are not immediately available from multilocus genotypes in case-control studies. An artificial neural network allows investigation of association between disease phenotype and tightly linked markers without requiring haplotype phase and without modelling any evolutionary history for the disease-related haplotypes. The network assesses whether marker haplotypes differ between cases and controls to the extent that classification of disease status based on multi-marker genotypes is achievable. The network is "trained" to "recognise" affection status based on supplied marker genotypes, and then for each multi-marker genotype it produces outputs which aim to approximate the associated affection status. Next, the genotypes are permuted relative to affection status to produce many random datasets and the process of training and recording of outputs is repeated. The extent to which the ability to predict affection for the real dataset exceeds that for the random datasets measures the statistical significance of the association between multi-marker genotype and affection. This permutation test performs well with simulated case-control datasets, particularly when major gene effects are present. We have explored the effects of systematically varying different network parameters in order to identify their optimal values. We have applied the permutation test to four SNPs of the calpain 10 (CAPN10) gene typed in a case-control sample of subjects with type 2 diabetes, impaired glucose tolerance, and controls. We show that the neural network produces more highly significant evidence for association than do single marker tests corrected for the number of markers genotyped.
The use of a permutation test could potentially allow conditional analyses which could incorporate known risk factors alongside marker genotypes. Permuting only the marker genotypes relative to affection status and these risk factors would allow the contribution of the markers to disease risk to be independently assessed.
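The permutation procedure described in the abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: a single logistic unit stands in for the neural network, the fit statistic (negative mean squared error of the outputs against affection status) is an assumed choice, genotypes are assumed to be coded 0/1/2 per marker, and the names `fit_statistic` and `permutation_p_value` are hypothetical. The empirical p-value uses the common (r + 1) / (n + 1) convention.

```python
import math
import random


def _predict(w, b, x):
    """Output of a single logistic unit for one multi-marker genotype."""
    z = b + sum(wj * xj for wj, xj in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))


def fit_statistic(X, y, epochs=200, lr=0.1):
    """Train on (X, y), then score how well outputs match affection status.

    Assumption: a logistic unit trained by batch gradient descent replaces
    the paper's network. Returns negative mean squared error, so a higher
    value means a better fit.
    """
    n, k = len(X), len(X[0])
    w, b = [0.0] * k, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * k, 0.0
        for x, t in zip(X, y):
            err = _predict(w, b, x) - t  # cross-entropy gradient term
            grad_b += err
            for j in range(k):
                grad_w[j] += err * x[j]
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return -sum((_predict(w, b, x) - t) ** 2 for x, t in zip(X, y)) / n


def permutation_p_value(X, y, n_perm=200, seed=0):
    """Empirical p-value for association between genotypes and affection."""
    rng = random.Random(seed)
    observed = fit_statistic(X, y)
    rows = list(X)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(rows)  # permute genotypes relative to affection status
        if fit_statistic(rows, y) >= observed:
            exceed += 1
    # (r + 1) / (n + 1) keeps the estimate strictly above zero
    return (exceed + 1) / (n_perm + 1)


# Toy data (fabricated purely for illustration): the first marker separates
# cases (status 1) from controls (status 0); the other three carry no signal.
genotypes = [[2, 1, 0, 1]] * 20 + [[0, 1, 0, 1]] * 20
status = [1] * 20 + [0] * 20
p = permutation_p_value(genotypes, status, n_perm=50)
print(f"empirical p-value: {p:.4f}")
```

For the conditional analysis mentioned above, one possible adaptation (again an assumption) would append the known risk factors as extra columns that stay paired with affection status, shuffling only the marker columns.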
Pages: 348-356
Number of pages: 9