A novel approach to simulate gene-environment interactions in complex diseases

被引:16
作者
Amato, Roberto [1 ,2 ]
Pinelli, Michele [1 ,3 ]
D'Andrea, Daniel [1 ]
Miele, Gennaro [1 ,2 ,4 ]
Nicodemi, Mario [1 ,4 ,5 ,6 ]
Raiconi, Giancarlo [1 ,7 ]
Cocozza, Sergio [1 ,3 ]
机构
[1] Univ Salerno, Univ Napoli Federico II, Grp Interdipartimentale Bioinformat & Biol Comput, Salerno, Italy
[2] Univ Naples Federico II, Dipartimento Sci Fis, Naples, Italy
[3] Dipartimento Biol & Patol Cellulare & Mol L Calif, Naples, Italy
[4] Ist Nazl Fis Nucl, Sez Napoli, I-80125 Naples, Italy
[5] Univ Warwick, Dept Phys, Coventry CV4 7AL, W Midlands, England
[6] Univ Warwick, Complex Sci Ctr, Coventry CV4 7AL, W Midlands, England
[7] Univ Salerno, Dipartimento Matemat & Informat, Fisciano, SA, Italy
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
COMPUTER-SIMULATION; PROPOSED LINKAGE; POWER;
D O I
10.1186/1471-2105-11-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Complex diseases are multifactorial traits caused by both genetic and environmental factors. They represent the major part of human diseases and include those with largest prevalence and mortality (cancer, heart disease, obesity, etc.). Despite a large amount of information that has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in epidemiological literature. One reason can be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this aim, a possible strategy is to challenge the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, for example simulated ones. Results: We present a mathematical approach that models gene-environment interactions. By this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS), a tool designed to simulate case-control data sets where a one gene-one environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints to make the simulator behaviour biologically meaningful. Conclusions: By the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex disease where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population and a Monte Carlo process allows random variability. A knowledge-based approach reduces the complexity of the mathematical model by using reasonable biological constraints and makes the simulation more understandable in biological terms. Simulated data sets can be used for the assessment of novel statistical methods or for the evaluation of the statistical power when designing a study.
引用
收藏
页数:9
相关论文
共 28 条
[1]  
[Anonymous], NATURE
[2]  
[Anonymous], WILEY SERIES PROBABI
[3]  
BOEHNKE M, 1986, AM J HUM GENET, V39, P513
[4]   Simulation of genomes:: A review [J].
Carvajal-Rodriguez, Antonio .
CURRENT GENOMICS, 2008, 9 (03) :155-159
[5]  
García-Closas M, 1999, AM J EPIDEMIOL, V149, P689
[6]   Backward simulation of ancestors of sampled individuals [J].
Gasbarra, D ;
Sillanpää, MJ ;
Arjas, E .
THEORETICAL POPULATION BIOLOGY, 2005, 67 (02) :75-83
[7]   Sample size requirements for matched case-control studies of gene-environment interaction [J].
Gauderman, WJ .
STATISTICS IN MEDICINE, 2002, 21 (01) :35-50
[8]  
GROUP TGC, 2007, NAT GENET, V39, P1045
[9]   Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions [J].
Hahn, LW ;
Ritchie, MD ;
Moore, JH .
BIOINFORMATICS, 2003, 19 (03) :376-382
[10]  
Haldane J. B. S., 1946, ANN EUGENICS LOND, V13, P197