Bayesian analysis of population structure based on linked molecular information

Cited by: 206
Authors
Corander, Jukka [1]
Tang, Jing [1]
Affiliations
[1] Univ Helsinki, Dept Math & Stat, FIN-00014 Helsinki, Finland
Funding
Academy of Finland
Keywords
Bayesian analysis; genetic structure; linked loci; sequence information; unsupervised classification;
DOI
10.1016/j.mbs.2006.09.015
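Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07 [Science]; 0710 [Biology]; 09 [Agriculture]
Abstract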
The Bayesian model-based approach to inferring hidden genetic population structures using multilocus molecular markers has become a popular tool within certain branches of biology. In particular, it has been shown that heterogeneous data arising from genetically dissimilar latent groups of individuals can be effectively modelled using an unsupervised classification formulation. However, most currently employed models ignore potential linkage within the employed molecular information, and can therefore lead to biased inferences under certain circumstances. Utilizing the general theory of graphical models, we develop a framework that accounts for dependences both within linked molecular marker loci and DNA sequence data. Due to a high level of sequence conservation among eukaryotic species, the latter aspect is particularly relevant for analyzing rapidly evolving microbial species. The advantages of incorporating the dependence due to linkage in the classification models are illustrated by analyses of both simulated data and real samples of Bacillus cereus. (c) 2006 Elsevier Inc. All rights reserved.
Pages: 19-31
Number of pages: 13