THE DIVERSITY OF A DISTRIBUTED GENOME IN BACTERIAL POPULATIONS

Cited by: 22
Authors
Baumdicker, F. [1 ]
Hess, W. R. [2 ]
Pfaffelhuber, P. [1 ]
Affiliations
[1] Univ Freiburg, Fak Math & Phys, D-79104 Freiburg, Germany
[2] Univ Freiburg, Fak Biol, D-79104 Freiburg, Germany
Keywords
Kingman's coalescent; infinitely many genes model; infinitely many sites model; gene content; ESCHERICHIA-COLI; RECOMBINATION; SEQUENCE
DOI
10.1214/09-AAP657
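Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract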
The distributed genome hypothesis states that the set of genes in a population of bacteria is distributed over all individuals that belong to the specific taxon. It implies that certain genes can be gained and lost from generation to generation. We use the random genealogy given by a Kingman coalescent in order to superimpose events of gene gain and loss along ancestral lines. Gene gains occur at a constant rate along ancestral lines. We assume that gained genes have never been present in the population before. Gene losses occur at a rate proportional to the number of genes present along the ancestral line. In this infinitely many genes model we derive moments for several statistics within a sample: the average number of genes per individual, the average number of genes differing between individuals, the number of incongruent pairs of genes, the total number of different genes in the sample and the gene frequency spectrum. We demonstrate that the model gives a reasonable fit with gene frequency data from marine cyanobacteria.
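The model described in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, the gain rate is taken as theta/2 per lineage and the loss rate as rho/2 per gene (a parameterization chosen here for illustration), and the most recent common ancestor is assumed to carry an equilibrium Poisson(theta/rho) number of genes. Under these assumptions the expected number of genes per individual is theta/rho.

```python
import math
import random
from collections import Counter
from itertools import combinations, count


def poisson(rng, mean):
    """Draw a Poisson variate by counting rate-`mean` arrivals in unit time."""
    if mean <= 0:
        return 0
    n, t = 0, rng.expovariate(mean)
    while t < 1.0:
        n += 1
        t += rng.expovariate(mean)
    return n


def simulate_gene_content(n, theta, rho, seed=None):
    """Return a list of n gene sets, one per sampled individual.

    Assumed rates (illustrative): gains at theta/2 per lineage,
    losses at rho/2 per gene present on the lineage.
    """
    rng = random.Random(seed)

    # Backward in time: Kingman coalescent on n leaves (ids 0..n-1).
    time_of = {i: 0.0 for i in range(n)}
    children = {}
    active = list(range(n))
    t, next_id = 0.0, n
    while len(active) > 1:
        k = len(active)
        t += rng.expovariate(k * (k - 1) / 2)  # each pair merges at rate 1
        a, b = rng.sample(active, 2)
        active.remove(a)
        active.remove(b)
        time_of[next_id] = t
        children[next_id] = (a, b)
        active.append(next_id)
        next_id += 1
    root = active[0]

    # Forward in time: superimpose gene gain and loss along the branches.
    new_gene = count()  # every gained gene gets a label never used before
    genes = {root: {next(new_gene) for _ in range(poisson(rng, theta / rho))}}
    stack = [root]
    while stack:
        parent = stack.pop()
        for child in children.get(parent, ()):
            length = time_of[parent] - time_of[child]
            keep = math.exp(-rho / 2 * length)  # P(a gene survives the branch)
            inherited = {g for g in genes[parent] if rng.random() < keep}
            # Genes gained on the branch that are still present at its end.
            gained = poisson(rng, theta / rho * (1 - keep))
            genes[child] = inherited | {next(new_gene) for _ in range(gained)}
            stack.append(child)
    return [genes[i] for i in range(n)]


def summarize(sample):
    """Compute sample statistics of the kind listed in the abstract."""
    n = len(sample)
    pan = set().union(*sample)
    pairs = list(combinations(sample, 2))
    return {
        "avg_genes": sum(len(s) for s in sample) / n,
        "avg_pairwise_diff": (
            sum(len(a ^ b) for a, b in pairs) / len(pairs) if pairs else 0.0
        ),
        "total_genes": len(pan),
        # spectrum[k] = number of genes present in exactly k individuals
        "spectrum": Counter(sum(g in s for s in sample) for g in pan),
    }


if __name__ == "__main__":
    stats = summarize(simulate_gene_content(n=9, theta=2000.0, rho=2.0, seed=1))
    print(stats["avg_genes"], stats["total_genes"])
```

The forward pass draws only the genes that survive to the end of each branch, which avoids simulating individual gain and loss events; this shortcut is a design choice of the sketch. The number of incongruent pairs of genes, also mentioned in the abstract, is not computed here.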
Pages: 1567-1606
Number of pages: 40