Waiting for regulatory sequences to appear

Cited by: 22
Authors
Durrett, Richard
Schmidt, Deena
Affiliations
[1] Cornell Univ, Dept Math, Ithaca, NY 14853 USA
[2] Cornell Univ, Ctr Appl Math, Ithaca, NY 14853 USA
Keywords
regulatory sequence; population genetics; Moran model; Poisson approximation; clumping heuristic; FACTOR-BINDING SITES; EVOLUTION; SELECTION
DOI
10.1214/105051606000000619
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
One possible explanation for the substantial organismal differences between humans and chimpanzees is that there have been changes in gene regulation. Given what is known about transcription factor binding sites, this motivates the following probability question: given a 1000 nucleotide region in our genome, how long does it take for a specified six to nine letter word to appear in that region in some individual? Stone and Wray [Mol. Biol. Evol. 18 (2001) 1764-1770] computed 5,950 years as the answer for six letter words. Here, we will show that for words of length 6, the average waiting time is 100,000 years, while for words of length 8, the waiting time has mean 375,000 years when there is a 7 out of 8 letter match in the population consensus sequence (an event of probability roughly 5/16) and has mean 650 million years when there is not. Fortunately, in biological reality, the match to the target word does not have to be perfect for binding to occur. If we model this by saying that a 7 out of 8 letter match is good enough, the mean reduces to about 60,000 years.
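The "probability roughly 5/16" quoted in the abstract can be sanity-checked with a back-of-the-envelope Poisson estimate. The sketch below is not the authors' derivation: it assumes nucleotides are independent and uniform over four letters, counts windows with exactly one mismatch against the target word, and ignores dependence between overlapping windows. The function name and parameters are illustrative.

```python
from math import comb, exp


def near_match_probability(region_len: int = 1000,
                           word_len: int = 8,
                           mismatches: int = 1) -> float:
    """Poisson estimate of the chance that at least one window in the
    region differs from the target word in exactly `mismatches` letters.

    Assumption (not from the source): letters are i.i.d. uniform on
    {A, C, G, T}, and overlapping windows are treated as independent.
    """
    # Number of starting positions for a word of this length.
    windows = region_len - word_len + 1
    # Choose which positions mismatch, 3 wrong letters for each of them.
    p_window = comb(word_len, mismatches) * 3 ** mismatches / 4 ** word_len
    # Expected number of qualifying windows, then P(at least one).
    expected = windows * p_window
    return 1.0 - exp(-expected)


estimate = near_match_probability()
print(f"Poisson estimate: {estimate:.3f}; 5/16 = {5 / 16:.3f}")
```

Under these assumptions the estimate lands close to 5/16, which is consistent with the abstract's "roughly"; a more careful treatment would account for the clumping of overlapping matches.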
Pages: 1-32
Number of pages: 32