Identification of evolutionary hotspots in the rodent genomes

被引:15
作者
Yap, VB [1 ]
Pachter, L [1 ]
机构
[1] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
关键词
D O I
10.1101/gr.1967904
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We describe a whole-genome comparative analysis of the human, Mouse, and rat genomes to describe the average Substitution patterns of four genomic regions: ancient repeats, rodent-specific DNA, exons, and conserved (coding and noncoding) regions, and to identify rodent evolutionary hotspots. In all types of regions, except the rodent-specific DNA, the rat branch is slightly longer than the mouse branch. Moreover, the mouse-rat distance is longer in the rodent-specific DNA than in the ancient repeats. Analysis of individual conserved regions with different substitution models yielded the conclusion that the Jukes-Cantor model is inadequate, and the Hasegawa-Kishino-Yano model is almost as good as the REV model. Using human as an outgroup, we identified 5055 evolutionary hotspots, which are highly conserved subalignment blocks (each consisting of at least 100 aligned sites and a small fraction of gaps) with a large and statistically significant difference in the branch lengths of the rodent species. The Cutoffs used to identify the hotspots are partially based on estimates of the average rates of substitution. The fractions of hotspots overlapping with the rodent RefSeq genes, RefSeq exons, and ESTs are all higher than expected. Still, more than half of the hotspots lie in noncoding regions of the Mouse genome. We believe that the hotspots represent biologically interesting regions in the rodent genomes.
引用
收藏
页码:574 / 579
页数:6
相关论文
共 15 条
[1]   Geometry of the space of phylogenetic trees [J].
Billera, LJ ;
Holmes, SP ;
Vogtmann, K .
ADVANCES IN APPLIED MATHEMATICS, 2001, 27 (04) :733-767
[2]   Phylogenetic shadowing of primate sequences to find functional regions of the human genome [J].
Boffelli, D ;
McAuliffe, J ;
Ovcharenko, D ;
Lewis, KD ;
Ovcharenko, I ;
Pachter, L ;
Rubin, EM .
SCIENCE, 2003, 299 (5611) :1391-1394
[3]   MAVID multiple alignment server [J].
Bray, N ;
Pachter, L .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3525-3526
[4]   Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes [J].
Cooper, GM ;
Brudno, M ;
Green, ED ;
Batzoglou, S ;
Sidow, A .
GENOME RESEARCH, 2003, 13 (05) :813-820
[5]   Active conservation of noncoding sequences revealed by three-way species comparisons [J].
Dubchak, I ;
Brudno, M ;
Loots, GG ;
Pachter, L ;
Mayor, C ;
Rubin, EM ;
Frazer, KA .
GENOME RESEARCH, 2000, 10 (09) :1304-1306
[6]   Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution [J].
Hardison, RC ;
Roskin, KM ;
Yang, S ;
Diekhans, M ;
Kent, WJ ;
Weber, R ;
Elnitski, L ;
Li, J ;
O'Connor, M ;
Kolbe, D ;
Schwartz, S ;
Furey, TS ;
Whelan, S ;
Goldman, N ;
Smit, A ;
Miller, W ;
Chiaromonte, F ;
Haussler, D .
GENOME RESEARCH, 2003, 13 (01) :13-26
[7]   Complete genomic sequence and analysis of the prion protein gene region from three mammalian species [J].
Lee, IY ;
Westaway, D ;
Smit, AFA ;
Wang, K ;
Seto, J ;
Chen, L ;
Acharya, C ;
Ankener, M ;
Baskin, D ;
Cooper, C ;
Yao, H ;
Prusiner, SB ;
Hood, LE .
GENOME RESEARCH, 1998, 8 (10) :1022-1037
[8]   Comparison of genomic DNA sequences: solved and unsolved problems [J].
Miller, W .
BIOINFORMATICS, 2001, 17 (05) :391-397
[9]  
Moret B M, 2001, Bioinformatics, V17 Suppl 1, pS165
[10]  
*RAT GEN SEQ PROJ, 2004, IN PRESS NATURE