Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

被引:64
作者
Jorgensen, FG [1 ]
Hobolth, A
Hornshoj, H
Bendixen, C
Fredholm, M
Schierup, MH
机构
[1] Univ Aarhus, Dept Ecol & Genet, Aarhus C, Denmark
[2] Univ Aarhus, Bioinformat Res Ctr, Aarhus C, Denmark
[3] Danish Inst Agr Sci, Dept Genet & Biotechnol, Tjele, Denmark
[4] KVL, Dept Anim Sci & Anim Hlth, Frederiksberg C, Denmark
基金
美国国家科学基金会;
关键词
D O I
10.1186/1741-7007-3-2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu rubrices in order to investigate 1) the relationships between three major lineages of mammals: rodents, artiodactyls and primates, and 2) the rate of evolution and the occurrence of positive Darwinian selection using codon based models of sequence evolution. Results: We provide evidence that the evolutionary splits among primates, rodents and artiodactyls happened shortly after each other, with most gene trees favouring a topology with rodents as outgroup to primates and artiodactyls. Using an unrooted topology of the three mammalian species we show that since their diversification, the pig and mouse lineages have on average experienced 1.44 and 2.86 times as many synonymous substitutions as humans, respectively, whereas the rates of non-synonymous substitutions are more similar. The analysis shows the highest average dN/dS ratio in the human lineage, followed by the pig and then the mouse lineages. Using codon based models we detect signals of positive Darwinian selection in approximately 5.3%, 4.9% and 6.0% of the genes on the human, pig and mouse lineages respectively. Approximately 16.8% of all the genes studied here are not currently annotated as functional genes in humans. Our analyses indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species. Conclusions: We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies.
引用
收藏
页数:15
相关论文
共 49 条
[41]   Initial sequencing and comparative analysis of the mouse genome [J].
Waterston, RH ;
Lindblad-Toh, K ;
Birney, E ;
Rogers, J ;
Abril, JF ;
Agarwal, P ;
Agarwala, R ;
Ainscough, R ;
Alexandersson, M ;
An, P ;
Antonarakis, SE ;
Attwood, J ;
Baertsch, R ;
Bailey, J ;
Barlow, K ;
Beck, S ;
Berry, E ;
Birren, B ;
Bloom, T ;
Bork, P ;
Botcherby, M ;
Bray, N ;
Brent, MR ;
Brown, DG ;
Brown, SD ;
Bult, C ;
Burton, J ;
Butler, J ;
Campbell, RD ;
Carninci, P ;
Cawley, S ;
Chiaromonte, F ;
Chinwalla, AT ;
Church, DM ;
Clamp, M ;
Clee, C ;
Collins, FS ;
Cook, LL ;
Copley, RR ;
Coulson, A ;
Couronne, O ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Daly, M ;
David, R ;
Davies, J ;
Delehaunty, KD ;
Deri, J ;
Dermitzakis, ET .
NATURE, 2002, 420 (6915) :520-562
[42]   A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach [J].
Whelan, S ;
Goldman, N .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (05) :691-699
[43]   Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages [J].
Yang, ZH ;
Nielsen, R .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (06) :908-917
[44]  
Yang ZH, 2000, GENETICS, V155, P431
[45]   Inference of selection from multiple species alignments [J].
Yang, ZH .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2002, 12 (06) :688-694
[46]  
Yang ZH, 1997, COMPUT APPL BIOSCI, V13, P555
[47]   Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution [J].
Yang, ZH .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (05) :568-573
[48]  
Zanotto PMD, 1999, GENETICS, V153, P1077
[49]   Frequent false detection of positive selection by the likelihood method with branch-site models [J].
Zhang, JZ .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (07) :1332-1339