Recent de novo origin of human protein-coding genes

被引:214
作者
Knowles, David G. [1 ]
McLysaght, Aoife [1 ]
机构
[1] Univ Dublin Trinity Coll, Smurfit Inst Genet, Dublin 2, Ireland
基金
爱尔兰科学基金会;
关键词
GENOME; DNA; PROTEOMICS; EVOLUTION; SEQUENCE; BIRTH; TOOLS;
D O I
10.1101/gr.095026.109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The origin of new genes is extremely important to evolutionary innovation. Most new genes arise from existing genes through duplication or recombination. The origin of new genes from noncoding DNA is extremely rare, and very few eukaryotic examples are known. We present evidence for the de novo origin of at least three human protein-coding genes since the divergence with chimp. Each of these genes has no protein-coding homologs in any other genome, but is supported by evidence from expression and, importantly, proteomics data. The absence of these genes in chimp and macaque cannot be explained by sequencing gaps or annotation error. High-quality sequence data indicate that these loci are noncoding DNA in other primates. Furthermore, chimp, gorilla, gibbon, and macaque share the same disabling sequence difference, supporting the inference that the ancestral sequence was noncoding over the alternative possibility of parallel gene inactivation in multiple primate lineages. The genes are not well characterized, but interestingly, one of them was first identified as an up-regulated gene in chronic lymphocytic leukemia. This is the first evidence for entirely novel human-specific protein-coding genes originating from ancestrally noncoding sequences. We estimate that 0.075% of human genes may have originated through this mechanism leading to a total expectation of 18 such cases in a genome of 24,000 protein-coding genes.
引用
收藏
页码:1752 / 1759
页数:8
相关论文
共 31 条
[1]   Evidence for de novo evolution of testis-expressed genes in the Drosophila yakuba Drosophila erecta clade [J].
Begun, David J. ;
Lindfors, Heather A. ;
Kern, Andrew D. ;
Jones, Corbin D. .
GENETICS, 2007, 176 (02) :1131-1137
[2]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[3]   Identification of a gene on chromosome 12q22 unique, overexpressed in chronic lymphocytic leukemia [J].
Buhl, AM ;
Jurlander, J ;
Jorgensen, FS ;
Ottesen, AM ;
Cowland, JB ;
Gjerdrum, LM ;
Hansen, BV ;
Leffers, H .
BLOOD, 2006, 107 (07) :2904-2911
[4]   Birth and adaptive evolution of a hominoid gene that supports high neurotransmitter flux [J].
Burki, F ;
Kaessmann, H .
NATURE GENETICS, 2004, 36 (10) :1061-1063
[5]   De novo origination of a new protein-coding gene in Saccharomyces cerevisiae [J].
Cai, Jing ;
Zhao, Ruoping ;
Jiang, Huifeng ;
Wang, Wen .
GENETICS, 2008, 179 (01) :487-496
[6]   The Jalview Java']Java alignment editor [J].
Clamp, M ;
Cuff, J ;
Searle, SM ;
Barton, GJ .
BIOINFORMATICS, 2004, 20 (03) :426-427
[7]   Distinguishing protein-coding and noncoding genes in the human genome [J].
Clamp, Michele ;
Fry, Ben ;
Kamal, Mike ;
Xie, Xiaohui ;
Cuff, James ;
Lin, Michael F. ;
Kellis, Manolis ;
Lindblad-Toh, Kerstin ;
Lander, Eric S. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (49) :19428-19433
[8]   Human plasma PeptideAtlas [J].
Deutsch, EW ;
Eng, JK ;
Zhang, H ;
King, NL ;
Nesvizhskii, AI ;
Lin, BY ;
Lee, HK ;
Yi, EC ;
Ossola, R ;
Aebersold, R .
PROTEOMICS, 2005, 5 (13) :3497-3500
[9]   Extensive gene traffic on the mammalian X chromosome [J].
Emerson, JJ ;
Kaessmann, H ;
Betrán, E ;
Long, MY .
SCIENCE, 2004, 303 (5657) :537-540
[10]   The miniaturized nuclear genome of a eukaryotic endosymbiont contains genes that overlap, genes that are cotranscribed, and the smallest known spliceosomal introns [J].
Gilson, PR ;
McFadden, GI .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (15) :7737-7742