Retrocopy contributions to the evolution of the human genome

Cited by: 82
Authors
Baertsch, Robert [1]
Diekhans, Mark [1]
Kent, W. James [1]
Haussler, David [1]
Brosius, Juergen [2]
Affiliations
[1] Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Santa Cruz, CA 95064 USA
[2] Univ Munster, ZMBE, Inst Expt Pathol, D-48149 Munster, Germany
DOI
10.1186/1471-2164-9-466
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005; 0836; 090102; 100705
Abstract
Background: Evolution via point mutations is a relatively slow process and is unlikely to completely explain the differences between primates and other mammals. By contrast, 45% of the human genome is composed of retroposed elements, many of which were inserted in the primate lineage. A subset of retroposed mRNAs (retrocopies) shows strong evidence of expression in primates, often yielding functional retrogenes.
Results: To identify and analyze the relatively recently evolved retrogenes, we carried out BLASTZ alignments of all human mRNAs against the human genome and scored a set of features indicative of retroposition. Of over 12,000 putative retrocopy-derived genes that arose mainly in the primate lineage, 726 with strong evidence of transcript expression were examined in detail. These mRNA retroposition events fall into three categories: I) 34 retrocopies and antisense retrocopies that added potential protein coding space and UTRs to existing genes; II) 682 complete retrocopy duplications inserted into new loci; and III) an unexpected set of 13 retrocopies that contributed out-of-frame or antisense sequences, in combination with other types of transposed elements (SINEs, LINEs, LTRs) and even unannotated sequence, to form potentially novel genes with no homologs outside primates. In addition to their presence in human, several of the gene candidates also had potentially viable ORFs in chimpanzee, orangutan, and rhesus macaque, underscoring their potential for function.
Conclusion: mRNA-derived retrocopies provide raw material for the evolution of genes in a wide variety of ways, duplicating and amending the protein coding region of existing genes as well as generating the potential for new protein coding space, or non-protein coding RNAs, by unexpected contributions out of frame, in reverse orientation, or from previously non-protein coding sequence.
Pages: 19