Exonic remnants of whole-genome duplication reveal cis-regulatory function of coding exons

被引:40
作者
Dong, Xianjun [1 ,2 ]
Navratilova, Pavla [2 ]
Fredman, David [2 ]
Drivenes, Oyvind [2 ]
Becker, Thomas S. [2 ]
Lenhard, Boris [1 ,2 ]
机构
[1] Univ Bergen, Computat Biol Unit, Bergen Ctr Computat Sci, N-5008 Bergen, Norway
[2] Univ Bergen, Sars Ctr Marine Mol Biol, N-5008 Bergen, Norway
关键词
CONSERVED NONCODING ELEMENTS; RNA SELECTION PRESSURE; ULTRACONSERVED ELEMENTS; MESSENGER-RNA; KA/KS RATIO; REGION; GENE; TRANSCRIPTION; ENHANCER; EXPRESSION;
D O I
10.1093/nar/gkp1124
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Using a comparative genomics approach to reconstruct the fate of genomic regulatory blocks (GRBs) and identify exonic remnants that have survived the disappearance of their host genes after whole-genome duplication (WGD) in teleosts, we discover a set of 38 candidate cis-regulatory coding exons (RCEs) with predicted target genes. These elements demonstrate evolutionary separation of overlapping protein-coding and regulatory information after WGD in teleosts. We present evidence that the corresponding mammalian exons are still under both coding and non-coding selection pressure, are more conserved than other protein coding exons in the host gene and several control sets, and share key characteristics with highly conserved non-coding elements in the same regions. Their dual function is corroborated by existing experimental data. Additionally, we show examples of human exon remnants stemming from the vertebrate 2R WGD. Our findings suggest that long-range cis-regulatory inputs for developmental genes are not limited to non-coding regions, but can also overlap the coding sequence of unrelated genes. Thus, exonic regulatory elements in GRBs might be functionally equivalent to those in non-coding regions, calling for a re-evaluation of the sequence space in which to look for long-range regulatory elements and experimentally test their activity.
引用
收藏
页码:1071 / 1085
页数:15
相关论文
共 63 条
[41]   Functional noncoding sequences derived from SINEs in the mammalian genome [J].
Nishihara, Hidenori ;
Smit, Arian F. A. ;
Okada, Norihiro .
GENOME RESEARCH, 2006, 16 (07) :864-874
[42]   Mutational inactivation of two distinct negative RNA elements in the human papillomavirus type 16 L2 coding region induces production of high levels of L2 in human cells [J].
Öberg, D ;
Collier, B ;
Zhao, XM ;
Schwartz, S .
JOURNAL OF VIROLOGY, 2003, 77 (21) :11674-11684
[43]  
Ohno S., 1970, Evolution by gene duplication
[44]   In vivo enhancer analysis of human conserved non-coding sequences [J].
Pennacchio, Len A. ;
Ahituv, Nadav ;
Moses, Alan M. ;
Prabhakar, Shyam ;
Nobrega, Marcelo A. ;
Shoukry, Malak ;
Minovitsky, Simon ;
Dubchak, Inna ;
Holt, Amy ;
Lewis, Keith D. ;
Plajzer-Frick, Ingrid ;
Akiyama, Jennifer ;
De Val, Sarah ;
Afzal, Veena ;
Black, Brian L. ;
Couronne, Olivier ;
Eisen, Michael B. ;
Visel, Axel ;
Rubin, Edward M. .
NATURE, 2006, 444 (7118) :499-502
[45]   Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes -: art. no. 99 [J].
Sandelin, A ;
Bailey, P ;
Bruce, S ;
Engström, PG ;
Klos, JM ;
Wasserman, WW ;
Ericson, J ;
Lenhard, B .
BMC GENOMICS, 2004, 5 (1)
[46]   Constrained binding site diversity within families of transcription factors enhances pattern discovery bioinformatics [J].
Sandelin, A ;
Wasserman, WW .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 338 (02) :207-215
[47]  
Siepel A, 2006, LECT NOTES COMPUT SC, V3909, P190
[48]   mRNA instability elements in the human papillomavirus type 16 L2 coding region [J].
Sokolowski, M ;
Tan, W ;
Jellne, M ;
Schwartz, S .
JOURNAL OF VIROLOGY, 1998, 72 (02) :1504-1515
[49]   Large-scale appearance of ultraconserved elements in tetrapod genomes and slowdown of the molecular clock [J].
Stephen, Stuart ;
Pheasant, Michael ;
Makunin, Igor V. ;
Mattick, John S. .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (02) :402-408
[50]   oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes [J].
Sui, SJH ;
Mortimer, JR ;
Arenillas, DJ ;
Brumm, J ;
Walsh, CJ ;
Kennedy, BP ;
Wasserman, WW .
NUCLEIC ACIDS RESEARCH, 2005, 33 (10) :3154-3164