Ancient duplicated conserved noncoding elements in vertebrates: A genomic and functional analysis

Cited by: 79
Authors
McEwen, GK
Woolfe, A
Goode, D
Vavouri, T
Callaway, H
Elgar, G [1]
Affiliations
[1] Queen Mary Univ London, Sch Biol & Chem Sci, London E1 4NS, England
[2] Wellcome Trust Sanger Inst, Cambridge CB10 1SB, England
[3] MRC, Biostat Unit, Inst Publ Hlth, Cambridge CB2 2SR, England
Funding
Medical Research Council (UK)
DOI
10.1101/gr.4143406
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Fish-mammal genomic comparisons have proved powerful in identifying conserved noncoding elements likely to be cis-regulatory in nature, and the majority of those tested in vivo have been shown to act as tissue-specific enhancers associated with genes involved in transcriptional regulation of development. Although most of these elements share little sequence identity to each other, a small number are remarkably similar and appear to be the product of duplication events. Here, we searched for duplicated conserved noncoding elements in the human genome, using comparisons with Fugu to select putative cis-regulatory sequences. We identified 124 families of duplicated elements, each containing between two and five members, that are highly conserved within and between vertebrate genomes. In 74% of cases, we were able to assign a specific set of paralogous genes with annotation relating to transcriptional regulation and/or development to each family, thus removing much of the ambiguity in identifying associated genes. We find that duplicate elements have the potential to up-regulate reporter gene expression in a tissue-specific manner and that expression domains often overlap, but are not necessarily identical, between family members. Over two thirds of the families are conserved in duplicate in fish and appear to predate the large-scale duplication events thought to have occurred at the origin of vertebrates. We propose a model whereby gene duplication and the evolution of cis-regulatory elements can be considered in the context of increased morphological diversity and the emergence of the modern vertebrate body plan.
Pages: 451-465
Page count: 15