Genome-wide characterization of centromeric satellites from multiple mammalian genomes

被引:68
作者
Alkan, Can [1 ]
Cardone, Maria Francesca [2 ]
Catacchio, Claudia Rita [2 ]
Antonacci, Francesca [1 ]
O'Brien, Stephen J. [3 ]
Ryder, Oliver A. [4 ]
Purgato, Stefania [5 ]
Zoli, Monica [5 ]
Della Valle, Giuliano [5 ]
Eichler, Evan E. [1 ]
Ventura, Mario [1 ,2 ]
机构
[1] Univ Washington, Howard Hughes Med Inst, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Bari, Dept Genet & Microbiol, I-70126 Bari, Italy
[3] NCI Frederick, Lab Genom Divers, Frederick, MD 21702 USA
[4] Zool Soc San Diego, San Diego, CA 92112 USA
[5] Univ Bologna, Dipartimento Biol Evoluzionist Sperimentale, I-40126 Bologna, Italy
关键词
CENP-B; DNA-SEQUENCES; X-CHROMOSOME; MONODELPHIS-DOMESTICA; REGIONS; EVOLUTION; PHYLOGENY; PRIMATES; PROTEIN; ORGANIZATION;
D O I
10.1101/gr.111278.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
引用
收藏
页码:137 / 145
页数:9
相关论文
共 59 条
[31]   Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences [J].
Mikkelsen, Tarjei S. ;
Wakefield, Matthew J. ;
Aken, Bronwen ;
Amemiya, Chris T. ;
Chang, Jean L. ;
Duke, Shannon ;
Garber, Manuel ;
Gentles, Andrew J. ;
Goodstadt, Leo ;
Heger, Andreas ;
Jurka, Jerzy ;
Kamal, Michael ;
Mauceli, Evan ;
Searle, Stephen M. J. ;
Sharpe, Ted ;
Baker, Michelle L. ;
Batzer, Mark A. ;
Benos, Panayiotis V. ;
Belov, Katherine ;
Clamp, Michele ;
Cook, April ;
Cuff, James ;
Das, Radhika ;
Davidow, Lance ;
Deakin, Janine E. ;
Fazzari, Melissa J. ;
Glass, Jacob L. ;
Grabherr, Manfred ;
Greally, John M. ;
Gu, Wanjun ;
Hore, Timothy A. ;
Huttley, Gavin A. ;
Kleber, Michael ;
Jirtle, Randy L. ;
Koina, Edda ;
Lee, Jeannie T. ;
Mahony, Shaun ;
Marra, Marco A. ;
Miller, Robert D. ;
Nicholls, Robert D. ;
Oda, Mayumi ;
Papenfuss, Anthony T. ;
Parra, Zuly E. ;
Pollock, David D. ;
Ray, David A. ;
Schein, Jacqueline E. ;
Speed, Terence P. ;
Thompson, Katherine ;
VandeBerg, John L. ;
Wade, Claire M. .
NATURE, 2007, 447 (7141) :167-U1
[32]   Centromere assembly and propagation [J].
Morris, Corey A. ;
Moazed, Danesh .
CELL, 2007, 128 (04) :647-650
[33]   CENTROMERE PROTEIN-B ASSEMBLES HUMAN CENTROMERIC ALPHA-SATELLITE DNA AT THE 17-BP SEQUENCE, CENP-B BOX [J].
MURO, Y ;
MASUMOTO, H ;
YODA, K ;
NOZAKI, N ;
OHASHI, M ;
OKAZAKI, T .
JOURNAL OF CELL BIOLOGY, 1992, 116 (03) :585-596
[34]   CENTROMERE - HUB OF CHROMOSOMAL ACTIVITIES [J].
PLUTA, AF ;
MACKAY, AM ;
AINSZTEIN, AM ;
GOLDBERG, IG ;
EARNSHAW, WC .
SCIENCE, 1995, 270 (5242) :1591-1594
[35]   Confirming the phylogeny of mammals by use of large comparative sequence data sets [J].
Prasad, Arjun B. ;
Allard, Marc W. ;
Green, Eric D. .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (09) :1795-1808
[36]   Karyotype relationships between distantly related marsupials from South America and Australia [J].
Rens, W ;
O'Brien, PCM ;
Yang, F ;
Solanky, N ;
Perelman, P ;
Graphodatsky, AS ;
Ferguson, MWJ ;
Svartman, M ;
De Leo, AA ;
Graves, JAM ;
Ferguson-Smith, MA .
CHROMOSOME RESEARCH, 2001, 9 (04) :301-308
[37]   Human centromeric alphoid domains are periodically homogenized so that they vary substantially between homologues.: Mechanism and implications for centromere functioning [J].
Roizès, G .
NUCLEIC ACIDS RESEARCH, 2006, 34 (06) :1912-1924
[38]   The DNA sequence of the human X chromosome [J].
Ross, MT ;
Grafham, DV ;
Coffey, AJ ;
Scherer, S ;
McLay, K ;
Muzny, D ;
Platzer, M ;
Howell, GR ;
Burrows, C ;
Bird, CP ;
Frankish, A ;
Lovell, FL ;
Howe, KL ;
Ashurst, JL ;
Fulton, RS ;
Sudbrak, R ;
Wen, GP ;
Jones, MC ;
Hurles, ME ;
Andrews, TD ;
Scott, CE ;
Searle, S ;
Ramser, J ;
Whittaker, A ;
Deadman, R ;
Carter, NP ;
Hunt, SE ;
Chen, R ;
Cree, A ;
Gunaratne, P ;
Havlak, P ;
Hodgson, A ;
Metzker, ML ;
Richards, S ;
Scott, G ;
Steffen, D ;
Sodergren, E ;
Wheeler, DA ;
Worley, KC ;
Ainscough, R ;
Ambrose, KD ;
Ansari-Lari, MA ;
Aradhya, S ;
Ashwell, RIS ;
Babbage, AK ;
Bagguley, CL ;
Ballabio, A ;
Banerjee, R ;
Barker, GE ;
Barlow, KF .
NATURE, 2005, 434 (7031) :325-337
[39]   Analysis of the centromeric regions of the human genome assembly [J].
Rudd, MK ;
Willard, HF .
TRENDS IN GENETICS, 2004, 20 (11) :529-533
[40]   Genomic and genetic definition of a functional human centromere [J].
Schueler, MG ;
Higgins, AW ;
Rudd, MK ;
Gustashaw, K ;
Willard, HF .
SCIENCE, 2001, 294 (5540) :109-115