Integration of mate pair sequences to improve shotgun assemblies of flow-sorted chromosome arms of hexaploid wheat

被引:12
作者
Belova, Tatiana [1 ]
Zhan, Bujie [1 ]
Wright, Jonathan [2 ]
Caccamo, Mario [2 ]
Asp, Torben [3 ]
Simkova, Hana [4 ]
Kent, Matthew [5 ,6 ]
Bendixen, Christian [7 ]
Panitz, Frank [7 ]
Lien, Sigbjorn [5 ,6 ]
Dolezel, Jaroslav [4 ]
Olsen, Odd-Arne [1 ]
Sandve, Simen R. [1 ]
机构
[1] Univ Life Sci, Dept Plant & Environm Sci, As, Norway
[2] Genome Anal Ctr TGAC, Norwich NR4 7UH, Norfolk, England
[3] Aarhus Univ, Dept Mol Biol & Genet, DK-4200 Slagelse, Denmark
[4] Ctr Reg Hana, Inst Expt Bot, Olomouc 77200, Czech Republic
[5] Norwegian Univ Life Sci, Ctr Integrat Genet CIGENE, N-1432 As, Norway
[6] Norwegian Univ Life Sci, Dept Anim & Aquacultural Sci, N-1432 As, Norway
[7] Aarhus Univ, Fac Agr Sci, Dept Genet & Biotechnol, DK-8830 Tjele, Denmark
来源
BMC GENOMICS | 2013年 / 14卷
基金
英国生物技术与生命科学研究理事会;
关键词
Wheat; Assembly; Scaffold; Mate-pair; MDA; Improvement; GENOME SEQUENCE; GENE; ORGANIZATION; EVOLUTION;
D O I
10.1186/1471-2164-14-222
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The assembly of the bread wheat genome sequence is challenging due to allohexaploidy and extreme repeat content (>80%). Isolation of single chromosome arms by flow sorting can be used to overcome the polyploidy problem, but the repeat content cause extreme assembly fragmentation even at a single chromosome level. Long jump paired sequencing data (mate pairs) can help reduce assembly fragmentation by joining multiple contigs into single scaffolds. The aim of this work was to assess how mate pair data generated from multiple displacement amplified DNA of flow-sorted chromosomes affect assembly fragmentation of shotgun assemblies of the wheat chromosomes. Results: Three mate pair (MP) libraries (2 Kb, 3 Kb, and 5 Kb) were sequenced to a total coverage of 89x and 64x for the short and long arm of chromosome 7B, respectively. Scaffolding using SSPACE improved the 7B assembly contiguity and decreased gene space fragmentation, but the degree of improvement was greatly affected by scaffolding stringency applied. At the lowest stringency the assembly N50 increased by similar to 7 fold, while at the highest stringency N50 was only increased by similar to 1.5 fold. Furthermore, a strong positive correlation between estimated scaffold reliability and scaffold assembly stringency was observed. A 7BS scaffold assembly with reduced MP coverage proved that assembly contiguity was affected only to a small degree down to similar to 50% of the original coverage. Conclusion: The effect of MP data integration into pair end shotgun assemblies of wheat chromosome was moderate; possibly due to poor contig assembly contiguity, the extreme repeat content of wheat, and the use of amplified chromosomal DNA for MP library construction.
引用
收藏
页数:11
相关论文
共 32 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   The genome of Theobroma cacao [J].
Argout, Xavier ;
Salse, Jerome ;
Aury, Jean-Marc ;
Guiltinan, Mark J. ;
Droc, Gaetan ;
Gouzy, Jerome ;
Allegre, Mathilde ;
Chaparro, Cristian ;
Legavre, Thierry ;
Maximova, Siela N. ;
Abrouk, Michael ;
Murat, Florent ;
Fouet, Olivier ;
Poulain, Julie ;
Ruiz, Manuel ;
Roguet, Yolande ;
Rodier-Goud, Maguy ;
Barbosa-Neto, Jose Fernandes ;
Sabot, Francois ;
Kudrna, Dave ;
Ammiraju, Jetty Siva S. ;
Schuster, Stephan C. ;
Carlson, John E. ;
Sallet, Erika ;
Schiex, Thomas ;
Dievart, Anne ;
Kramer, Melissa ;
Gelley, Laura ;
Shi, Zi ;
Berard, Aurelie ;
Viot, Christopher ;
Boccara, Michel ;
Risterucci, Ange Marie ;
Guignon, Valentin ;
Sabau, Xavier ;
Axtell, Michael J. ;
Ma, Zhaorong ;
Zhang, Yufan ;
Brown, Spencer ;
Bourge, Mickael ;
Golser, Wolfgang ;
Song, Xiang ;
Clement, Didier ;
Rivallan, Ronan ;
Tahi, Mathias ;
Akaza, Joseph Moroh ;
Pitollat, Bertrand ;
Gramacho, Karina ;
D'Hont, Angelique ;
Brunel, Dominique .
NATURE GENETICS, 2011, 43 (02) :101-108
[3]   Sequencing wheat chromosome arm 7BS delimits the 7BS/4AL translocation and reveals homoeologous gene conservation [J].
Berkman, Paul J. ;
Skarshewski, Adam ;
Manoli, Sahana ;
Lorenc, Micha T. ;
Stiller, Jiri ;
Smits, Lars ;
Lai, Kaitao ;
Campbell, Emma ;
Kubalakova, Marie ;
Simkova, Hana ;
Batley, Jacqueline ;
Dolezel, Jaroslav ;
Hernandez, Pilar ;
Edwards, David .
THEORETICAL AND APPLIED GENETICS, 2012, 124 (03) :423-432
[4]   Sequencing and assembly of low copy and genic regions of isolated Triticum aestivum chromosome arm 7DS [J].
Berkman, Paul J. ;
Skarshewski, Adam ;
Lorenc, Michal T. ;
Lai, Kaitao ;
Duran, Chris ;
Ling, Edmund Y. S. ;
Stiller, Jiri ;
Smits, Lars ;
Imelfort, Michael ;
Manoli, Sahana ;
McKenzie, Megan ;
Kubalakova, Marie ;
Simkova, Hana ;
Batley, Jacqueline ;
Fleury, Delphine ;
Dolezel, Jaroslav ;
Edwards, David .
PLANT BIOTECHNOLOGY JOURNAL, 2011, 9 (07) :768-775
[5]   Megabase Level Sequencing Reveals Contrasted Organization and Evolution Patterns of the Wheat Gene and Transposable Element Spaces [J].
Choulet, Frederic ;
Wicker, Thomas ;
Rustenholz, Camille ;
Paux, Etienne ;
Salse, Jerome ;
Leroy, Philippe ;
Schlub, Stephane ;
Le Paslier, Marie-Christine ;
Magdelenat, Ghislaine ;
Gonthier, Catherine ;
Couloux, Arnaud ;
Budak, Hikmet ;
Breen, James ;
Pumphrey, Michael ;
Liu, Sixin ;
Kong, Xiuying ;
Jia, Jizeng ;
Gut, Marta ;
Brunel, Dominique ;
Anderson, James A. ;
Gill, Bikram S. ;
Appels, Rudi ;
Keller, Beat ;
Feuillet, Catherine .
PLANT CELL, 2010, 22 (06) :1686-1701
[6]   How to apply de Bruijn graphs to genome assembly [J].
Compeau, Phillip E. C. ;
Pevzner, Pavel A. ;
Tesler, Glenn .
NATURE BIOTECHNOLOGY, 2011, 29 (11) :987-991
[7]   Chromosome-based genomics in the cereals [J].
Dolezel, Jaroslav ;
Kubalakova, Marie ;
Paux, Etienne ;
Bartos, Jan ;
Feuillet, Catherine .
CHROMOSOME RESEARCH, 2007, 15 (01) :51-66
[8]  
Dolezel J, 2009, PLANT GENET GENOMICS, V7, P285, DOI 10.1007/978-0-387-77489-3_10
[9]  
DVORAK J, 1993, GENOME, V36, P21, DOI 10.1139/g93-004
[10]   VARIATION IN REPEATED NUCLEOTIDE-SEQUENCES SHEDS LIGHT ON THE PHYLOGENY OF THE WHEAT B-GENOMES AND G-GENOMES [J].
DVORAK, J ;
ZHANG, HB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (24) :9640-9644