Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing

被引:243
作者
Trick, Martin [1 ]
Long, Yan [2 ]
Meng, Jinling [2 ]
Bancroft, Ian [1 ]
机构
[1] John Innes Inst, Norwich NR4 7UH, Norfolk, England
[2] Huazhong Agr Univ, Natl Key Lab Crop Genet Improvement, Wuhan 430070, Peoples R China
基金
英国生物技术与生命科学研究理事会;
关键词
Brassica; marker; next generation sequencing; sequence; single nucleotide polymorphism; Solexa; ARABIDOPSIS-THALIANA; COMPARATIVE GENOMICS; OILSEED RAPE; GENE; SEGMENTS; OLERACEA; REGION; MAIZE;
D O I
10.1111/j.1467-7652.2008.00396.x
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Oilseed rape (Brassica napus) was selected as an example of a polyploid crop, and the Solexa sequencing system was used to generate approximately 20 million expressed sequence tags (ESTs) from each of two cultivars: Tapidor and Ningyou 7. A methodology and computational tools were developed to exploit, as a reference sequence, a publicly available set of approximately 94 000 Brassica species unigenes. Sequences transcribed in the leaves of juvenile plants were aligned to approximately 26 Mb of the reference sequences. The aligned sequences enabled the detection of 23 330-41 593 putative single nucleotide polymorphisms (SNPs) between the cultivars, depending on the read depth stringency applied. The majority of the detected polymorphisms (87.5-91.2%) were of a type indicative of transcription from homoeologous genes from the two parental genomes within oilseed rape, and are termed here 'hemi-SNPs'. The overall estimated polymorphism rate (similar to 0.047%-0.084%) is consistent with that previously observed between the cultivars analysed. To demonstrate the heritability of SNPs and to assess their suitability for applications such as linkage map construction and association genetics, approximately nine million ESTs were generated, using the Solexa system, from each of four lines of a doubled haploid mapping population derived from a cross between Tapidor and Ningyou 7. Computational tools were developed to score the alleles present in these lines for each of the potential SNPs identified between their parents. For a specimen region of the genome analysed in detail, segregation of alleles largely, although not entirely, followed the pattern expected for genomic markers.
引用
收藏
页码:334 / 346
页数:13
相关论文
共 27 条
[1]  
Arumuganathan K, 1991, PLANT MOL BIOL REP, V9, P208, DOI DOI 10.1007/BF02672069
[2]   SNP discovery via 454 transcriptome sequencing [J].
Barbazuk, W. Brad ;
Emrich, Scott J. ;
Chen, Hsin D. ;
Li, Li ;
Schnable, Patrick S. .
PLANT JOURNAL, 2007, 51 (05) :910-918
[3]   Association study designs for complex diseases [J].
Cardon, LR ;
Bell, JI .
NATURE REVIEWS GENETICS, 2001, 2 (02) :91-99
[4]   Structure of linkage disequilibrium in plants [J].
Flint-Garcia, SA ;
Thornsberry, JM ;
Buckler, ES .
ANNUAL REVIEW OF PLANT BIOLOGY, 2003, 54 :357-374
[5]   Quality assessment of maize assembled genomic islands (MAGIs) and large-scale experimental verification of predicted genes [J].
Fu, Y ;
Emrich, SJ ;
Guo, L ;
Wen, TJ ;
Ashlock, DA ;
Aluru, S ;
Schnable, PS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (34) :12282-12287
[6]   A Sanger/pyrosequencing hybrid approach tor the generation of high-quality draft assemblies of marine microbial genomes [J].
Goldberg, Susanne M. D. ;
Johnson, Justin ;
Busam, Dana ;
Feldblyum, Tamara ;
Ferriera, Steve ;
Friedman, Robert ;
Halpern, Aaron ;
Khouri, Hoda ;
Kravitz, Saul A. ;
Lauro, Federico M. ;
Li, Kelvin ;
Rogers, Yu-Hui ;
Strausberg, Robert ;
Sutton, Granger ;
Tallon, Luke ;
Thomas, Torsten ;
Venter, Eli ;
Frazier, Marvin ;
Venter, J. Craig .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (30) :11240-11245
[7]   Phylogenetic analysis of Brassiceae based on the nucleotide sequences of the S-locus related gene, SLR1 [J].
Inaba, R ;
Nishio, T .
THEORETICAL AND APPLIED GENETICS, 2002, 105 (08) :1159-1165
[8]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[9]  
LADEANA WH, 2008, NAT METHODS, V5, P183
[10]  
Lagercrantz U, 1996, GENETICS, V144, P1903