Oligonucleotide array discovery of polymorphisms in cultivated tomato (Solanum lycopersicum L.) reveals patterns of SNP variation associated with breeding

被引:40
作者
Sim, Sung-Chur [1 ]
Robbins, Matthew D. [1 ]
Chilcott, Charles [2 ]
Zhu, Tong [2 ]
Francis, David M. [1 ]
机构
[1] Ohio State Univ, Ohio Agr Res & Dev Ctr, Dept Hort & Crop Sci, Wooster, OH 44691 USA
[2] Syngenta Biotechnol Inc, Res Triangle Pk, NC 27709 USA
来源
BMC GENOMICS | 2009年 / 10卷
关键词
SINGLE-FEATURE POLYMORPHISMS; MAP-BASED CLONING; DISEASE RESISTANCE; NUCLEOTIDE-BINDING; GENE; DIVERSITY; SUBSTITUTIONS; ESCULENTUM; GENOME; MEMBER;
D O I
10.1186/1471-2164-10-466
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Cultivated tomato (Solanum lycopersicum L.) has narrow genetic diversity that makes it difficult to identify polymorphisms between elite germplasm. We explored array-based single feature polymorphism (SFP) discovery as a high-throughput approach for marker development in cultivated tomato. Results: Three varieties, FL7600 (fresh-market), OH9242 (processing), and PI114490 (cherry) were used as a source of genomic DNA for hybridization to oligonucleotide arrays. Identification of SFPs was based on outlier detection using regression analysis of normalized hybridization data within a probe set for each gene. A subset of 189 putative SFPs was sequenced for validation. The rate of validation depended on the desired level of significance (alpha) used to define the confidence interval (CI), and ranged from 76% for polymorphisms identified at alpha <= 10(-6) to 60% for those identified at alpha <= 10(-2). Validation percentage reached a plateau between alpha <= 10(-4) and alpha <= 10(-7), but failure to identify known SFPs (Type II error) increased dramatically at alpha <= 10(-6). Trough sequence validation, we identified 279 SNPs and 27 InDels in 111 loci. Sixty loci contained >= 2 SNPs per locus. We used a subset of validated SNPs for genetic diversity analysis of 92 tomato varieties and accessions. Pairwise estimation of theta (Fst) suggested significant differentiation between collections of fresh-market, processing, vintage, Latin American (landrace), and S. pimpinellifolium accessions. The fresh-market and processing groups displayed high genetic diversity relative to vintage and landrace groups. Furthermore, the patterns of SNP variation indicated that domestication and early breeding practices have led to progressive genetic bottlenecks while modern breeding practices have reintroduced genetic variation into the crop from wild species. Finally, we examined the ratio of nonsynonymous (Ka) to synonymous substitutions (Ks) for 20 loci with multiple SNPs (>= 4 per locus). Six of 20 loci showed ratios of Ka/Ks >= 0.9. Conclusion: Array-based SFP discovery was an efficient method to identify a large number of molecular markers for genetics and breeding in elite tomato germplasm. Patterns of sequence variation across five major tomato groups provided insight into to the effect of human selection on genetic variation.
引用
收藏
页数:10
相关论文
共 48 条
[1]   Large-scale identification of single-feature polymorphisms in complex genomes [J].
Borevitz, JO ;
Liang, D ;
Plouffe, D ;
Chang, HS ;
Zhu, T ;
Weigel, D ;
Berry, CC ;
Winzeler, E ;
Chory, J .
GENOME RESEARCH, 2003, 13 (03) :513-523
[2]  
Boswell V.R., 1937, U.S. Dept. Agr. Yearbook, V1937, P176
[3]  
Chen ZY, 1994, INST MATH S, V24, P163, DOI 10.1214/lnms/1215463794
[4]   SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines [J].
Ching, A ;
Caldwell, KS ;
Jung, M ;
Dolan, M ;
Smith, OS ;
Tingey, S ;
Morgante, M ;
Rafalski, AJ .
BMC GENETICS, 2002, 3 (1)
[5]   K-estimator: calculation of the number of nucleotide substitutions per site and the confidence intervals [J].
Comeron, JM .
BIOINFORMATICS, 1999, 15 (09) :763-764
[6]  
Comeron JM, 1995, J MOL EVOL, V41, P1152, DOI 10.1007/BF00173196
[7]   Detecting single-feature polymorphisms using oligonucleotide arrays and robustified projection pursuit [J].
Cui, XP ;
Xu, J ;
Asghar, R ;
Condamine, P ;
Svensson, JT ;
Wanamaker, S ;
Stein, N ;
Roose, M ;
Close, TJ .
BIOINFORMATICS, 2005, 21 (20) :3852-3858
[8]   Detection and validation of single feature polymorphisms in cowpea (Vigna unguiculata L. Walp) using a soybean genome array [J].
Das, Sayan ;
Bhat, Prasanna R. ;
Sudhakar, Chinta ;
Ehlers, Jeffrey D. ;
Wanamaker, Steve ;
Roberts, Philip A. ;
Cui, Xinping ;
Close, Timothy J. .
BMC GENOMICS, 2008, 9 (1)
[9]   MICROSATELLITE ANALYSER (MSA): a platform independent analysis tool for large microsatellite data sets [J].
Dieringer, D ;
Schlotterer, C .
MOLECULAR ECOLOGY NOTES, 2003, 3 (01) :167-169
[10]   The tomato Cf-2 disease resistance locus comprises two functional genes encoding leucine-rich repeat proteins [J].
Dixon, MS ;
Jones, DA ;
Keddie, JS ;
Thomas, CM ;
Harrison, K ;
Jones, JDG .
CELL, 1996, 84 (03) :451-459