Development of a EST dataset and characterization of EST-SSRs in a traditional Chinese medicinal plant, Epimedium sagittatum (Sieb. Et Zucc.) Maxim

被引:163
作者
Zeng, Shaohua [1 ,2 ]
Xiao, Gong [1 ,2 ]
Guo, Juan [1 ,2 ]
Fei, Zhangjun [3 ,4 ]
Xu, Yanqin [1 ]
Roe, Bruce A. [5 ]
Wang, Ying [1 ]
机构
[1] Chinese Acad Sci, Wuhan Bot Garden, Key Lab Pant Germplasm Enhancement & Special Agr, Wuhan 430074, Hubei, Peoples R China
[2] Chinese Acad Sci, Grad Sch, Beijing 100039, Peoples R China
[3] Cornell Univ, Boyce Thompson Inst Plant Res, Ithaca, NY 14853 USA
[4] USDA, Robert W Holley Ctr Agr & Hlth, Ithaca, NY 14853 USA
[5] Univ Oklahoma, Stephenson Res & Technol Ctr, Norman, OK 73019 USA
来源
BMC GENOMICS | 2010年 / 11卷
关键词
CROSS-SPECIES AMPLIFICATION; SIMPLE SEQUENCE REPEATS; GENIC MICROSATELLITE MARKERS; HERBA EPIMEDII; DNA; TRANSFERABILITY; IDENTIFICATION; BERBERIDACEAE; GENOME; BARLEY;
D O I
10.1186/1471-2164-11-94
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploition, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium is less-studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum. Results: cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising of 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranuleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9). Conclusion: A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. A total of 2,810 EST-SSRs is identified from EST dataset and similar to 1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species.
引用
收藏
页数:11
相关论文
共 56 条
[1]   Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species [J].
Aggarwal, Ramesh K. ;
Hendre, Prasad S. ;
Varshney, Rajeev K. ;
Bhat, Prasanna R. ;
Krishnakumar, V. ;
Singh, Lalji .
THEORETICAL AND APPLIED GENETICS, 2007, 114 (02) :359-372
[2]   FAST AND SENSITIVE SILVER STAINING OF DNA IN POLYACRYLAMIDE GELS [J].
BASSAM, BJ ;
CAETANOANOLLES, G ;
GRESSHOFF, PM .
ANALYTICAL BIOCHEMISTRY, 1991, 196 (01) :80-83
[3]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[4]   A cyclophilin gene marker confirming geographical differentiation of Norway spruce populations and indicating viability response on excess soil-born salinity [J].
Bozhko, M ;
Riegel, R ;
Schubert, R ;
Müller-Starck, G .
MOLECULAR ECOLOGY, 2003, 12 (11) :3147-3155
[5]   Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza sativa L.) [J].
Cho, YG ;
Ishii, T ;
Temnykh, S ;
Chen, X ;
Lipovich, L ;
McCouch, SR ;
Park, WD ;
Ayres, N ;
Cartinhour, S .
THEORETICAL AND APPLIED GENETICS, 2000, 100 (05) :713-722
[6]   DNA sequence quality trimming and vector removal [J].
Chou, HH ;
Holmes, MH .
BIOINFORMATICS, 2001, 17 (12) :1093-1104
[7]   RNA secondary structure:: An important cis-element in rat calcitonin/CGRP pre-messenger RNA splicing [J].
Coleman, TP ;
Roesser, JR .
BIOCHEMISTRY, 1998, 37 (45) :15941-15950
[8]  
*COMM CP, 2005, HERB EP PHARM PEOPL, P229
[9]   Structure identification of a prenylflavonol glycoside from Epimedium koreanum by electrospray ionization tandem mass spectrometry [J].
Dou, JP ;
Liu, ZQ ;
Liu, SY .
ANALYTICAL SCIENCES, 2006, 22 (03) :449-452
[10]   Gene discovery and annotation using LCM-454 transcriptome sequencing [J].
Emrich, Scott J. ;
Barbazuk, W. Brad ;
Li, Li ;
Schnable, Patrick S. .
GENOME RESEARCH, 2007, 17 (01) :69-73