A global assembly of cotton ESTs

Cited by: 111
Authors
Udall, JA
Swanson, JM
Haller, K
Rapp, RA
Sparks, ME
Hatfield, J
Yu, YS
Wu, YR
Dowd, C
Arpat, AB
Sickler, BA
Wilkins, TA
Guo, JY
Chen, XY
Scheffler, J
Taliercio, E
Turley, R
McFadden, H
Payton, P
Klueva, N
Allen, R
Zhang, DS
Haigler, C
Wilkerson, C
Suo, JF
Schulze, SR
Pierce, ML
Essenberg, M
Kim, H
Llewellyn, DJ
Dennis, ES
Kudrna, D
Wing, R
Paterson, AH
Soderlund, C
Wendel, JF [1]
Affiliations
[1] Iowa State Univ, Dept Ecol Evolut & Organismal Biol, Ames, IA 50011 USA
[2] BIO5 Inst, Arizona Genom Computat Lab, Tucson, AZ 85721 USA
[3] Univ Arizona, Dept Plant Sci, Genom Inst, Tucson, AZ 85721 USA
[4] CSIRO Plant Ind, Canberra, ACT 2601, Australia
[5] Univ Calif Davis, Dept Plant Sci, Davis, CA 95616 USA
[6] Shanghai Inst Biol Sci, Inst Plant Physiol & Ecol, Shanghai 200032, Peoples R China
[7] USDA ARS, Stoneville, MS 38776 USA
[8] USDA, ARS, Lubbock, TX 79415 USA
[9] Texas Tech Univ, Dept Biol, Lubbock, TX 79409 USA
[10] N Carolina State Univ, Dept Crop Sci, Raleigh, NC 27695 USA
[11] N Carolina State Univ, Dept Bot, Raleigh, NC 27695 USA
[12] Michigan State Univ, Bioinformat Core Facil, E Lansing, MI 48824 USA
[13] Inst Genet & Dev Biol, Beijing 100101, Peoples R China
[14] Univ Georgia, Plant Genome Mapping Lab, Athens, GA 30602 USA
[15] Oklahoma State Univ, Oklahoma Agr Expt Stn, Stillwater, OK 74078 USA
DOI
10.1101/gr.4602906
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Approximately 185,000 Gossypium EST sequences comprising >94,800,000 nucleotides were amassed from 30 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including drought stress and pathogen challenges. These libraries were derived from allopolyploid cotton (Gossypium hirsutum; A_T and D_T genomes) as well as its two diploid progenitors, Gossypium arboreum (A genome) and Gossypium raimondii (D genome). ESTs were assembled using the Program for Assembling and Viewing ESTs (PAVE), resulting in 22,030 contigs and 29,077 singletons (51,107 unigenes). Further comparisons among the singletons and contigs led to recognition of 33,665 exemplar sequences that represent a nonredundant set of putative Gossypium genes containing partial or full-length coding regions and usually one or two UTRs. The assembly, along with its UniProt BLASTX hits, GO annotation, and Pfam analysis results, is freely accessible as a public resource for cotton genomics. Because ESTs from diploid and allotetraploid Gossypium were combined in a single assembly, we were in many cases able to bioinformatically distinguish duplicated genes in allotetraploid cotton and assign them to either the A or D genome. The assembly and associated information provide a framework for future investigation of cotton functional and evolutionary genomics.
Pages: 441-450
Page count: 10