Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane

Cited by: 189
Authors
Vettore, AL
da Silva, FR
Kemper, EL
Souza, GM
da Silva, AM
Ferro, MIT
Henrique-Silva, F
Giglioti, ÉA
Lemos, MVF
Coutinho, LL
Nobrega, MP
Carrer, H
França, SC
Bacci, M
Goldman, MHS
Gomes, SL
Nunes, LR
Camargo, LEA
Siqueira, WJ
Van Sluys, MA
Thiemann, OH
Kuramae, EE
Santelli, RV
Marino, CL
Targon, MLPN
Ferro, JA
Silveira, HCS
Marini, DC
Lemos, EGM
Monteiro-Vitorello, CB
Tambor, JHM
Carraro, DM
Roberto, PG
Martins, VG
Goldman, GH
de Oliveira, RC
Truffi, D
Colombo, CA
Rossi, M
de Araujo, PG
Sculaccio, SA
Angella, A
Lima, MMA
de Rosa, VE
Siviero, F
Coscrato, VE
Machado, MA
Grivet, L
Di Mauro, SMZ
Nobrega, FG
Affiliations
[1] CIRAD, UMR 1096, F-34398 Montpellier 5, France
[2] Univ Sao Paulo, Fac Ciencias Farmaceut Ribeirao Preto, Dept Ciencias Farmaceut, BR-14040903 Ribeirao Preto, SP, Brazil
[3] Univ Estadual Paulista, Dept Fonoaudiol, BR-17525900 Marilia, SP, Brazil
[4] Inst Agron Campinas, Ctr APTA Citros Sylvio Moreira, BR-13490970 Cordeiropolis, SP, Brazil
[5] Univ Estadual Paulista, Inst Biociencias, Dept Genet, BR-18618000 Botucatu, SP, Brazil
[6] Univ Estadual Paulista, Fac Ciencias Agron, Dept Defesa Fitossanitaria, BR-18603970 Botucatu, SP, Brazil
[7] Univ Sao Paulo, Inst Fis Sao Carlos, BR-13560970 Sao Carlos, SP, Brazil
[8] Inst Agron Campinas, Ctr Genet Biol Mol & Fitoquim, BR-13001970 Campinas, SP, Brazil
[9] Univ Mogi das Cruzes, Nucleo Integrado Biotecnol, BR-08780911 Mogi Das Cruzes, SP, Brazil
[10] Univ Sao Paulo, Fac Filosofia Ciencias & Letras Ribeirao Preto, Dept Biol, BR-14040901 Ribeirao Preto, SP, Brazil
[11] Univ Estadual Paulista, Inst Biociencias, Ctr Estudos Insetos Sociais, BR-13506900 Rio Claro, SP, Brazil
[12] Univ Ribeirao Preto, Dept Biotecnol Plantas Med, BR-14096380 Ribeirao Preto, Brazil
[13] Univ Vale Paraiba, Inst Pesquisa & Desenvolvimento, Lab Genet Mol & Genomas, BR-12244000 Sao Jose Dos Campos, SP, Brazil
[14] Univ Sao Paulo, Escola Super Agr Luiz de Queiroz, BR-13418900 Piracicaba, SP, Brazil
[15] Univ Fed Sao Carlos, Ctr Ciencias Agrarias, Dept Biotecnol Vegetal, BR-13600970 Araras, SP, Brazil
[16] Univ Fed Sao Carlos, Dept Genet & Evolucao, BR-13565905 Sao Carlos, SP, Brazil
[17] Univ Estadual Paulista, Fac Ciencias Agrarias & Vet Jaboticabal, Dept Biol Aplicada Agropecuaria, BR-14884900 Jaboticabal, SP, Brazil
[18] Univ Estadual Paulista, Fac Ciencias Agrarias & Vet Jaboticabal, Dept Tecnol, BR-14884900 Jaboticabal, SP, Brazil
[19] Univ Sao Paulo, Inst Ciencias Biomed, Dept Microbiol, BR-05508900 Sao Paulo, Brazil
[20] Univ Sao Paulo, Inst Biociencias, Dept Bot, BR-05508900 Sao Paulo, Brazil
[21] Univ Sao Paulo, Inst Quim, Dept Bioquim, BR-05508900 Sao Paulo, Brazil
[22] Univ Estadual Campinas, Inst Computacao, Lab Bioinformat, BR-13083970 Campinas, SP, Brazil
[23] Univ Estadual Campinas, Ctr Biol Mol & Engn Genet, BR-13083970 Campinas, SP, Brazil
DOI
10.1101/gr.1532103
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembly. This indicated that possibly 33,620 unique genes had been identified and that >90% of the sugarcane expressed genes were tagged.
Pages: 2725-2735
Number of pages: 11