The institute for genomic research Osa1 rice genome annotation database

被引:161
作者
Yuan, QP [1 ]
Shu, OY [1 ]
Wang, AH [1 ]
Zhu, W [1 ]
Maiti, R [1 ]
Lin, HN [1 ]
Hamilton, J [1 ]
Haas, B [1 ]
Sultana, R [1 ]
Cheung, F [1 ]
Wortman, J [1 ]
Buell, CR [1 ]
机构
[1] Inst Genom Res, Rockville, MD 20850 USA
关键词
D O I
10.1104/pp.104.059063
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
We have developed a rice ( Oryza sativa) genome annotation database ( Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis ( Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 nontransposable element- related genes, 18,545 ( 42.4%) were annotated with a putative function, 5,777 ( 13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 ( 44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms ( 5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser.
引用
收藏
页码:17 / 26
页数:9
相关论文
共 37 条
[1]  
Arumuganathan K., 1991, PLANT MOL BIOL REP, V9, P229, DOI DOI 10.1007/BF02672073
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   The use of the Monsanto draft rice genome sequence in research [J].
Barry, GF .
PLANT PHYSIOLOGY, 2001, 125 (03) :1164-1165
[4]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
[5]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[6]  
CAUSSE MA, 1994, GENETICS, V138, P1251
[7]   The Distributed Annotation System [J].
Dowell, Robin D. ;
Jokerst, Rodney M. ;
Day, Allen ;
Eddy, Sean R. ;
Stein, Lincoln .
BMC BIOINFORMATICS, 2001, 2 (1)
[8]   Comparative genetics in the grasses [J].
Gale, MD ;
Devos, KM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (05) :1971-1974
[9]   A draft sequence of the rice genome (Oryza sativa L. ssp japonica) [J].
Goff, SA ;
Ricke, D ;
Lan, TH ;
Presting, G ;
Wang, RL ;
Dunn, M ;
Glazebrook, J ;
Sessions, A ;
Oeller, P ;
Varma, H ;
Hadley, D ;
Hutchinson, D ;
Martin, C ;
Katagiri, F ;
Lange, BM ;
Moughamer, T ;
Xia, Y ;
Budworth, P ;
Zhong, JP ;
Miguel, T ;
Paszkowski, U ;
Zhang, SP ;
Colbert, M ;
Sun, WL ;
Chen, LL ;
Cooper, B ;
Park, S ;
Wood, TC ;
Mao, L ;
Quail, P ;
Wing, R ;
Dean, R ;
Yu, YS ;
Zharkikh, A ;
Shen, R ;
Sahasrabudhe, S ;
Thomas, A ;
Cannings, R ;
Gutin, A ;
Pruss, D ;
Reid, J ;
Tavtigian, S ;
Mitchell, J ;
Eldredge, G ;
Scholl, T ;
Miller, RM ;
Bhatnagar, S ;
Adey, N ;
Rubano, T ;
Tusneem, N .
SCIENCE, 2002, 296 (5565) :92-100
[10]   Early and multiple Ac transpositions in rice suitable for efficient insertional mutagenesis [J].
Greco, R ;
Ouwerkerk, PBF ;
Taal, AJC ;
Favalli, C ;
Beguiristain, T ;
Puigdomènech, P ;
Colombo, L ;
Hoge, JHC ;
Pereira, A .
PLANT MOLECULAR BIOLOGY, 2001, 46 (02) :215-227