Complete reannotation of the Arabidopsis genome:: methods, tools, protocols and the final release

被引:109
作者
Haas, BJ [1 ]
Wortman, JR [1 ]
Ronning, CM [1 ]
Hannick, LI [1 ]
Smith, RK [1 ]
Maiti, R [1 ]
Chan, AP [1 ]
Yu, CH [1 ]
Farzad, M [1 ]
Wu, DY [1 ]
White, O [1 ]
Town, CD [1 ]
机构
[1] Inst Genom Res, Rockville, MD 20850 USA
关键词
D O I
10.1186/1741-7007-3-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Since the initial publication of its complete genome sequence, Arabidopsis thaliana has become more important than ever as a model for plant research. However, the initial genome annotation was submitted by multiple centers using inconsistent methods, making the data difficult to use for many applications. Results: Over the course of three years, TIGR has completed its effort to standardize the structural and functional annotation of the Arabidopsis genome. Using both manual and automated methods, Arabidopsis gene structures were refined and gene products were renamed and assigned to Gene Ontology categories. We present an overview of the methods employed, tools developed, and protocols followed, summarizing the contents of each data release with special emphasis on our final annotation release (version 5). Conclusion: Over the entire period, several thousand new genes and pseudogenes were added to the annotation. Approximately one third of the originally annotated gene models were significantly refined yielding improved gene structure annotations, and every protein-coding gene was manually inspected and classified using Gene Ontology terms.
引用
收藏
页数:19
相关论文
共 104 条
  • [51] Alternative splicing of pre-mRNA: Developmental consequences and mechanisms of regulation
    Lopez, AJ
    [J]. ANNUAL REVIEW OF GENETICS, 1998, 32 : 279 - 305
  • [52] GeneMark.hmm: new solutions for gene finding
    Lukashin, AV
    Borodovsky, M
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (04) : 1107 - 1115
  • [53] Identification and analysis of Arabidopsis expressed sequence tags characteristic of non-coding RNAs
    MacIntosh, GC
    Wilkerson, C
    Green, PJ
    [J]. PLANT PHYSIOLOGY, 2001, 127 (03) : 765 - 776
  • [54] Functional significance of the alternative transcript processing of the Arabidopsis floral promoter FCA
    Macknight, R
    Duroux, M
    Laurie, R
    Dijkwel, P
    Simpson, G
    Dean, C
    [J]. PLANT CELL, 2002, 14 (04) : 877 - 888
  • [55] Light regulates alternative splicing of hydroxypyruvate reductase in pumpkin
    Mano, S
    Hayashi, M
    Nishimura, M
    [J]. PLANT JOURNAL, 1999, 17 (03) : 309 - 320
  • [56] Experimental RNomics:: Identification of 140 candidates for small non-messenger RNAs in the plant Arabidopsis thaliana
    Marker, C
    Zemann, A
    Terhörst, T
    Kiefmann, M
    Kastenmayer, JP
    Green, P
    Bachellerie, JP
    Brosius, J
    Hüttenhofer, A
    [J]. CURRENT BIOLOGY, 2002, 12 (23) : 2002 - 2013
  • [57] The use of MPSS for whole-genome transcriptional analysis in Arabidopsis
    Meyers, BC
    Tej, SS
    Vu, TH
    Haudenschild, CD
    Agrawal, V
    Edberg, SB
    Ghazal, H
    Decola, S
    [J]. GENOME RESEARCH, 2004, 14 (08) : 1641 - 1653
  • [58] Misra S., 2002, Genome Biol, V3, DOI [10.1186/gb-2002-3-12-research0083, DOI 10.1186/GB-2002-3-12-RESEARCH0083]
  • [59] Genomic sequence, splicing, and gene annotation
    Mount, SM
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 788 - 792
  • [60] The InterPro Database, 2003 brings increased coverage and new features
    Mulder, NJ
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Barrell, D
    Bateman, A
    Binns, D
    Biswas, M
    Bradley, P
    Bork, P
    Bucher, P
    Copley, RR
    Courcelle, E
    Das, U
    Durbin, R
    Falquet, L
    Fleischmann, W
    Griffiths-Jones, S
    Haft, D
    Harte, N
    Hulo, N
    Kahn, D
    Kanapin, A
    Krestyaninova, M
    Lopez, R
    Letunic, I
    Lonsdale, D
    Silventoinen, V
    Orchard, SE
    Pagni, M
    Peyruc, D
    Ponting, CP
    Selengut, JD
    Servant, F
    Sigrist, CJA
    Vaughan, R
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 315 - 318