Assessment of community-submitted ontology annotations from a novel database-journal partnership

被引:11
作者
Berardini, Tanya Z. [1 ]
Li, Donghui [1 ]
Muller, Robert [1 ]
Chetty, Raymond [1 ]
Ploetz, Larry [1 ]
Singh, Shanker [1 ]
Wensel, April [1 ]
Huala, Eva [1 ]
机构
[1] Carnegie Inst Sci, Dept Plant Biol, Stanford, CA 94305 USA
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2012年
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
GENE; MODEL;
D O I
10.1093/database/bas030
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
As the scientific literature grows, leading to an increasing volume of published experimental data, so does the need to access and analyze this data using computational tools. The most commonly used method to convert published experimental data on gene function into controlled vocabulary annotations relies on a professional curator, employed by a model organism database or a more general resource such as UniProt, to read published articles and compose annotation statements based on the articles' contents. A more cost-effective and scalable approach capable of capturing gene function data across the whole range of biological research organisms in computable form is urgently needed. We have analyzed a set of ontology annotations generated through collaborations between the Arabidopsis Information Resource and several plant science journals. Analysis of the submissions entered using the online submission tool shows that most community annotations were well supported and the ontology terms chosen were at an appropriate level of specificity. Of the 503 individual annotations that were submitted, 97% were approved and community submissions captured 72% of all possible annotations. This new method for capturing experimental results in a computable form provides a cost-effective way to greatly increase the available body of annotations without sacrificing annotation quality.
引用
收藏
页数:13
相关论文
共 22 条
  • [1] The Renal Gene Ontology Annotation Initiative
    Alam-Faruque, Yasmin
    Dimmer, Emily C.
    Huntley, Rachael P.
    O'Donovan, Claire
    Scambler, Peter
    Apweiler, Rolf
    [J]. ORGANOGENESIS, 2010, 6 (02) : 71 - 75
  • [2] [Anonymous], CURR PROTOC BIOINFOR
  • [3] The GOA database in 2009-an integrated Gene Ontology Annotation resource
    Barrell, Daniel
    Dimmer, Emily
    Huntley, Rachael P.
    Binns, David
    O'Donovan, Claire
    Apweiler, Rolf
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D396 - D403
  • [4] The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium
    Berardini, Tanya Z.
    Li, Donghui
    Huala, Eva
    Bridges, Susan
    Burgess, Shane
    McCarthy, Fiona
    Carbon, Seth
    Lewis, Suzanna E.
    Mungall, Christopher J.
    Abdulla, Amina
    Wood, Valerie
    Feltrin, Erika
    Valle, Giorgio
    Chisholm, Rex L.
    Fey, Petra
    Gaudet, Pascale
    Kibbe, Warren
    Basu, Siddhartha
    Bushmanova, Yulia
    Eilbeck, Karen
    Siegele, Deborah A.
    McIntosh, Brenley
    Renfro, Daniel
    Zweifel, Adrienne
    Hu, James C.
    Ashburner, Michael
    Tweedie, Susan
    Alam-Faruque, Yasmin
    Apweiler, Rolf
    Auchinchloss, Andrea
    Bairoch, Amos
    Barrell, Daniel
    Binns, David
    Blatter, Marie-Claude
    Bougueleret, Lydie
    Boutet, Emmanuel
    Breuza, Lionel
    Bridge, Alan
    Browne, Paul
    Chan, Wei Mun
    Coudert, Elizabeth
    Daugherty, Louise
    Dimmer, Emily
    Eberhardt, Ruth
    Estreicher, Anne
    Famiglietti, Livia
    Ferro-Rojas, Serenella
    Feuermann, Marc
    Foulger, Rebecca
    Gruaz-Gumowski, Nadine
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : D331 - D335
  • [5] The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics
    Blake, Judith A.
    Bult, Carol J.
    Kadin, James A.
    Richardson, Joel E.
    Eppig, Janan T.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D842 - D848
  • [6] Web-Queryable Large-Scale Data Sets for Hypothesis Generation in Plant Biology
    Brady, Siobhan M.
    Provart, Nicholas J.
    [J]. PLANT CELL, 2009, 21 (04) : 1034 - 1051
  • [7] Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
    Costanzo, Maria C.
    Park, Julie
    Balakrishnan, Rama
    Cherry, J. Michael
    Hong, Eurie L.
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
  • [8] Gene Ontology annotations: what they mean and where they come from
    Hill, David P.
    Smith, Barry
    McAndrews-Hill, Monica S.
    Blake, Judith A.
    [J]. BMC BIOINFORMATICS, 2008, 9 (Suppl 5)
  • [9] Big data: The future of biocuration
    Howe, Doug
    Costanzo, Maria
    Fey, Petra
    Gojobori, Takashi
    Hannick, Linda
    Hide, Winston
    Hill, David P.
    Kania, Renate
    Schaeffer, Mary
    St Pierre, Susan
    Twigger, Simon
    White, Owen
    Rhee, Seung Yon
    [J]. NATURE, 2008, 455 (7209) : 47 - 50
  • [10] Systematic prediction of gene function in Arabidopsis thaliana using a probabilistic functional gene network
    Hwang, Sohyun
    Rhee, Seung Y.
    Marcotte, Edward M.
    Lee, Insuk
    [J]. NATURE PROTOCOLS, 2011, 6 (09) : 1429 - 1442