Targeted journal curation as a method to improve data currency at the Comparative Toxicogenomics Database

被引:8
作者
Davis, Allan Peter [1 ]
Johnson, Robin J. [2 ]
Lennon-Hopkins, Kelley [2 ]
Sciaky, Daniela [2 ]
Rosenstein, Michael C. [2 ]
Wiegers, Thomas C. [1 ]
Mattingly, Carolyn J. [1 ]
机构
[1] N Carolina State Univ, Dept Biol, Raleigh, NC 27695 USA
[2] Mt Desert Isl Biol Lab, Dept Bioinformat, Salsbury Cove, ME 04672 USA
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2012年
关键词
EXPOSURE;
D O I
10.1093/database/bas051
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators read the scientific literature and manually curate a triad of chemical-gene, chemical-disease and gene-disease interactions. Typically, articles for CTD are selected using a chemical-centric approach by querying PubMed to retrieve a corpus containing the chemical of interest. Although this technique ensures adequate coverage of knowledge about the chemical (i. e. data completeness), it does not necessarily reflect the most current state of all toxicological research in the community at large (i.e. data currency). Keeping databases current with the most recent scientific results, as well as providing a rich historical background from legacy articles, is a challenging process. To address this issue of data currency, CTD designed and tested a journal-centric approach of curation to complement our chemical-centric method. We first identified priority journals based on defined criteria. Next, over 7 weeks, three biocurators reviewed 2425 articles from three consecutive years (2009-2011) of three targeted journals. From this corpus, 1252 articles contained relevant data for CTD and 52 752 interactions were manually curated. Here, we describe our journal selection process, two methods of document delivery for the biocurators and the analysis of the resulting curation metrics, including data currency, and both intra-journal and inter-journal comparisons of research topics. Based on our results, we expect that curation by select journals can (i) be easily incorporated into the curation pipeline to complement our chemical-centric approach; (ii) build content more evenly for chemicals, genes and diseases in CTD (rather than biasing data by chemicals-of-interest); (iii) reflect developing areas in environmental health and (iv) improve overall data currency for chemicals, genes and diseases.
引用
收藏
页数:13
相关论文
共 22 条
[1]
Environmental health research in the post-genome era: New fields, new challenges, and new opportunities [J].
Bower, JJ ;
Shi, XL .
JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH-PART B-CRITICAL REVIEWS, 2005, 8 (02) :71-94
[2]
Directly e-mailing authors of newly published papers encourages community curation [J].
Bunt, Stephanie M. ;
Grumbling, Gary B. ;
Field, Helen I. ;
Marygold, Steven J. ;
Brown, Nicholas H. ;
Millburn, Gillian H. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
[3]
New mutant phenotype data curation system in the Saccharomyces Genome Database [J].
Costanzo, Maria C. ;
Skrzypek, Marek S. ;
Nash, Robert ;
Wong, Edith ;
Binkley, Gail ;
Engel, Stacia R. ;
Hitz, Benjamin ;
Hong, Eurie L. ;
Cherry, J. Michael .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2009,
[4]
The Comparative Toxicogenomics Database facilitates identification and understanding of chemical-gene-disease associations: arsenic as a case study [J].
Davis, Allan P. ;
Murphy, Cynthia G. ;
Rosenstein, Michael C. ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
BMC MEDICAL GENOMICS, 2008, 1 (1)
[5]
MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database [J].
Davis, Allan Peter ;
Wiegers, Thomas C. ;
Rosenstein, Michael C. ;
Mattingly, Carolyn J. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
[6]
The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database [J].
Davis, Allan Peter ;
Wiegers, Thomas C. ;
Murphy, Cynthia G. ;
Mattingly, Carolyn J. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
[7]
The Comparative Toxicogenomics Database: update 2011 [J].
Davis, Allan Peter ;
King, Benjamin L. ;
Mockus, Susan ;
Murphy, Cynthia G. ;
Saraceni-Richards, Cynthia ;
Rosenstein, Michael ;
Wiegers, Thomas ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D1067-D1072
[8]
Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical-gene-disease networks [J].
Davis, Allan Peter ;
Murphy, Cynthia G. ;
Saraceni-Richards, Cynthia A. ;
Rosenstein, Michael C. ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D786-D792
[9]
Integrating text mining into the MGI biocuration workflow [J].
Dowell, K. G. ;
McAndrews-Hill, M. S. ;
Hill, D. P. ;
Drabkin, H. J. ;
Blake, J. A. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2009,
[10]
Automatic categorization of diverse experimental information in the bioscience literature [J].
Fang, Ruihua ;
Schindelman, Gary ;
Van Auken, Kimberly ;
Fernandes, Jolene ;
Chen, Wen ;
Wang, Xiaodong ;
Davis, Paul ;
Tuli, Mary Ann ;
Marygold, Steven J. ;
Millburn, Gillian ;
Matthews, Beverley ;
Zhang, Haiyan ;
Brown, Nick ;
Gelbart, William M. ;
Sternberg, Paul W. .
BMC BIOINFORMATICS, 2012, 13