Quantitative measures for the management and comparison of annotated genomes

被引:104
作者
Eilbeck, Karen
Moore, Barry
Holt, Carson
Yandell, Mark [1 ]
机构
[1] Univ Utah, Dept Human Genet, Eccles Inst Human Genet, Salt Lake City, UT 84112 USA
关键词
CONSERVATION; EVOLUTION; SEQUENCE; EGASP;
D O I
10.1186/1471-2105-10-67
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Background: The ever-increasing number of sequenced and annotated genomes has made management of their annotations a significant undertaking, especially for large eukaryotic genomes containing many thousands of genes. Typically, changes in gene and transcript numbers are used to summarize changes from release to release, but these measures say nothing about changes to individual annotations, nor do they provide any means to identify annotations in need of manual review. Results: In response, we have developed a suite of quantitative measures to better characterize changes to a genome's annotations between releases, and to prioritize problematic annotations for manual review. We have applied these measures to the annotations of five eukaryotic genomes over multiple releases-H. sapiens, M. musculus, D. melanogaster, A. gambiae, and C. elegans. Conclusion: Our results provide the first detailed, historical overview of how these genomes' annotations have changed over the years, and demonstrate the usefulness of these measures for genome annotation management.
引用
收藏
页数:15
相关论文
共 29 条
[1]
Benson DA, 2017, NUCLEIC ACIDS RES, V45, pD37, DOI [10.1093/nar/gkp1024, 10.1093/nar/gkw1070, 10.1093/nar/gkq1079, 10.1093/nar/gkl986, 10.1093/nar/gkr1202, 10.1093/nar/gkx1094, 10.1093/nar/gks1195, 10.1093/nar/gkn723, 10.1093/nar/gkg057]
[2]
WormBase:: new content and better access [J].
Bieri, Tamberlyn ;
Blasiar, Darin ;
Ozersky, Philip ;
Antoshechkin, Igor ;
Bastiani, Carol ;
Canaran, Payan ;
Chan, Juancarlos ;
Chen, Nansheng ;
Chen, Wen J. ;
Davis, Paul ;
Fiedler, Tristan J. ;
Girard, Lisa ;
Han, Michael ;
Harris, Todd W. ;
Kishore, Ranjana ;
Lee, Raymond ;
McKay, Sheldon ;
Muller, Hans-Michael ;
Nakamura, Cecilia ;
Petcherski, Andrei ;
Rangarajan, Arun ;
Rogers, Anthony ;
Schindelman, Gary ;
Schwarz, Erich M. ;
Spooner, Will ;
Tuli, Mary Ann ;
Van Auken, Kimberly ;
Wang, Daniel ;
Wang, Xiaodong ;
Williams, Gary ;
Durbin, Richard ;
Stein, Lincoln D. ;
Sternberg, Paul W. ;
Spieth, John .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D506-D510
[3]
Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[4]
The Drosophila melanogaster genome [J].
Celniker, SE ;
Rubin, GM .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2003, 4 :89-117
[5]
CELNIKER SE, 2002, GENOME BIOL, V0003
[6]
FlyBase: genomes by the dozen [J].
Crosby, Madeline A. ;
Goodman, Joshua L. ;
Strelets, Victor B. ;
Zhang, Peili ;
Gelbart, William M. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D486-D491
[7]
The Sequence Ontology: a tool for the unification of genome annotations [J].
Eilbeck, K ;
Lewis, SE ;
Mungall, CJ ;
Yandell, M ;
Stein, L ;
Durbin, R ;
Ashburner, M .
GENOME BIOLOGY, 2005, 6 (05)
[8]
EGASP:: collaboration through competition to find human genes [J].
Guigó, R ;
Reese, MG .
NATURE METHODS, 2005, 2 (08) :575-577
[9]
EGASP:: the human ENCODE genome annotation assessment project [J].
Guigo, Roderic ;
Flicek, Paul ;
Abril, Josep F. ;
Reymond, Alexandre ;
Lagarde, Julien ;
Denoeud, France ;
Antonarakis, Stylianos ;
Ashburner, Michael ;
Bajic, Vladimir B. ;
Birney, Ewan ;
Castelo, Robert ;
Eyras, Eduardo ;
Ucla, Catherine ;
Gingeras, Thomas R. ;
Harrow, Jennifer ;
Hubbard, Tim ;
Lewis, Suzanna E. ;
Reese, Martin G. .
GENOME BIOLOGY, 2006, 7 (Suppl 1)
[10]
ASAP: the Alternative Splicing Annotation Project [J].
Lee, C ;
Atanelov, L ;
Modrek, B ;
Xing, Y .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :101-105