Positional candidate gene selection from livestock EST databases using Gene Ontology

被引:17
作者
Harhay, GP [1 ]
Keele, JW [1 ]
机构
[1] USDA ARS, Roman L Hruska US Meat Anim Res Ctr, Prod Syst, Clay Ctr, Clay Ctr, NE 68933 USA
关键词
D O I
10.1093/bioinformatics/19.2.249
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The number of expressed sequence tags (ESTs) in GenBank has now surpassed 200 000 for cattle and 100 000 for swine. The Institute of Genome Research (TIGR) has organized these sequences into approximately 60000 non-redundant consensus sequences (identified by TIGR Gene Indices) for cattle and 40000 for swine. Anonymous ESTs are of limited value unless they are connected to function. Functional information is difficult to manage electronically because of heterogeneity of meaning and form among databases. The Gene Ontology (GO) Consortium has produced ontologies for gene function with consistent meaning and form across species. Linking livestock EST to gene function through similarity with sequences from other annotation-rich mammals could accelerate: (1) the discovery of positional candidate genes underlying a livestock quantitative trait locus (QTL) and (2) comparative mapping between livestock and other mammals (e.g. humans, mouse and rat). We initiated this investigation to determine if incorporation of the GO into the annotation process could accelerate livestock positional candidate gene discovery. Results: We have associated livestock ESTs with GO nodes through sequence similarity to the NCBI Reference Sequences (RefSeq). Positional candidate genes are identified within minutes that otherwise required days. The schema described here accommodates queries that return GO nodes from terms familiar to biologists, such as gene name, alternate/alias symbol, and OMIM phenotype. Availability: Scripts and schema are available on request from the authors. Contact: harhay@email.marc.usda.gov.
引用
收藏
页码:249 / 255
页数:7
相关论文
共 14 条
[1]  
Ashburner M, 2001, GENOME RES, V11, P1425
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Freking BA, 1999, J ANIM SCI, V77, P2336
[4]   Strategies for biological annotation of mammalian systems: Implementing gene ontologies in mouse genome informatics [J].
Hill, DP ;
Davis, AP ;
Richardson, JE ;
Corradi, JP ;
Ringwald, M ;
Eppig, JT ;
Blake, JA .
GENOMICS, 2001, 74 (01) :121-128
[5]  
Keele JW, 1999, J ANIM SCI, V77, P1364
[6]  
Koohmaraie M, 1995, J ANIM SCI, V73, P3596
[7]   THE ROLE OF CA-2+-DEPENDENT PROTEASES (CALPAINS) IN POSTMORTEM PROTEOLYSIS AND MEAT TENDERNESS [J].
KOOHMARAIE, M .
BIOCHIMIE, 1992, 74 (03) :239-245
[8]   Annotating eukaryote genomes [J].
Lewis, S ;
Ashburner, M ;
Reese, MG .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2000, 10 (03) :349-354
[9]   RefSeq and LocusLink: NCBI gene-centered resources [J].
Pruitt, KD ;
Maglott, DR .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :137-140
[10]   Comparative mapping of BTA15 and HSA11 including a region containing a QTL for meat tenderness [J].
Rexroad, CE ;
Bennett, GL ;
Stone, RT ;
Keele, JW ;
Fahrenkrug, SC ;
Freking, BA ;
Kappes, SM ;
Smith, TPL .
MAMMALIAN GENOME, 2001, 12 (07) :561-565