EpoDB: a prototype database for the analysis of genes expressed during vertebrate erythropoiesis

被引:24
作者
Stoeckert, CJ
Salas, F
Brunk, B
Overton, GC
机构
[1] Childrens Hosp Philadelphia, Div Hematol, Abramson Res Ctr 316E, Philadelphia, PA 19104 USA
[2] Pangea Syst Inc, Oakland, CA 94612 USA
[3] Univ Penn, Sch Med, Dept Genet, Philadelphia, PA 19104 USA
关键词
D O I
10.1093/nar/27.1.200
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
EpoDB is a database of genes expressed in vertebrate red blood cells. It is also a prototype for the creation of cell and tissue-specific databases from multiple external sources. The information in EpoDB obtained from GenBank, SWISS-PROT, Transfac, TRRD and GERD is curated to provide high quality data for sequence analysis aimed at understanding gene regulation during erythropoiesis, New protocols have been developed for data integration and updating entries. Using a BLAST-based algorithm, we have grouped GenBank entries representing the same gene together, This sequence similarity protocol was also used to identify new entries to be included in EpoDB. We have recently implemented our database in Sybase (relational tables) in addition to SICStus Prolog to provide us with greater flexibility in asking complex queries that utilize information from multiple sources. New additions to the public web site (http://www.cbil.upenn.edu/epodb) for accessing EpoDB are the ability to retrieve groups of entries representing different variants of the same gene and to retrieve gene expression data. The BLAST query has been enhanced by incorporating BLAST-View, an interactive and graphical display of BLAST results. We have also enhanced the queries for retrieving sequence from specified genes by the addition of MEME, a motif discovery tool, to the integrated analysis tools which include CLUSTALW and TESS.
引用
收藏
页码:200 / 203
页数:4
相关论文
共 11 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
BAILEY TL, 1995, P 3 INT C INT SYST M, P21
[3]   The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :38-42
[4]   GenBank [J].
Benson, DA ;
Boguski, MS ;
Lipman, DJ ;
Ostell, J ;
Ouellette, BFF .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :1-7
[5]   GENE STRUCTURE PREDICTION BY LINGUISTIC METHODS [J].
DONG, S ;
SEARLS, DB .
GENOMICS, 1994, 23 (03) :540-551
[6]   Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL [J].
Heinemeyer, T ;
Wingender, E ;
Reuter, I ;
Hermjakob, H ;
Kel, AE ;
Kel, OV ;
Ignatieva, EV ;
Ananko, EA ;
Podkolodnaya, OA ;
Kolpakov, FA ;
Podkolodny, NL ;
Kolchanov, NA .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :362-367
[7]  
Overton G C, 1994, J Comput Biol, V1, P3
[8]   EpoDB: a database of genes expressed during vertebrate erythropoiesis [J].
Salas, F ;
Haas, J ;
Brunk, B ;
Stoeckert, CJ ;
Overton, GC .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :288-289
[9]  
SEARLS DB, 1995, GENE, V163, P1
[10]  
STOECKERT C, 1998, P 1 INT C BIOINF GEN, V1, P20