The PEDANT genome database

被引:90
作者
Frishman, D
Mokrejs, M
Kosykh, D
Kastenmüller, G
Kolesov, G
Zubrzycki, I
Gruber, C
Geier, B
Kaps, A
Albermann, K
Volz, A
Wagner, C
Fellenberg, M
Heumann, K
Mewes, HW
机构
[1] GSF, Natl Res Ctr Environm & Hlth, Inst Bioinformat, D-85764 Neuherberg, Germany
[2] Biomax Informat AG, D-82152 Martinsried, Germany
[3] Tech Univ Munich, Wissensch Zentrum Weihenstephan, Dept Genome Oriented Bioinformat, D-85350 Freising Weihenstephan, Germany
关键词
D O I
10.1093/nar/gkg005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The PEDANT genome database (http://pedant.gsf.de) provides exhaustive automatic analysis of genomic sequences by a large variety of established bioinformatics tools through a comprehensive Web-based user interface. One hundred and seventy seven completely sequenced and unfinished genomes have been processed so far, including large eukaryotic genomes (mouse, human) published recently. In this contribution, we describe the current status of the PEDANT database and novel analytical features added to the PEDANT server in 2002. Those include: (i) integration with the BioRS(TM) data retrieval system which allows fast text queries, (ii) pre-computed sequence clusters in each complete genome, (iii) a comprehensive set of tools for genome comparison, including genome comparison tables and protein function prediction based on genomic context, and (iv) computation and visualization of protein protein interaction (PPI) networks based on experimental data. The availability of functional and structural predictions for 650 000 genomic proteins in well organized form makes PEDANT a useful resource for both functional and structural genomics.
引用
收藏
页码:207 / 211
页数:5
相关论文
共 29 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The InterPro database, an integrated documentation resource for protein families, domains and functional sites [J].
Apweiler, R ;
Attwood, TK ;
Bairoch, A ;
Bateman, A ;
Birney, E ;
Biswas, M ;
Bucher, P ;
Cerutti, T ;
Corpet, F ;
Croning, MDR ;
Durbin, R ;
Falquet, L ;
Fleischmann, W ;
Gouzy, J ;
Hermjakob, H ;
Hulo, N ;
Jonassen, I ;
Kahn, D ;
Kanapin, A ;
Karavidopoulou, Y ;
Lopez, R ;
Marx, B ;
Mulder, NJ ;
Oinn, TM ;
Pagni, M ;
Servant, F ;
Sigrist, CJA ;
Zdobnov, EM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :37-40
[3]   The Protein Information Resource (PIR) [J].
Barker, WC ;
Garavelli, JS ;
Huang, HZ ;
McGarvey, PB ;
Orcutt, BC ;
Srinivasarao, GY ;
Xiao, CL ;
Yeh, LSL ;
Ledley, RS ;
Janda, JF ;
Pfeiffer, F ;
Mewes, HW ;
Tsugita, A ;
Wu, C .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :41-44
[4]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[5]   GenBank [J].
Benson, DA ;
Karsch-Mizrachi, I ;
Lipman, DJ ;
Ostell, J ;
Rapp, BA ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :17-20
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[8]   The PROSITE database, its status in 2002 [J].
Falquet, L ;
Pagni, M ;
Bucher, P ;
Hulo, N ;
Sigrist, CJA ;
Hofmann, K ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :235-238
[9]  
Fellenberg M, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P152
[10]   PEDANTic genome analysis [J].
Frishman, D ;
Mewes, HW .
TRENDS IN GENETICS, 1997, 13 (10) :415-416