PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees

Cited by: 50
Authors
Whelan, Simon
de Bakker, Paul I. W.
Quevillon, Emmanuel
Rodriguez, Nicolas
Goldman, Nick
Affiliations
[1] EMBL European Bioinformatics Institute, Hinxton CB10 1SD, Cambridgeshire, England
[2] Massachusetts General Hospital, Department of Molecular Biology, Boston, MA 02114, USA
DOI
10.1093/nar/gkj087
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
PANDIT is a database of homologous sequence alignments accompanied by estimates of their corresponding phylogenetic trees. It provides a valuable resource to those studying phylogenetic methodology and the evolution of coding-DNA and protein sequences. Currently in version 17.0, PANDIT comprises 7738 families of homologous protein domains; for each family, DNA and corresponding amino acid sequence multiple alignments are available together with high-quality phylogenetic tree estimates. Recent improvements include expanded methods for phylogenetic tree inference, assessment of alignment quality, and a redesigned web interface, available at http://www.ebi.ac.uk/goldman-srv/pandit.
Pages: D327-D331
Number of pages: 5