Browsing protein families via the 'Rich Family Description' format

Cited by: 18

Authors:

Corpet, F

Gouzy, J

Kahn, D

Affiliations:

[1] Ctr INRA Toulouse, Lab Genet Cellulaire, F-31326 Castanet Tolosan, France

[2] INRA, CNRS, Lab Biol Mol Relat Plantes Microorganismes, F-31326 Castanet Tolosan, France

Source:

BIOINFORMATICS | 1999 / Vol. 15 / Issue 12

DOI:

10.1093/bioinformatics/15.12.1020

Chinese Library Classification:

Q5 [Biochemistry];

Subject classification codes:

071010 ; 081704 ;

Abstract:

Motivation: Multiple alignments of protein sequences are the basis of structural and functional analysis of protein families. It is, however, difficult even for an expert biologist to comprehend an alignment of more than 50 to 100 homologous sequences.

Results: This paper presents a browser for the analysis of multiple alignments of large numbers of protein sequences. Phylogenetic trees and consensus sequences are computed and used to summarise the alignments; these data are stored in a structure called Rich Family Description. Summary alignments and trees are displayed in HTML pages and can be developed or reduced by the user. This browser is used to display the ProDom domain families on the Web. Its zooming facilities allow extracting information from alignments of more than 1000 homologous sequences.
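The idea of summarising a group of aligned sequences by a consensus can be illustrated with a minimal sketch. This is not the authors' method or the Rich Family Description format: the function names, the majority-rule criterion, the 0.5 threshold, and the "x" placeholder for unresolved positions are all assumptions chosen for illustration.

```python
from collections import Counter


def column_consensus(column, threshold=0.5, unknown="x"):
    """Return the most frequent symbol of one alignment column.

    Gaps ("-") are counted like any other symbol. If no symbol reaches
    the (assumed) frequency threshold, the placeholder is returned.
    """
    symbol, count = Counter(column).most_common(1)[0]
    return symbol if count / len(column) >= threshold else unknown


def consensus(aligned_seqs, threshold=0.5):
    """Collapse equal-length aligned sequences into one summary sequence."""
    if len({len(s) for s in aligned_seqs}) != 1:
        raise ValueError("expected a non-empty set of equal-length aligned sequences")
    # zip(*...) iterates over the alignment column by column.
    return "".join(column_consensus(col, threshold) for col in zip(*aligned_seqs))


if __name__ == "__main__":
    # Three hypothetical aligned sequences forming one subtree.
    print(consensus(["MKV-A", "MKLGA", "MRV-A"]))
```

In a browser of the kind described, one such summary line could stand in for a whole subtree until the user chooses to develop it into its member sequences.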

Pages: 1020-1027

Page count: 8
