phyloXML: XML for evolutionary biology and comparative genomics

被引:422
作者
Han, Mira V. [2 ]
Zmasek, Christian M. [1 ]
机构
[1] Burnham Inst Med Res, La Jolla, CA 92037 USA
[2] Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
BIOINFORMATICS; INFORMATION; NETWORK; TOOLS;
D O I
10.1186/1471-2105-10-356
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Evolutionary trees are central to a wide range of biological studies. In many of these studies, tree nodes and branches need to be associated ( or annotated) with various attributes. For example, in studies concerned with organismal relationships, tree nodes are associated with taxonomic names, whereas tree branches have lengths and oftentimes support values. Gene trees used in comparative genomics or phylogenomics are usually annotated with taxonomic information, genome-related data, such as gene names and functional annotations, as well as events such as gene duplications, speciations, or exon shufflings, combined with information related to the evolutionary tree itself. The data standards currently used for evolutionary trees have limited capacities to incorporate such annotations of different data types. Results: We developed a XML language, named phyloXML, for describing evolutionary trees, as well as various associated data items. PhyloXML provides elements for commonly used items, such as branch lengths, support values, taxonomic names, and gene names and identifiers. By using "property" elements, phyloXML can be adapted to novel and unforeseen use cases. We also developed various software tools for reading, writing, conversion, and visualization of phyloXML formatted data. Conclusion: PhyloXML is an XML language defined by a complete schema in XSD that allows storing and exchanging the structures of evolutionary trees as well as associated data. More information about phyloXML itself, the XSD schema, as well as tools implementing and supporting phyloXML, is available at http://www.phyloxml.org.
引用
收藏
页数:6
相关论文
共 18 条
[1]   Biological knowledge management: the emerging role of the Semantic Web technologies [J].
Antezana, Erick ;
Kuiper, Martin ;
Mironov, Vladimir .
BRIEFINGS IN BIOINFORMATICS, 2009, 10 (04) :392-407
[2]  
Avise J. C., 2000, PHYLOGEOGRAPHY HIST, DOI DOI 10.2307/J.CTV1NZFGJ7
[3]  
Bray Tim., 1998, EXTENSIBLE MARKUP LA
[4]   A Semantic Web for bioinformatics:: goals, tools, systems, applications [J].
Cannata, Nicola ;
Schroeder, Michael ;
Marangoni, Roberto ;
Romano, Paolo .
BMC BIOINFORMATICS, 2008, 9 (Suppl 4)
[5]   Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics [J].
Cock, Peter J. A. ;
Antao, Tiago ;
Chang, Jeffrey T. ;
Chapman, Brad A. ;
Cox, Cymon J. ;
Dalke, Andrew ;
Friedberg, Iddo ;
Hamelryck, Thomas ;
Kauff, Frank ;
Wilczynski, Bartek ;
de Hoon, Michiel J. L. .
BIOINFORMATICS, 2009, 25 (11) :1422-1423
[6]   Phylogenomics: Intersection of evolution and genomics [J].
Eisen, JA ;
Fraser, CM .
SCIENCE, 2003, 300 (5626) :1706-1707
[7]  
FELSENSTEIN J, 1989, CLADISTICS, V5, P166
[8]  
Felsenstein Joseph, 2004, Inferring Phylogenies, V2
[9]   Taxonomic markup language: applying XML to systematic data [J].
Gilmour, R .
BIOINFORMATICS, 2000, 16 (04) :406-407
[10]  
GOTO N, 2003, GENOME INFORM, V14, P629