ETE: a python']python Environment for Tree Exploration

被引:299
作者
Huerta-Cepas, Jaime [1 ]
Dopazo, Joaquin [2 ]
Gabaldon, Toni [1 ]
机构
[1] Ctr Genom Regulat CRG, Comparat Genom Grp, Bioinformat & Genom Programme, Barcelona 08003, Spain
[2] Ctr Invest Principe Felipe, Bioinformat Dept, Valencia, Spain
关键词
DISPLAY; LIFE; TOOLKIT; GENE;
D O I
10.1186/1471-2105-11-24
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Background: Many bioinformatics analyses, ranging from gene clustering to phylogenetics, produce hierarchical trees as their main result. These are used to represent the relationships among different biological entities, thus facilitating their analysis and interpretation. A number of standalone programs are available that focus on tree visualization or that perform specific analyses on them. However, such applications are rarely suitable for large-scale surveys, in which a higher level of automation is required. Currently, many genome-wide analyses rely on tree-like data representation and hence there is a growing need for scalable tools to handle tree structures at large scale. Results: Here we present the Environment for Tree Exploration (ETE), a python programming toolkit that assists in the automated manipulation, analysis and visualization of hierarchical trees. ETE libraries provide a broad set of tree handling options as well as specific methods to analyze phylogenetic and clustering trees. Among other features, ETE allows for the independent analysis of tree partitions, has support for the extended newick format, provides an integrated node annotation system and permits to link trees to external data such as multiple sequence alignments or numerical arrays. In addition, ETE implements a number of built-in analytical tools, including phylogeny-based orthology prediction and cluster validation techniques. Finally, ETE's programmable tree drawing engine can be used to automate the graphical rendering of trees with customized node-specific visualizations. Conclusions: ETE provides a complete set of methods to manipulate tree data structures that extends current functionality in other bioinformatic toolkits of a more general purpose. ETE is free software and can be downloaded from http://ete.cgenomics.org.
引用
收藏
页数:7
相关论文
共 28 条
[1]
Bassi S, 2007, PLoS Comput Biol, V3, pe199
[2]
TreeDyn:: towards dynamic graphics and annotations for analyses of trees [J].
Chevenet, Francois ;
Brun, Christine ;
Banuls, Anne-Laure ;
Jacq, Bernard ;
Christen, Richard .
BMC BIOINFORMATICS, 2006, 7 (1)
[3]
Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics [J].
Cock, Peter J. A. ;
Antao, Tiago ;
Chang, Jeffrey T. ;
Chapman, Brad A. ;
Cox, Cymon J. ;
Dalke, Andrew ;
Friedberg, Iddo ;
Hamelryck, Thomas ;
Kauff, Frank ;
Wilczynski, Bartek ;
de Hoon, Michiel J. L. .
BIOINFORMATICS, 2009, 25 (11) :1422-1423
[4]
Dunn J.C., 1974, J CYBERNETICS, V3, P95, DOI [DOI 10.1080/01969727408546059, 10.1080/019697274085460590304.68093]
[5]
Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[6]
DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1970, 19 (02) :99-&
[7]
Large-scale assignment of orthology: back to phylogenetics? [J].
Gabaldon, Toni .
GENOME BIOLOGY, 2008, 9 (10) :235
[8]
PhylomeDB: a database for genome-wide collections of gene phylogenies [J].
Huerta-Cepas, Jaime ;
Bueno, Anibal ;
Dopazo, Joaquin ;
Gabaldon, Toni .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D491-D496
[9]
The human phylome [J].
Huerta-Cepas, Jaime ;
Dopazo, Hernan ;
Dopazo, Joaquin ;
Gabaldon, Toni .
GENOME BIOLOGY, 2007, 8 (06)
[10]
HUERTACEPAS J, 2009, INSECT MOL BIOL