Bio++: a set of C++ libraries for sequence analysis, phylogenetics, molecular evolution and population genetics

Cited by: 83
Authors
Dutheil, Julien [1]
Gaillard, Sylvain
Bazin, Eric
Glemin, Sylvain
Ranwez, Vincent
Galtier, Nicolas
Belkhir, Khalid
Affiliations
[1] Univ Montpellier 2, CNRS, UMR 5171, Genome Populat Interact Adaptat, F-34095 Montpellier 5, France
[2] Univ Montpellier 2, CNRS, UMR 5554, Inst Sci Evolut Montpellier, F-34095 Montpellier 5, France
Keywords
Bioinformatics Application; Ancestral State Reconstruction; Parsimony Score; Utilitary Function; Container Hierarchy;
DOI
10.1186/1471-2105-7-188
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Many bioinformatics applications in bio-sequence analysis, molecular evolution and population genetics share input/output methods, data storage requirements and data analysis algorithms. Such common features can be conveniently bundled into reusable libraries, which enable the rapid development of new methods and robust applications. Results: We present Bio++, a set of object-oriented libraries written in C++. Available components include classes for data storage and handling (nucleotide/amino-acid/codon sequences, trees, distance matrices, population genetics datasets), various input/output formats, basic sequence manipulation (concatenation, transcription, translation, etc.), phylogenetic analysis (maximum parsimony, Markov models, distance methods, likelihood computation and maximization), population genetics/genomics (diversity statistics, neutrality tests, various multi-locus analyses) and various algorithms for numerical calculus. Conclusion: The implementation of methods aims to be both efficient and user-friendly. Particular attention was given to the library design, to enable easy extension and the development of new methods. We defined a general hierarchy of classes that allows developers to implement their own algorithms while remaining compatible with the rest of the libraries. Bio++ source code is distributed free of charge under the CeCILL general public licence from its website, http://kimura.univ-montp2.fr/BioPP.
Pages: 6