PyCogent: a toolkit for making sense from sequence

被引:141
作者
Knight, Rob [1 ]
Maxwell, Peter [2 ]
Birmingham, Amanda [1 ]
Carnes, Jason [4 ]
Caporaso, J. Gregory [5 ]
Easton, Brett C. [2 ]
Eaton, Michael [6 ]
Hamady, Micah [7 ]
Lindsay, Helen [2 ]
Liu, Zongzhi
Lozupone, Catherine
McDonald, Daniel [7 ]
Robeson, Michael [1 ,8 ]
Sammut, Raymond
Smit, Sandra [1 ]
Wakefield, Matthew J. [2 ,3 ,9 ]
Widmann, Jeremy [1 ]
Wikman, Shandy [1 ]
Wilson, Stephanie [7 ]
Ying, Hua
Huttley, Gavin A. [2 ]
机构
[1] Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
[2] Australian Natl Univ, John Curtin Sch Med Res, Computat Genom Lab, Canberra, ACT 2601, Australia
[3] Thermo Fisher Sci, Lafayette, CO USA
[4] Seattle Biomed Res Inst, Seattle, WA 98109 USA
[5] Univ Colorado, Hlth Sci Ctr, Dept Biochem & Mol Genet, Aurora, CO USA
[6] Sci Applicat Int Corp, Englewood, CO USA
[7] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[8] Univ Colorado, Dept Ecol & Evolut Biol, Boulder, CO 80309 USA
[9] Walter & Eliza Hall Inst Med Res, Melbourne, Vic 3050, Australia
关键词
D O I
10.1186/gb-2007-8-8-r171
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
We have implemented in Python the COmparative GENomic Toolkit, a fully integrated and thoroughly tested framework for novel probabilistic analyses of biological sequences, devising workflows, and generating publication quality graphics. PyCogent includes connectors to remote databases, built-in generalized probabilistic techniques for working with biological sequences, and controllers for third-party applications. The toolkit takes advantage of parallel architectures and runs on a range of hardware and operating systems, and is available under the general public license from http://sourceforge.net/projects/pycogent.
引用
收藏
页数:16
相关论文
共 48 条
  • [11] MUSCLE: multiple sequence alignment with high accuracy and high throughput
    Edgar, RC
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (05) : 1792 - 1797
  • [12] Crystal structure of the von Willebrand factor A1 domain and implications for the binding of platelet glycoprotein Ib
    Emsley, J
    Cruz, M
    Handin, R
    Liddington, R
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (17) : 10396 - 10401
  • [13] A hidden Markov Model approach to variation among sites in rate of evolution
    Felsenstein, J
    Churchill, GA
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) : 93 - 104
  • [14] Felsenstein J., PHYLIP PHYLOGENY INF
  • [15] Bioconductor: open software development for computational biology and bioinformatics
    Gentleman, RC
    Carey, VJ
    Bates, DM
    Bolstad, B
    Dettling, M
    Dudoit, S
    Ellis, B
    Gautier, L
    Ge, YC
    Gentry, J
    Hornik, K
    Hothorn, T
    Huber, W
    Iacus, S
    Irizarry, R
    Leisch, F
    Li, C
    Maechler, M
    Rossini, AJ
    Sawitzki, G
    Smith, C
    Smyth, G
    Tierney, L
    Yang, JYH
    Zhang, JH
    [J]. GENOME BIOLOGY, 2004, 5 (10)
  • [16] Mutation hotspots in the p53 gene in tumors of different origin: correlation with evolutionary conservation and signs of positive selection
    Glazko, GV
    Koonin, EV
    Rogozin, IB
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-GENE STRUCTURE AND EXPRESSION, 2004, 1679 (02): : 95 - 106
  • [17] GOFFE WL, 1994, J ECONOMETRICS, V99, P60
  • [18] GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
  • [19] Predicting the oncogenicity of missense mutations reported in the International Agency for Cancer Research (IARC) mutation database on p53
    Gorlov, IP
    Gorlova, OY
    Amos, CI
    [J]. HUMAN MUTATION, 2005, 26 (05) : 446 - 454
  • [20] Fast-Find: A novel computational approach to analyzing combinatorial motifs
    Hamady, M
    Peden, E
    Knight, R
    Singh, R
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)