PyCogent: a toolkit for making sense from sequence

被引：141

作者：

Knight, Rob ^{[1
]}

Maxwell, Peter ^{[2
]}

Birmingham, Amanda ^{[1
]}

Carnes, Jason ^{[4
]}

Caporaso, J. Gregory ^{[5
]}

Easton, Brett C. ^{[2
]}

Eaton, Michael ^{[6
]}

Hamady, Micah ^{[7
]}

Lindsay, Helen ^{[2
]}

Liu, Zongzhi

Lozupone, Catherine

McDonald, Daniel ^{[7
]}

Robeson, Michael ^{[1
,8
]}

Sammut, Raymond

Smit, Sandra ^{[1
]}

Wakefield, Matthew J. ^{[2
,3
,9
]}

Widmann, Jeremy ^{[1
]}

Wikman, Shandy ^{[1
]}

Wilson, Stephanie ^{[7
]}

Ying, Hua

Huttley, Gavin A. ^{[2
]}

机构：

[1] Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA

[2] Australian Natl Univ, John Curtin Sch Med Res, Computat Genom Lab, Canberra, ACT 2601, Australia

[3] Thermo Fisher Sci, Lafayette, CO USA

[4] Seattle Biomed Res Inst, Seattle, WA 98109 USA

[5] Univ Colorado, Hlth Sci Ctr, Dept Biochem & Mol Genet, Aurora, CO USA

[6] Sci Applicat Int Corp, Englewood, CO USA

[7] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA

[8] Univ Colorado, Dept Ecol & Evolut Biol, Boulder, CO 80309 USA

[9] Walter & Eliza Hall Inst Med Res, Melbourne, Vic 3050, Australia

来源：

GENOME BIOLOGY | 2007年 / 8卷 / 08期

关键词：

D O I：

10.1186/gb-2007-8-8-r171

中图分类号：

Q81 [生物工程学（生物技术）]; Q93 [微生物学];

学科分类号：

071005 ; 0836 ; 090102 ; 100705 ;

摘要：

We have implemented in Python the COmparative GENomic Toolkit, a fully integrated and thoroughly tested framework for novel probabilistic analyses of biological sequences, devising workflows, and generating publication quality graphics. PyCogent includes connectors to remote databases, built-in generalized probabilistic techniques for working with biological sequences, and controllers for third-party applications. The toolkit takes advantage of parallel architectures and runs on a range of hardware and operating systems, and is available under the general public license from http://sourceforge.net/projects/pycogent.

引用

页数：16

共 48 条

[11] MUSCLE: multiple sequence alignment with high accuracy and high throughput
Edgar, RC
[J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (05) : 1792 - 1797
[12] Crystal structure of the von Willebrand factor A1 domain and implications for the binding of platelet glycoprotein Ib
Emsley, J
Cruz, M
Handin, R
Liddington, R
[J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (17) : 10396 - 10401
[13] A hidden Markov Model approach to variation among sites in rate of evolution
Felsenstein, J
Churchill, GA
[J]. MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) : 93 - 104
[14] Felsenstein J., PHYLIP PHYLOGENY INF
[15] Bioconductor: open software development for computational biology and bioinformatics
Gentleman, RC
Carey, VJ
Bates, DM
Bolstad, B
Dettling, M
Dudoit, S
Ellis, B
Gautier, L
Ge, YC
Gentry, J
Hornik, K
Hothorn, T
Huber, W
Iacus, S
Irizarry, R
Leisch, F
Li, C
Maechler, M
Rossini, AJ
Sawitzki, G
Smith, C
Smyth, G
Tierney, L
Yang, JYH
Zhang, JH
[J]. GENOME BIOLOGY, 2004, 5 (10)
[16] Mutation hotspots in the p53 gene in tumors of different origin: correlation with evolutionary conservation and signs of positive selection
Glazko, GV
Koonin, EV
Rogozin, IB
[J]. BIOCHIMICA ET BIOPHYSICA ACTA-GENE STRUCTURE AND EXPRESSION, 2004, 1679 (02): : 95 - 106
[17] GOFFE WL, 1994, J ECONOMETRICS, V99, P60
[18] GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[19] Predicting the oncogenicity of missense mutations reported in the International Agency for Cancer Research (IARC) mutation database on p53
Gorlov, IP
Gorlova, OY
Amos, CI
[J]. HUMAN MUTATION, 2005, 26 (05) : 446 - 454
[20] Fast-Find: A novel computational approach to analyzing combinatorial motifs
Hamady, M
Peden, E
Knight, R
Singh, R
[J]. BMC BIOINFORMATICS, 2006, 7 (1)

← 1 2 3 4 5 →