Motivation: Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. Results: UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of much larger datasets.
机构:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Costello, Elizabeth K.
;
Lauber, Christian L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Lauber, Christian L.
;
Hamady, Micah
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Hamady, Micah
;
Fierer, Noah
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USA
Univ Colorado, Dept Ecol & Evolutionary Biol, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Fierer, Noah
;
Gordon, Jeffrey I.
论文数: 0引用数: 0
h-index: 0
机构:
Washington Univ, Sch Med, Ctr Genome Sci, St Louis, MO 63108 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Gordon, Jeffrey I.
;
Knight, Rob
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Howard Hughes Med Inst, Chevy Chase, MD USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
机构:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Costello, Elizabeth K.
;
Lauber, Christian L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Lauber, Christian L.
;
Hamady, Micah
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Hamady, Micah
;
Fierer, Noah
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Cooperat Inst Res Environm Sci, Boulder, CO 80309 USA
Univ Colorado, Dept Ecol & Evolutionary Biol, Boulder, CO 80309 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Fierer, Noah
;
Gordon, Jeffrey I.
论文数: 0引用数: 0
h-index: 0
机构:
Washington Univ, Sch Med, Ctr Genome Sci, St Louis, MO 63108 USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Gordon, Jeffrey I.
;
Knight, Rob
论文数: 0引用数: 0
h-index: 0
机构:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Howard Hughes Med Inst, Chevy Chase, MD USAUniv Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA