An open source chimera checker for the fungal ITS region

Cited by: 63
Authors
Nilsson, R. H. [1, 2]
Abarenkov, Kessy [2]
Veldre, Vilmar [2]
Nylinder, Stephan [1]
De Wit, Pierre [3]
Brosche, Sara [1]
Alfredsson, Johan F. [4]
Ryberg, Martin [5]
Kristiansson, Erik [3, 6]
Affiliations
[1] Univ Gothenburg, Dept Plant & Environm Sci, S-40530 Gothenburg, Sweden
[2] Univ Tartu, Inst Ecol & Earth Sci, Dept Bot, EE-51005 Tartu, Estonia
[3] Univ Gothenburg, Dept Zool, S-40530 Gothenburg, Sweden
[4] Oepir Consulting, S-41137 Gothenburg, Sweden
[5] Univ Tennessee, Dept Ecol & Evolutionary Biol, Knoxville, TN 37996 USA
[6] Univ Gothenburg, Dept Neurosci & Physiol, Sahlgrenska Acad, S-40530 Gothenburg, Sweden
Keywords
chimeric sequences; environmental sampling; fungi; internal transcribed spacer; sequence alignment; kingdom Fungi; DNA; databases; genes; interface; GenBank; ecology
DOI
10.1111/j.1755-0998.2010.02850.x
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification codes
071010; 081704
Abstract
The internal transcribed spacer (ITS) region of the nuclear ribosomal repeat unit holds a central position in the pursuit of the taxonomic affiliation of fungi recovered through environmental sampling. Newly generated fungal ITS sequences are typically compared against the International Nucleotide Sequence Databases for a species or genus name using the sequence similarity software suite BLAST. Such searches are not without complications, however, and one of them is the presence of chimeric entries among the query or reference sequences. Chimeras are artificial sequences, generated unintentionally during the polymerase chain reaction step, that feature sequence data from two (or possibly more) distinct species. Available software solutions for chimera control do not readily target the fungal ITS region, but the present study introduces a BLAST-based open source software package (available at http://www.emerencia.org/chimerachecker.html) to examine newly generated fungal ITS sequences for the presence of potentially chimeric elements in batch mode. We used the software package on a random set of 12,300 environmental fungal ITS sequences in the public sequence databases and found 1.5% of the entries to be chimeric at the ordinal level after manual verification of the results. The proportion of chimeras in the sequence databases can be hypothesized to increase as emerging sequencing technologies drawing from pooled DNA samples are becoming important tools in molecular ecology research.
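The abstract does not spell out the detection procedure, so the following is only a minimal sketch of one way a BLAST-based, order-level chimera screen could work. It assumes that each query has already been split into two sub-regions (labelled ITS1 and ITS2 here), that each sub-region has been searched separately, and that the best hit carries an ordinal-level taxonomic label. The `Hit` record, the function names, and the 90% identity threshold are all hypothetical and are not taken from the published software.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Best BLAST hit for one sub-region of a query (hypothetical record)."""
    subject_id: str
    order: str               # taxonomic order of the matched reference
    percent_identity: float  # identity of the alignment, in percent


def is_putative_chimera(its1_hit: Hit, its2_hit: Hit,
                        min_identity: float = 90.0) -> bool:
    """Flag a query whose two sub-regions match different orders.

    Assumption: weak hits are ignored, because a low-identity match
    says little about the true origin of the sub-region.
    """
    if (its1_hit.percent_identity < min_identity
            or its2_hit.percent_identity < min_identity):
        return False
    return its1_hit.order != its2_hit.order


def screen_batch(hits_by_query: dict) -> list:
    """Return the IDs of queries flagged as putative chimeras.

    `hits_by_query` maps a query ID to an (ITS1 hit, ITS2 hit) pair.
    """
    return [query_id
            for query_id, (its1_hit, its2_hit) in hits_by_query.items()
            if is_putative_chimera(its1_hit, its2_hit)]
```

Anything such a screen flags is only a candidate: the abstract reports its 1.5% figure after manual verification of the results, so the output of an automated pass like this would still need to be checked by hand.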
Pages: 1076-1081
Page count: 6