Increased gene sampling yields robust support for higher-level clades within Bombycoidea (Lepidoptera)

被引:81
作者
Zwick, Andreas [1 ,2 ]
Regier, Jerome C. [2 ,3 ,4 ]
Mitter, Charles [3 ]
Cummings, Michael P. [5 ]
机构
[1] State Museum Nat Hist Stuttgart, D-70191 Stuttgart, Germany
[2] Univ Maryland, Inst Biotechnol, Ctr Biosyst Res, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Entomol, College Pk, MD 20742 USA
[4] Univ Maryland, Inst Biosci & Biotechnol Res, College Pk, MD 20742 USA
[5] Univ Maryland, Lab Mol Evolut, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
MAXIMUM-LIKELIHOOD; SILKWORM BOMBYX; DNA-SEQUENCES; MISSING DATA; PHYLOGENY; INFERENCE; ACCURACY; HISTORY; DESIGN; TAXA;
D O I
10.1111/j.1365-3113.2010.00543.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study has as its primary aim the robust resolution of higher-level relationships within the lepidopteran superfamily Bombycoidea. Our study builds on an earlier analysis of five genes (similar to 6.6 kbp) sequenced for 50 taxa from Bombycoidea and its sister group Lasiocampidae, plus representatives of other macrolepidoteran superfamilies. The earlier study failed to yield strong support for the monophyly of and basal splits within Bombycoidea, among others. Therefore, in an effort to increase support specifically for higher-level nodes, we generated 11.7 kbp of additional data from 20 genes for 24 of 50 bombycoid and lasiocampid taxa. The data from the genes are all derived from protein-coding nuclear genes previously used to resolve other lepidopteran relationships. With these additional data, all but a few higher-level nodes are strongly supported. Given our decision to minimize project costs by augmenting genes for only 24 of the 50 taxa, we explored whether the resulting pattern of missing data in the combined-gene matrix introduced a nonphylogenetic bias, a possibility reported by others. This was achieved by comparing node support values (i.e. nonparametric bootstrap values) based on likelihood and parsimony analyses of three datasets that differ in their number of taxa and level of missing data: 50 taxa/5 genes (dataset A), 50 taxa/25 genes (dataset B) and 24 taxa/25 genes (dataset C). Whereas datasets B and C provided similar results for common nodes, both frequently yielded higher node support relative to dataset A, arguing that: (i) more data yield increased node support and (ii) partial gene augmentation does not introduce an obvious nonphylogenetic bias. A comparison of single-gene bootstrap analyses identified four nodes for which one or two of the 25 genes provided modest to strong support for a grouping not recovered by the combined-gene result. As a summary proposal, two of these four groupings (one each within Bombycoidea and Lasiocampidae) were deemed sufficiently problematic to regard them as unresolved trichotomies. Since the alternative groupings were always highly localized on the tree, we did not judge a combined-gene analysis to present a problem outside those regions. Based on our robustly resolved results, we have revised the classification of Bombycoidea: the family Bombycidae is restricted to its nominate subfamily, and its tribe Epiini is elevated to subfamily rank (Epiinae stat.rev.), whereas the bombycid subfamily Phiditiinae is reinstated as a separate family (Phiditiidae stat.rev.). The bombycid subfamilies Oberthueriinae Kuznetzov & Stekolnikov, 1985, syn.nov. and Prismostictinae Forbes, 1955, syn.nov., and the family Mirinidae Kozlov, 1985, syn.nov. are established as subjective junior synonyms of Endromidae Boisduval, 1828. The family Anthelidae (Lasiocampoidea) is reincluded in the superfamily Bombycoidea.
引用
收藏
页码:31 / 43
页数:13
相关论文
共 33 条
[1]  
[Anonymous], 2006, GENETIC ALGORITHM AP
[2]  
Bazinet A.L., 2008, Distributed Grid Computing - Science Made Transparent for Everyone. Principles, Applications
[3]  
CHO S, 2010, DELIBERATELY U UNPUB
[4]  
Common I. F. B., 1966, Journal of the Entomological Society of Queensland, V5, P29
[5]  
Cummings M.P., 2005, Educause Review, V40, P116
[6]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[7]  
Goldsmith M.R., 2010, Molecular Biology and Genetics of the Lepidoptera
[8]  
Goldsmith M.R., 1995, MOL MODEL SYSTEMS LE
[9]   Leveraging skewed transcript abundance by RNA-Seq to increase the genomic depth of the tree of life [J].
Hittinger, Chris Todd ;
Johnston, Mark ;
Tossberg, John T. ;
Rokas, Antonis .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (04) :1476-1481
[10]  
LANAVE C, 1994, MOL BIOL EVOL, V20, P86