The Dawn of Open Access to Phylogenetic Data

Cited by: 35
Authors
Magee, Andrew F. [1]
May, Michael R. [1]
Moore, Brian R. [1]
Affiliations
[1] Univ Calif Davis, Dept Ecol & Evolut, Davis, CA 95616 USA
Source
PLOS ONE | 2014 / Volume 9 / Issue 10
Keywords
INFORMATION; AVAILABILITY; REUSE; TREE; NEED
DOI
10.1371/journal.pone.0110268
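Chinese Library Classification (CLC) Number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline Classification Code
07; 0710; 09
Abstract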
The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation minus extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for approximately 60% of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor; and (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.
Pages: 10